Галерея 2742854

Галерея 2742854
20% DISCOUNT - COUPON CODE "DELIOFFER" - Shop Now!

© Delinutshop 2022. All Rights Reserved
IFE House, Mundakkal West, Kollam 691001, Kerala, India.
Get all the latest information on Sales and Offers. Sign up for the newsletter today.

Sign in

Register

Genesis: a language for generating synthetic training programs for machine learning
Published: 06 May 2015 Publication History
CF '15: Proceedings of the 12th ACM International Conference on Computing Frontiers
CF '15
Paper Acceptance Rate 33 of 96 submissions, 34% Overall Acceptance Rate 186 of 501 submissions, 37%
Funding Sources Qualcomm Natural Sciences and Engineering Research Council of Canada
J. Bergstra, N. Pinto, and D. Cox. Machine learning for predictive auto-tuning with boosted regression trees. In Proc. of InPar , pages 1--9, 2012. Google Scholar Cross Ref C. Bienia. "Benchmarking Modern Multiprocessors" . Ph.D. dissertation, Princeton University, 2011. Google Scholar Digital Library S. Caine and E. Gordon. PDL: A tool for software design. In Proc. of National Computer Conf. and Expo , pages 271--276, 1975. Google Scholar Digital Library CodeSmith Tools, LLC. CodeSmith Generator. http://www.codesmithtools.com/product/generator. Google Scholar A. Collins, C. Fensch, and H. Leather. MaSiF: machine learning guided auto-tuning of parallel skeletons. In Proc. of PACT , pages 437--438, 2012. Google Scholar Digital Library cTuning.org. Static Features available in MILEPOST GCC V2.1. "http://ctuning.org/wiki/index.php/CTools:MilepostGCC:StaticFeatures:MILEPOST_V2.1". Google Scholar G. Fursin et al. MILEPOST GCC: machine learning based research compiler. In GCC Summit , 2008. Google Scholar A. Ganapathi et al. A case for machine learning to optimize multicore performance. In Proc. of HotPar , pages 1--6, 2009. Google Scholar Digital Library D. Grewe, Z. Wang, and M. O'Boyle. Portable mapping of data parallel programs to OpenCL for heterogeneous system. In Proc. of CGO , pages 1--10, 2013. Google Scholar Digital Library T. D. Han and T. S. Abdelrahman. Automatic tuning of local memory use on GPGPUs. In Proc. of ADAPT , 2015. Google Scholar X. Leroy et al. The objective Caml system release 3.12. "http://caml.inria.fr/pub/distrib/ocaml-3.12/ocaml-3.12-refman.pdf", 2010. Google Scholar A. Markus. Generating test programs with TestMake. In Proc. of European Tcl/Tk User Meeting , pages 127--138, 2001. Google Scholar M. Mohri. Foundations of machine learning . MIT Press, Cambridge, MA, 2012. Google Scholar Digital Library R. Plackett. Karl Pearson and the chi-squared test. International Statistical Review , pages 59--72, 1983. Google Scholar Cross Ref J. Schimmel et al. Automatic generation of parallel unit tests. In Proc. of AST , pages 40--46, 2013. Google Scholar Digital Library G. Tournavitis et al. Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based mapping. In Proc. of PLDI , pages 177--187, 2009. Google Scholar Digital Library W. Turski. Software engineering--some principles and problems. In Programming Methodology by David Gries , pages 29--36. Springer Verlag, New York, 1978. Google Scholar Cross Ref Y. Voronenko, F. de Mesmay, and M. Püschel. Computer generation of general size linear transform libraries. In Proc. of CGO , pages 102--113, 2009. Google Scholar Digital Library Z. Wang and M. O'Boyle. Partitioning streaming parallelism for multi-cores: a machine learning based approach. In Proc. of PACT , pages 307--318, 2010. Google Scholar Digital Library S. Woo et al. The SPLASH-2 programs: Characterization and methodological considerations. In Proc. of ISCA , pages 24--36, 1995. Google Scholar Digital Library X. Yang et al. Finding and understanding bugs in C compilers. In Proc. of PLDI , pages 283--294, 2011. Google Scholar Digital Library
Browse All Return Change zoom level
Close modal New Citation Alert added!

Connect

Contact
Facebook
Twitter
Linkedin

Feedback
Bug Report

The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2023 ACM, Inc.
If you 'd like us to contact you regarding your feedback, please provide your contact details here.
We describe Genesis, a language for the generation of synthetic programs for use in machine learning-based performance auto-tuning. The language allows users to annotate a template program to customize its code using statistical distributions and to generate program instances based on those distributions. This effectively allows users to generate training programs whose characteristics or features vary in a statistically controlled fashion. We describe the language constructs, a prototype preprocessor for the language, and three case studies that show the ability of Genesis to express a range of training programs in different domains. We evaluate the preprocessor's performance and the statistical quality of the samples it generates. We believe that Genesis is a useful tool for generating large and diverse sets of programs, a necessary component when training machine learning models for auto-tuning.
Check if you have access through your login credentials or your institution to get full access on this article.
Istituto di Calcolo e Reti ad Alte Prestazioni, CNR, ITALY
Institute for Computing Technology, Chinese Academy of Sciences, PRC
Association for Computing Machinery
Request permissions about this article.
View this article in digital edition.
https://dl.acm.org/doi/10.1145/2742854.2742883
This alert has been successfully added and will be sent to:
You will be notified whenever a record that you have chosen has been cited.
To manage your alert preferences, click on the button below.
We use cookies to ensure that we give you the best experience on our website.

Sign in

Register

Scaling application properties to exascale
Published: 06 May 2015 Publication History
CF '15: Proceedings of the 12th ACM International Conference on Computing Frontiers
CF '15
Paper Acceptance Rate 33 of 96 submissions, 34% Overall Acceptance Rate 186 of 501 submissions, 37%
Funding Sources Netherlands Organisation for Scientific Research (NWO) Dutch Ministry of EL&I Province of Drenthe
The LLVM compiler infrastructure project. http://www.llvm.org/. Google Scholar SPEC CPU benchmarks. http://www.spec.org/benchmarks.html. Google Scholar Mathematica 10, 2014. http://www.wolfram.com/mathematica/. Google Scholar T. Agerwala. Exascale computing: The challenges and opportunities in the next decade. In 2010 IEEE 16th International Symposium on High Performance Computer Architecture (HPCA) , pages 1--1, Jan 2010. Google Scholar Digital Library A. Almeida, M. Castel-Branco, and A. Falcao. Linear regression for calibration lines revisited: Weighting schemes for bioanalytical methods. Journal of Chromatography B , 774(2): 215--222, 2002. Google Scholar Cross Ref K. Amunts, A. Lindner, and K. Zilles. The human brain project: Neuroscience perspectives and german contributions. e-Neuroforum , 5(2): 43--50, 2014. Google Scholar A. Anghel, L. M. Vasilescu, R. Jongerius, G. Dittmann, and G. Mariani. An instrumentation approach for hardware-agnostic software characterization. In Proceedings of the 12th ACM Conference on Computing Frontiers , CF '15, New York, NY, USA, 2015. ACM. Google Scholar Digital Library M. B. Breugh, S. Eyerman, and L. Eeckhout. Mechanistic analytical modeling of superscalar in-order processor performance. ACM Trans. Archit. Code Optim. , 11(4): 50:1--50:26, Jan. 2015. Google Scholar Digital Library A. Calotoiu, T. Hoefler, M. Poke, and F. Wolf. Using automated performance modeling to find scalability bugs in complex codes. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis , SC '13, pages 45:1--45:12, New York, NY, USA, 2013. ACM. Google Scholar Digital Library E. Chung, P. Milder, J. Hoe, and K. Mai. Single-chip heterogeneous computing: Does the future include custom logic, FPGAs, and GPGPUs? In 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) , pages 225--236, 2010. Google Scholar Digital Library H. Cook and K. Skadron. Predictive design space exploration using genetically programmed response surfaces. In Proceedings of the 45th Annual Design Automation Conference , DAC '08, pages 960--965, New York, NY, USA, 2008. ACM. Google Scholar Digital Library M. Drozdowski and L. Wielebski. Isoefficiency maps for divisible computations. IEEE Transactions on Parallel and Distributed Systems , 21(6): 872--880, June 2010. Google Scholar Digital Library S. Eyerman, L. Eeckhout, T. Karkhanis, and J. E. Smith. A mechanistic performance model for superscalar out-of-order processors. ACM Trans. Comput. Syst. , 27(2): 3:1--3:37, May 2009. Google Scholar Digital Library E. Gayawan and R. A. Ipinyomi. A comparison of Akaike, Schwarz and R square criteria for model selection using some fertility models. Australian Journal of Basic and Applied Sciences , 3(4): 3524--3530, 2009. Google Scholar I. Gluhovsky. Determining output uncertainty of computer system models. Performance Evaluation , 64(2): 103--125, Feb 2007. Google Scholar Digital Library I. Gluhovsky, D. Vengerov, and B. O'Krafka. Comprehensive multivariate extrapolation modeling of multiprocessor cache miss rates. ACM Transactions on Computer Systems (TOCS) , 25(1), Feb 2007. Google Scholar Digital Library Q. Guo, T. Chen, Y. Chen, L. Li, and W. Hu. Microarchitectural design space exploration made fast. Microprocessors and Microsystems , 37(1): 41--51, 2013. Google Scholar Digital Library J. L. Hennessy and D. A. Patterson. Computer Architecture: A Quantitative Approach . Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 3 edition, 2003. Google Scholar Digital Library F. Hutter, L. Xu, H. H. Hoos, and K. Leyton-Brown. Algorithm runtime prediction: Methods & evaluation. Artif. Intell. , 206: 79--111, Jan. 2014. Google Scholar Digital Library R. Jongerius, S. Wijnholds, R. Nijboer, and H. Corporaal. An end-to-end computing model for the square kilometre array. Computer , 47(9): 48--54, Sept 2014. Google Scholar Digital Library B. Li, L. Peng, and B. Ramadass. Accurate and efficient processor performance prediction via regression tree based modeling. J. Syst. Archit. , 55(10--12): 457--467, Oct. 2009. Google Scholar Digital Library G. Mariani, R. Meeuws, G. Palermo, V.-M. Sima, C. Silvano, and K. Bertels. DRuiD: Designing reconfigurable architectures with decision-making support. In 19th Asia and South Pacific Design Automation Conference (ASP-DAC) , Singapore, 01/2014 2014. Google Scholar Cross Ref G. Mariani, G. Palermo, V. Zaccaria, and C. Silvano. OSCAR: An optimization methodology exploiting spatial correlation in multicore design spaces. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems , 31(5): 740--753, 2012. Google Scholar Digital Library G. Marin and J. Mellor-Crummey. Cross-architecture performance predictions for scientific applications using parameterized models. In Proceedings of the Joint International Conference on Measurement and Modeling of Computer Systems , SIGMETRICS '04/Performance '04, pages 2--13, New York, NY, USA, 2004. ACM. Google Scholar Digital Library D. Montgomery. Design and Analysis of Experiments, 8th Edition . John Wiley & Sons, Incorporated, 2012. Google Scholar C. Nugteren, G.-J. van den Braak, and H. Corporaal. Future of GPGPU micro-architectural parameters. In Proceedings of the Conference on Design, Automation and Test in Europe , DATE '13, pages 392--395, San Jose, CA, USA, 2013. EDA Consortium. Google Scholar Digital Library I. Sharapov, R. Kroeger, G. Delamarter, R. Cheveresan, and M. Ramsay. A case study in top-down performance estimation for a large-scale parallel application. In Proceedings of the Eleventh ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , PPoPP '06, pages 81--89, New York, NY, USA, 2006. ACM. Google Scholar Digital Library M. Sipser. Introduction to the Theory of Computation . Thomson Course Technology, 2006. Google Scholar Digital Library S. Song, C.-Y. Su, R. Ge, A. Vishnu, and K. Cameron. Iso-energy-efficiency: An approach to power-constrained parallel computation. In 2011 IEEE International Parallel Distributed Processing Symposium (IPDPS) , pages 128--139, May 2011. Google Scholar Digital Library K. Ueno and T. Suzumura. Highly scalable graph search for the Graph500 benchmark. In Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing , HPDC '12, pages 149--160, New York, NY, USA, 2012. ACM. Google Scholar Digital Library E. Vermij, L. Fiorin, R. Jongerius, C. Hagleitner, and K. Bertels. Challenges in exascale radio astronomy: Can the SKA ride the technology wave? International Journal of High Performance Computing Applications , 29: 37--50, February 2015. Google Scholar Digital Library S. Vormwald, W. Wang, S. Carr, S. Seidel, and Z. Wang. Predicting remote reuse distance patterns in UPC applications. In Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model , PGAS '10, pages 1:1--1:4, New York, NY, USA, 2010. ACM. Google Scholar Digital Library S.-C. Wang. Artificial neural network. In Interdisciplinary Computing in Java Programming , volume 743 of The Springer International Series in Engineering and Computer Science , pages 81--100. Springer US, 2003. Google Scholar Cross Ref Z. Zhang and B. Xiaofeng. Comparison about the three central composite designs with simulation. In International Conference on Advanced Computer Control. ICACC '09 , pages 163--167, Jan 2009. Google Scholar Digital Library
Browse All Return Change zoom level
Close modal New Citation Alert added!

Connect

Contact
Facebook
Twitter
Linkedin

Feedback
Bug Report

The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2023 ACM, Inc.
If you 'd like us to contact you regarding your feedback, please provide your contact details here.
Exascale computing systems will execute computationally intensive tasks on unprecedented amounts of data. Tuning the design of such systems for a specific application or for an application domain is a challenging task as it is not yet possible to analyze the actual run-time behavior of exascale applications. Run-time properties, such as the memory access pattern, the available instruction-level parallelism and the instruction mix, are valuable information for architects to tune the processing elements, the memory system and the communication infrastructure.
We propose a methodology for extrapolating application properties at exascale from an analysis of workload sizes feasible on current systems. The methodology is suitable for applications scaling over different parameters (e.g., the number of vertices and edges represent two parameters in a graph algorithm). The proposed methodology combines a) a statistically sound approach for model selection and b) knowledge coming from computational theory, such as the order of complexity of the application under analysis. Compared with state-of-the-art techniques, the proposed methodology reduces the prediction error by an order of magnitude on the instruction count and improves the accuracy of memory access pattern prediction by up to 1.3×.
Check if you have access through your login credentials or your institution to get full access on this article.
Istituto di Calcolo e Reti ad Alte Prestazioni, CNR, ITALY
Institute for Computing Technology, Chinese Academy of Sciences, PRC
Association for Computing Machinery
Request permissions about this article.
View this article in digital edition.
https://dl.acm.org/doi/10.1145/2742854.2742860
This alert has been successfully added and will be sent to:
You will be notified whenever a record that you have chosen has been cited.
To manage your alert preferences, click on the button below.
We use cookies to ensure that we give you the best experience on our website.

Sign in

Register

Enhanced GPU-based distributed breadth first search
Published: 06 May 2015 Publication History
CF '15: Proceedings of the 12th ACM International Conference on Computing Frontiers
CF '15
Paper Acceptance Rate 33 of 96 submissions, 34% Overall Acceptance Rate 186 of 501 submissions, 37%
V. Agarwal, F. Petrini, D. Pasetto, and D. Bader. Scalable graph exploration on multicore processors. In High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for , pages 1--11, nov. 2010. Google Scholar Digital Library D. A. Bader and K. Madduri. Designing multithreaded algorithms for breadth-first search and st-connectivity on the cray mta-2. 2012 41st International Conference on Parallel Processing , 0: 523--530, 2006. Google Scholar Digital Library S. Beamer, K. Asanović, and D. Patterson. Direction-optimizing breadth-first search. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis , SC '12, pages 12:1--12:10, Los Alamitos, CA, USA, 2012. IEEE Computer Society Press. Google Scholar Digital Library S. Beamer, A. Buluc, K. Asanović, and D. A. Patterson. Distributed memory breadth-first search revisited: Enabling bottom-up search. Technical Report UCB/EECS-2013-2, EECS Department, University of California, Berkeley, Jan 2013. Google Scholar M. Bernaschi, G. Carbone, M. Fatica, E. Mastrostefano, and D. Rossetti. Breadth first search on multiple gpus. In GPU Technology Conference 2013 . NVIDIA, 2013. Google Scholar M. Bernaschi and E. Mastrostefano. Efficient breadth first search on multi-gpu systems. Journal of Parallel and Distributed Computing , 73: 1292--1305, 2013. Google Scholar Digital Library M. Bisson, M. Bernaschi, and E. Mastrostefano. Parallel distributed breadth first search on the kepler architecture. arXiv preprint arXiv:1408.1605 , 2014. Google Scholar A. Buluc and K. Madduri. Parallel breadth-first search on distributed memory systems. SC '11 Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis , 2011. Google Scholar Digital Library F. Checconi and F. Petrini. Traversing trillions of edges in real time: Graph exploration on large-scale parallel machines. In Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium , IPDPS '14, pages 425--434, Washington, DC, USA, 2014. IEEE Computer Society. Google Scholar Digital Library F. Checconi, F. Petrini, J. Willcock, A. Lumsdaine, A. R. Choudhury, and Y. Sabharwal. Breaking the speed and scalability barriers for graph exploration on distributed-memory machines. In High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for , pages 1--12. IEEE, 2012. Google Scholar Digital Library A. G. Duane Merrill, Michael Garland. High performance and scalable GPU graph traversal. Tec
Двух зрелых шлюх из Европы проебли мужчины толпой
Молодые подружки пытаются впихнуть в себя не впихуемое
Темнокожие беременные толстушки показывают свои тела полностью освобождаясь от одежды

Галерея 2742854

Report Page