Science and Innovation: Numerical Algorithms and Intelligent Software for the Evolving HPC Platform

  2014

  • Fence Scoping

    Lin, C., Nagarajan, V. & Gupta, R., 16 Nov 2014, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis: New Orleans, Louisiana. Institute of Electrical and Electronics Engineers, p. 105-116 12 p. (SC '14).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Increasing Cache Capacity via Critical-words-Only Cache

    Huang, C-C. & Nagarajan, V., 19 Oct 2014, Computer Design (ICCD), 2014 32nd IEEE International Conference on. Institute of Electrical and Electronics Engineers, p. 125-132 8 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • ATCache: Reducing DRAM-cache Latency via a Small SRAM Tag Cache

    Huang, C-C. & Nagarajan, V., Aug 2014, PACT '14 Proceedings of the 23rd international conference on Parallel architectures and compilation. ACM, p. 51-60 10 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Extending the generalized Fermat prime search beyond one million digits

    Bethune, I. & Goetz, M., 6 May 2014, 10th International Conference, PPAM 2013, Warsaw, Poland, September 8-11, 2013, Revised Selected Papers, Part I. Wyrzykowski, R., Dongarra, J., Karczewski, K. & Waśniewski, J. (eds.). Springer, p. 106-113 20 p. (Lecture Notes in Computer Science; vol. 8384).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • TSO-CC: Consistency directed cache coherence for TSO

    Elver, M. & Nagarajan, V., Feb 2014, The International Symposium on High-Performance Computer Architecture: Orlando, Florida. 12 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  2013

  • Address-aware Fences

    Lin, C., Nagarajan, V. & Gupta, R., 2013, Proceedings of the 27th International ACM Conference on International Conference on Supercomputing. New York, NY, USA: ACM, p. 313-324 12 p. (ICS '13).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Fast RMWs for TSO: semantics and implementation

    Rajaram, B., Nagarajan, V., Sarkar, S. & Elver, M., 2013, Proceedings of the 34th ACM SIGPLAN conference on Programming language design and implementation. ACM, p. 61-72 12 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Mini-Batch Primal and Dual Methods for SVMs

    Takac, M., Bijral, A., Srebro, N. & Richtarik, P., 2013, JMLR Workshop and Conference Proceedings: Proceedings of the 30th International Conference on Machine Learning. 3 ed. Vol. 28. p. 1022-1030

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  2012

  • Autotuning Wavefront Abstractions for Heterogenous Architectures

    Cole, M. & Mohanty, S., 2012, Applications for Multi-Core Architectures (WAMCA), 2012 Third Workshop on. Institute of Electrical and Electronics Engineers, p. 42-47 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Efficient sequential consistency via conflict ordering

    Lin, C., Nagarajan, V., Gupta, R. & Rajaram, B., 2012, Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems. New York, NY, USA: ACM, p. 273-286 14 p. (ASPLOS '12).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • MaSiF: Machine Learning Guided Auto-tuning of Parallel Skeletons

    Collins, A., Fensch, C. & Leather, H., 2012, Proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques. New York, NY, USA: ACM, p. 437-438 2 p. (PACT '12).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • SuperCoP: A General, Correct, and Performance-efficient Supervised Memory System

    Rajaram, B., Nagarajan, V., McPherson, A. J. & Cintra, M., 2012, Proceedings of the 9th Conference on Computing Frontiers. New York, NY, USA: ACM, p. 85-94 10 p. (CF '12).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  2010

  • Efficient sequential consistency using conditional fences

    Lin, C., Nagarajan, V. & Gupta, R., 2010, Proceedings of the 19th international conference on Parallel Architectures And Compilation Techniques (PACT '10). New York, NY, USA: ACM, p. 295-306 12 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Partitioning streaming parallelism for multi-cores: a machine learning based approach

    Wang, Z. & O'Boyle, M. F. P., 2010, Proceedings of the 19th international conference on Parallel Architectures And Compilation Techniques (PACT '10). New York, NY, USA: ACM, p. 307-318 12 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution