Publications

2021

  • Quantifying the Design-Space Tradeoffs in Autonomous Drones [Paper] [Abstract] [ACM Artifact] [GitHub] [Drone Build Guide] [Short Video]
    Ramyad Hadidi, Bahar Asgari, Sam Jijina, Adriana Amyette, Nima Shoghi, Hyesoon Kim
    International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Virtual, USA (2021)
    Note: ACM Artifact requires a permanent storage for artifacts, that is why we have a Zenodo repository. For exploring the content of the repository, please use the GitHub link, as you can render PDF and readme files.

  • Efficiently Solving Partial Differential Equations in a Partially Reconfigurable Specialized Hardware [Paper]
    Bahar Asgari, Ramyad Hadidi, Tushar Krishna, Hyesoon Kim, Sudhakar Yalamanchili
    IEEE Transactions on Computers, (2021)

  • FAFNIR: Accelerating Sparse Gathering by Using Efficient Near-Memory Intelligent Reduction [Paper] [Short Video]
    Bahar Asgari, Ramyad Hadidi, Jiashen Cao, Da Eun Shim, Sung-Kyu Lim, Hyesoon Kim
    International Symposium on High-Performance Computer Architecture (HPCA), Seoul, South Korea (2021)

  • Context-Aware Task Handling in Resource-Constrained Robots with Virtualization [Paper]
    Ramyad Hadidi, Nima Shoghi, Bahar Asgari, Hyesoon Kim
    arXiv preprint arXiv:2104.04563(2021)

  • THIA: Accelerating Video Analytics using Early Inference and Fine-Grained Query Planning [Paper]
    Jiashen Cao, Ramyad Hadidi, Joy Arulraj, Hyesoon Kim
    arXiv preprint arXiv:2102.08481 (2021)

  • Creating Robust Deep Neural Networks With Coded Distributed Computing for IoT Systems [Paper]
    Ramyad Hadidi, Jiashen Cao, Hyesoon Kim
    arXiv preprint arXiv:2104.04447 (2021)

2020

  • Reducing Inference Latency with Concurrent Architectures for Image Recognition [Paper]
    Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim
    arXiv preprint arXiv:2011.07092 (2020)

  • LCP: A Low-Communication Parallelization Method for Fast Neural Network Inference in Image Recognition [Paper]
    Ramyad Hadidi, Bahar Asgari, Jiashen Cao, Younmin Bae, Da Eun Shim, Hyojong Kim, Sung-Kyu Lim, Michael S. Ryoo, Hyesoon Kim
    arXiv preprint arXiv:2003.06464 (2020)

  • Copernicus: Characterizing the Performance Implications of Compression Formats Used in Sparse Workloads [Paper]
    Bahar Asgari, Ramyad Hadidi, Joshua Dierberger, Charlotte Steinichen, Hyesoon Kim
    arXiv preprint arXiv:2011.10932 (2020)

  • Secure Location-Aware Authentication and Communication for Intelligent Transportation Systems [Paper]
    Nima Shoghi Ghalehshahi, Ramyad Hadidi, Lee Jaewon, Jun Chen, Arthur Siqueria, Rahul Rajan, Shaan Dhawan, Pooya Shoghi Ghalehshahi, Hyesoon Kim
    arXiv preprint arXiv:2011.07092 (2020)

  • MEISSA: Multiplying Matrices Efficiently in a Scalable Systolic Architecture [Paper] [Video]
    Bahar Asgari, Ramyad Hadidi, Hyesoon Kim
    IEEE International Conference on Computer Design (ICCD), Hartford, CT (2020)

  • Proposing a Fast and Scalable Systolic Array to Implement Matrix Multiplications [Slides] [Video]
    Bahar Asgari, Ramyad Hadidi, Hyesoon Kim
    IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM), Fayetteville, AR (2020)

  • Understanding the Software and Hardware Stacks of a General-Purpose Cognitive Drone [Poster] [Paper]
    Sam Jijina, Adriana Amyette, Nima Shoghi, Ramyad Hadidi, Hyesoon Kim
    IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Boston, MA (2020) [Mentored Undergraduate Work]

  • Towards Collaborative Inferencing of Deep Neural Networks on Internet of Things Devices [Paper] [Author’s Copy]
    Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim
    IEEE Internet of Things Journal (JIoT) (2020)

  • PISCES: Power-Aware Implementation of SLAM by Customizing Efficient Sparse Algebra [Paper] [Slides] [Video]
    Bahar Asgari, Ramyad Hadidi, Nima Shoghi, Hyesoon Kim
    2020 57th ACM/EDAC/IEEE Design Automation Conference (DAC), San Francisco, CA (2020)

  • Towards a General Purpose Cognitive Drone [Slides] [Paper]
    Sam Jijina, Adriana Amyette, Ramyad Hadidi, Hyesoon Kim
    The Fourth Workshop on Cognitive Architectures (CogArch 2020), co-located with HPCA 2020, San Diego, CA (2020)
    [Mentored Undergraduate Work]

  • Batch-Aware Unified Memory Management in GPUs for Irregular Workloads [Paper]
    Hyojong Kim, Jaewoong Sim, Prasun Gera, Ramyad Hadidi, Hyesoon Kim
    International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Lausanne, Switzerland (2020)

  • ALRESCHA: A Lightweight Reconfigurable Sparse-Computation Accelerator [Slides] [Paper]
    Bahar Asgari, Ramyad Hadidi, Tushar Krishna, Hyesoon Kim, Sudhakar Yalamanchili
    International Symposium on High-Performance Computer Architecture (HPCA), San Diego, CA (2020)

  • ASCELLA: Accelerating Sparse Computation by Enabling Stream Accesses to Memory [Paper] [Slides] [Video]
    Bahar Asgari, Ramyad Hadidi, Hyesoon Kim
    Design, Automation, and Test in Europe (DATE), Grenoble, France (2020)

2019

  • Characterizing the Deployment of Deep Neural Networks on Commercial Edge Devices [EdgeBench] [Slides] [Paper]
    Ramyad Hadidi, Jiashen Cao, Yilun Xie, Bahar Asgari, Tushar Krishna, Hyesoon Kim
    IEEE International Symposium on Workload Characterization (IISWC), Orlando, Florida (2019)
    [Best Paper Nominee]

  • ERIDANUS: Efficiently Running Inference of DNNs Using Systolic Arrays [IEEE Micro]
    Bahar Asgari, Ramyad Hadidi, Hyesoon Kim, Sudhakar Yalamanchili
    IEEE Micro, Special Issue on Machine Learning Acceleration (2019)

  • SLAM Performance on Embedded Robots [Paper]
    Nima Shoghi, Ramyad Hadidi, Hyesoon Kim
    Student Research Competition at Embedded System Week (SRC ESWEEK), New York, NY (2019)
    [Mentored Undergraduate Work] [ACM SRC Bronze Winner]

  • Enabling Speech to Text on Embedded Systems [Paper]
    Mohan Dodda, Taejoon Park, Sayuj Shajith, Ramyad Hadidi, Hyesoon Kim
    Student Research Competition at Embedded System Week (SRC ESWEEK), New York, NY (2019)
    [Mentored Undergraduate Work]

  • Video Analytics From Edge To Server: Work-in-Progress [Paper] [Slides]
    Jiashen Cao, Ramyad Hadidi, Joy Arulraj and Hyesoon Kim
    International Conference on Hardware/Software Codesign and System Synthesis CODES+ISSS (ESWEEK), New York, NY (2019)

  • Capella: Customizing Perception for Edge Devices by Efficiently Allocating FPGAs to DNNs [Demo Site] [Slides] [Paper]
    Younmin Bae, Ramyad Hadidi, Bahar Asgari, Jiashen Cao, Hyesoon Kim
    International Conference on Field-Programmable Logic and Applications (FPL), Demo, Barcelona, Spain (2019)
    [Mentored Undergraduate Work]

  • Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices [Slides] [Paper]
    Matthew Merck, Bingyao Wang, Lixing Liu, Chunjun Jia, Arthur Siqueira, Qiusen Huang, Abhijeet Saraha, Dongsuk Lim, Jiashen Cao, Ramyad Hadidi, Hyesoon Kim
    ACM Practice and Experience in Advanced Research Computing (PEARC), Chicago, IL (2019)
    [Mentored Undergraduate Work]

  • Understanding the Power Consumption of Executing Deep Neural Networks on a Distributed Robot System [Slides] [Paper]
    Ramyad Hadidi, Jiashen Cao, Matthew Merck, Arthur Siqueira, Qiusen Huang, Abhijeet Saraha, Chunjun Jia, Bingyao Wang, Dongsuk Lim, Lixing Liu and Hyesoon Kim
    Algorithms and Architectures for Learning in-the-Loop Systems in Autonomous Flight Workshop - Co-located with IEEE International Conference on Robotics and Automation (ICRA), Montreal, QC (2019)

  • Robustly Executing DNNs in IoT Systems Using Coded Distributed Computing [Paper] [Slides]
    Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim
    ACM/IEE Design Automation Conference (DAC) - Late Breaking Results, Las Vegas, NV (2019)

  • LODESTAR: Creating Locally-Dense CNNs for Efficient Inference on Systolic Arrays [Paper]
    Bahar Asgari, Ramyad Hadidi, Hyesoon Kim, and Sudhakar Yalamanchili
    ACM/IEE Design Automation Conference (DAC) - Late Breaking Results, Las Vegas, NV (2019)

  • An Edge-Centric Scalable Intelligent Framework To Collaboratively Execute DNN [Demo Site] [Paper]
    Jiashen Cao, Fei Wu, Ramyad Hadidi, Lixing Liu, Tushar Krishna, Michael S. Ryoo, Hyesoon Kim
    Conference on Systems and Machine Learning (SysML), Demo Palo Alto, CA (2019)

  • Thermal-Aware Processing-in-memory Instruction Offloading [Paper]
    Lifeng Nai, Ramyad Hadidi, He Xiao, Hyojong Kim, Jaewoong Sim, and Hyesoon Kim
    Journal of Parallel and Distributed Computing (JPDC), Elsevier (2019)

  • Collaborative Execution of Deep Neural Networks on Internet of Things Devices [Paper]
    Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim
    arXiv preprint arXiv:1901.02537 (2019)

2018

  • Distributed Perception by Collaborative Robots [Paper-(RA-L)] [Paper-(IROS)] [Spotlight] [Slides]
    Ramyad Hadidi, Jiashen Cao, Matthew Woodward, Michael S. Ryoo, Hyesoon Kim
    IEEE Robotics and Automation Letters (RA-L), Volume 3, Number 4, Pages 3709-3716, (October 2018), Acceptance Rate:<30%
    Invited for IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS’18), Madrid, Spain (2018)

  • CODA: Enabling Co-location of Computation and Data for Near-Data Processing [Paper]
    Hyojong Kim, Ramyad Hadidi, Lifeng Nai, Hyesoon Kim, Nuwan Jayasena, Yasuko Eckert, Onur Kayiran, Gabriel H. Loh
    ACM Transactions on Architecture and Code Optimization (TACO). Volume 15, Issue 3, Article 32 (September 2018), 23 pages (2018) Acceptance Rate: <30%

  • Real-Time Image Recognition Using Collaborative IoT Devices [Paper] [Slides]
    Ramyad Hadidi, Jiashen Cao, Matthew Woodward, Michael S. Ryoo, Hyesoon Kim
    1st Reproducible Tournament on Pareto-efficient Image Classification (ACM ReQuEST workshop), co-located with ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Williamsburg, VA, USA (2018)

  • Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices [Paper]
    Ramyad Hadidi, Jiashen Cao, Matthew Woodward, Michael S. Ryoo, Hyesoon Kim
    arXiv preprint arXiv:1802.02138 (2018)

  • Performance Implications of NoCs on 3D-Stacked Memories: Insights from the Hybrid Memory Cube [Paper] [Slides]
    Ramyad Hadidi, Bahar Asgari, Jeffrey Young, Burhan Ahmad Mudassar, Kartikay Garg, Tushar Krishna, Hyesoon Kim
    IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Belfast, Northern Ireland, UK

  • CoolPIM: Thermal-Aware Source Throttling for Efficient PIM Instruction Offloading [Paper] [Slides]
    Lifeng Nai, Ramyad Hadidi, He Xiao, Hyojong Kim, Jaewoong Sim, Hyesoon Kim.
    IEEE International Parallel & Distributed Processing Symposium (IPDPS’18), Vancouver, Canada (2018)

2017

  • CAIRO: A Compiler-Assisted Technique for Enabling Instruction-Level Offloading of Processing-In-Memory [Paper]
    Ramyad Hadidi, Lifeng Nai, Hyojong Kim, Hyesoon Kim
    ACM Transactions on Architecture and Code Optimization (TACO). Volume 14, Issue 4, Article 48 (December 2017), 25 pages (2017)

  • Demystifying the Characteristics of 3D-Stacked Memories: A Case Study for Hybrid Memory Cube [Paper] [Slides] [CRNCH Poster]
    Ramyad Hadidi, Bahar Asgari, Burhan Ahmad Mudassar, Saibal Mukhopadhyay, Sudhakar Yalamanchili, Hyesoon Kim
    IEEE International Symposium on Workload Characterization (IISWC), Seattle, Washington (2017)

  • GraphPIM: Enabling Instruction-Level PIM Offloading in Graph Computing Frameworks [Paper] [Slides] [Lightning]
    Lifeng Nai, Ramyad Hadidi, Jaewoong Sim, Hyojong Kim, Pranith Kumar, Hyesoon Kim
    International Symposium on High Performance Computer Architecture (HPCA), Austin, Texas (2017)

“It is one thing to stand on the comfortable ground of placid inaction and put forth words of cynical wisdom, and another to plunge into the work itself and through strenuous experience earn the right to express strong conclusions.” ― John D. Rockefeller`