About Me

I am passionate about unraveling and mastering the complexities of diverse domains. My approach is holistic, integrating various disciplines to craft innovative and comprehensive solutions.

Currently, at d-Matrix, I am part of the architecture team, advancing the future of ultra-low-latency, sustainable, and commercially viable LLM inference in data centers through the world’s first efficient memory-compute integration. Learn more about me [here.]

Last Entries

Selected Work

Invited Talk

On-Device Computing: Rain AI’s Mission for Energy-Efficient AI Hardware; UCI IAP Workshop, 2024.

Watch

PhD Research

Guidelines navigating hardware-software trade-offs for next-gen edge devices; 2014–2021.

Overview

Undergrad Research Demos

Working closely with 40+ talented undergraduates, resulting in several publications and interactive demos.

Demos