Sai Mitheran Jagadesh Kumar

I am a recent graduate from Carnegie Mellon University, majoring in Electrical and Computer Engineering, specializing in AI/ML systems. I graduated from National Institute of Technology, Tiruchirappalli in 2022 as a Gold medalist in Electronics and Communication Engineering. I was affiliated with the Max Planck Institute of Informatics, Saarbrücken, funded by the DAAD-WISE Scholarship.


I'm a recipient of the Indian Academy of Sciences Research Fellowship, the prestigious Dr. A.L. Abdussattar Memorial Award, the Sri Janardhana Iyengar Memorial Award, and the Graphics Replicability Stamp Award. Outside of research, I'm open to anything to do with Sustainability and Mental Health. Scroll down to know more!

Linktree  /  Email  /  Personal Email  /  LinkedIn  /  GitHub

profile photo

News
  • 01/24: Joining Latent AI as an AI Application Engineer (L8)!
  • 12/23: Graduated from CMU with a Master's degree in ECE and 4.0 GPA!
  • 09/23: Joining AirLab to explore large-scale scene understanding!
  • 08/23: Teaching (assistant) 18-290 (Signals and Systems) once more at CMU!
  • 07/23: Serving as a Reviewer for the IEEE Transactions on Neural Networks and Learning Systems!
  • 06/23: Transforming Edge AI at Latent AI as an MLE intern for the summer!
  • 05/23: Awarded the Sri. Janardhana Iyengar Memorial Award at NIT Trichy for the best academic performance in 2022!
  • 02/23: Paper accepted at ICRA 2023!
  • 01/23: Teaching (assistant) 18-290, Signals and Systems at CMU!
  • 10/22: Paper accepted at IEEE Robotics and Automation Letters (RA-L)!
  • 09/22: Started a Research Assistantship at CyLab, CMU in the Biometrics team!
  • 08/22: Teaching (assistant) 18-794, Pattern Recognition Theory at CMU!
  • 07/22: Paper accepted at Optik!
  • 05/22: Paper accepted at ICML 2022!
  • 03/22: Accepted to Carnegie Mellon University as a full-time grad student!
  • 01/22: Paper accepted at ICRA 2022!
  • 01/22: Selected for Research Week with Google Research, 2022
  • 12/21: Check out my Linktree!
  • 12/21: Paper accepted at AAAI 2022!
  • 11/21: Received an honorary mention and award for the AI For Good Challenge.

Research and Experience

I collaborated with researchers at the Medical Mechatronics Lab, National University of Singapore, as a Research Assistant to work on Graph-based Deep Reasoning and Surgical Scene Understanding. As a Deep Learning Engineer at AIMonk Labs Pvt. Ltd. I worked in a team of five to build Neuralmarker, transforming businesses with Computer Vision. I received a recommendation from Foxconn Country Head, Josh Foulger, to work with their Intelligent Systems Team on prototyping.

Previously, I was affiliated with the Advanced Geometric Computing Lab and the Shakti Group, Indian Institute of Technology, Madras. I was supervised by Dr. M. Ramanathan and Dr. V. Kamakoti, the Director of IIT-M. I've worked on Deep Generative Modelling of Real-time Wireless Communication Channels using UAVs, with Dr. E.S Gopi, Pattern Recognition Lab NIT Trichy, and Dr. Nalin Jayakody, the Director of the Tomsk Infocomm Lab.

Now that you've kept reading till here, check out the work done by my amazing research group at The Learning Machines.

vit Compressing Vision Transformers for Low-Resource Visual Learning
Youn, Eric and Mitheran, Sai and Prabhu, Sanjana and Chen, Siyuan
arXiv / Paper

Our work introduces a framework for compressing Vision Transformer models for efficient segmentation, with a focus on enabling deployment on resource-constrained devices like the NVIDIA Jetson Nano (4GB). Our approach combines structured pruning, distillation from a stronger teacher, and quantization strategies to significantly reduce memory usage and inference latency while maintaining high segmentation accuracy and mean IoU. This allows for the rapid deployment of Vision Transformers on the edge.

rethink Rethinking Feature Extraction: Gradient-based Localized Feature Extraction for End-to-End Surgical Downstream Tasks
Pang, Winnie and Islam, Mobarakol and Mitheran, Sai and Seenivasan, Lalithkumar and Xu, Mengya and Ren, Hongliang
ICRA, 2023 and IEEE RA-L
Code / Paper

This work develops a detector-free gradient-based localized feature extraction approach that enables end-to-end model training for downstream surgical tasks such as report generation and tool-tissue interaction graph prediction. We eliminate the need for object detection or region proposal and feature extraction networks by extracting the features of interest from the discriminative regions in the feature map of the classification models. Here, the discriminative regions are localized using gradient-based localization techniques (e.g. Grad-CAM). We show that our proposed approaches enable the real-time deployment of end-to-end models for surgical downstream tasks.

dehaze Rich Feature Distillation with Feature Affinity Module for Efficient Image Dehazing
Mitheran, Sai and Suresh, Anushri and P Gopi, Varun
Optik, Elsevier
Code / arXiv / Paper

This work introduces a simple, lightweight, and efficient framework for single-image haze removal, exploiting rich “dark-knowledge" information from a lightweight pre-trained super-resolution model via the notion of heterogeneous knowledge distillation.

lth Not All Lotteries Are Made Equal
Sahu, Surya Kant and Mitheran, Sai and Mahapatra, Ritul
ICML, 2022 (HAET Workshop)
arXiv / Paper

The Lottery Ticket Hypothesis (LTH) states that for a reasonably sized neural network, there exists a subnetwork within the same network that, when trained from the same initialization, yields no less performance than the dense counterpart. We investigate the effect of model size and the ease of finding winning tickets. Through this work, we show that winning tickets is in-fact, easier to find for smaller models.

gr_mtl Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding
Seenivasan, Lalithkumar* and Mitheran, Sai* and Islam, Mobarakol and Ren, Hongliang
ICRA, 2022 and IEEE RA-L [SOTA, Endovis18]
Code / arXiv / Paper

This paper introduces a globally-reasoned multi-task surgical scene understanding model capable of performing instrument segmentation and tool-tissue interaction detection.

audiomer Audiomer: A Convolutional Transformer for Keyword Spotting
Sahu, Surya Kant and Mitheran, Sai and Kamdar, Juhi and Gandhi, Meet
AAAI, 2022 (DSTC10 Workshop) [SOTA, Keyword Spotting]
Code / Paper

In this work, we introduce an architecture, Audiomer, where we combine 1D Residual Networks with Performer Attention to achieve state-of-the-art performance in Keyword Spotting with raw audio waveforms, out-performing all previous methods while also being computationally cheaper and parameter-efficient.

SBR Introducing Self-Attention to Target Attentive Graph Neural Networks
Mitheran, Sai and Java, Abhinav and Sahu, Surya Kant and Shaikh, Arshad
AISP, 2022
Paper / Code / arXiv

We propose using a Transformer in combination with a target attentive GNN, which allows richer Representation Learning. We outperform the existing methods on real-world benchmark datasets.

gr_mtl User-Friendly Waveguide Modes Visualiser
Mitheran, Sai and T N, Ram and S, Raghavan
IEEE Microwave Magazine 2022, Microwaves 101, Recent Trends on Metamaterial Antennas for Wireless Applications and Deep Learning Techniques, 2021
Paper / Application / Microwaves 101

This article presents the procedure and results of a web application made to visualize field lines of Electric and Magnetic waves inside a waveguide. We propose a first-of-the-kind Graphical User Interface for waveguide visualization as a public resource.

hand_drawn 'CADSketchNet' - An Annotated Sketch dataset for 3D CAD Model Retrieval with Deep Neural Networks
Manda, Bharadwaj and Dhayarkar, Shubham and Mitheran, Sai and V.K, Viekash and Muthuganapathy, Ramanathan
3DOR, 2021 and Computers & Graphics Journal
Project Page / Paper

We introduce the CADSketchNet dataset, an annotated collection of sketches of 3D CAD models, which is intended to enhance the research on developing AI-enabled search engines for 3D CAD models. We also evaluate the performance of various retrieval systems. Many experimental models are constructed and tested on CADSketchNet.


Service
Student Volunteer, ICLR 2021
icml Student Volunteer, ICML 2021

Affiliations (Upto Date)
               
               

Volunteering and Initiatives