Aditya Dewan

Aditya Dewan

(647) 408-6446 | adewan2@andrew.cmu.edu | LinkedIn | GitHub

About

Aspiring deep learning researcher working towards mastery in the field. Interested in alternative transformer architectures, statistical analysis of financial markets in quantitative trading, and loss-landscape-based knowledge distillation.

Education

B.S. Computer Science, Machine Learning Concentration Carnegie Mellon University
May 2027
Pittsburgh, PA

Courses: Deep Learning (Ph.D.), Discrete Math, Computer Systems, Functional Programming, Math Finance

Clubs: Carnegie Mellon Racing (Built SVMs from scratch, 2x speedup for GPU midline generation), Quant Club (Goldman Sachs Quantathon)

Built Malloc (Dynamic Memory Allocator) and Fully Concurrent, thread-safe file system in C from scratch

Experience

Machine Learning Summer Research Intern Goomba Lab (Mamba Architecture), CMU
May 2025 – Aug. 2025
Pittsburgh, PA
  • Implemented novel skip-connection-based Mamba model to remove multi-head-attention-based retrieval performance decline in SSM architectures
  • Analyzed State Space Models and Mamba-2 bottlenecks on knowledge retrieval benchmarks (MMLU)
Machine Learning Research Intern CMU — Dr. David Touretzky
Jul. 2023 – Aug. 2024
Pittsburgh, PA
  • Engineered React web-app to simulate textual Markov Chain models for 900+ professionals; used in Concord University study (demo)
  • Designed efficient algorithm for automated graph generation and n-gram estimation for low latency on large text datasets
Award-winning Neural Network Compression Algorithm Regeneron ISEF
Aug. 2022 – Aug. 2024
Dallas, TX
  • Devised novel compression algorithm yielding 2-24% more accuracy in 10% training time; awarded by NSA, ISEF, WAICY
  • Leveraged high-descent-potential saddle points via Hessian approximation; tested on MLPs, CNNs, ResNets
  • Project: SPRKD
Junior Machine Learning Engineer The Rounds ($40M healthcare startup)
Jul. 2023 – Jan. 2024
  • Architected first ML infrastructure for min latency and high server volume; devised novel drug vector embeddings
  • Deployed few-shot LLM drug monograph summarization API and React app for in-clinic patient diagnosis
Machine Learning Specialist Actionable.co (1000+ orgs)
Jul. 2021 – May. 2023
  • Led team of 3; developed + deployed recommendation engine API to 45k clients
  • Engineered hybrid GAN, Gradient Boosted Tree, MLP architectures for production

Honors and Awards

Optiver Market Making Competition Winner
First Place in Quantitative Trading, first freshman team to win
Hudson River Trading: Best Use of Quantitative Data Award
For machine learning insurance predictor platform
U.S. NSA: Second Award in Cybersecurity/Mathematics Research
At Regeneron ISEF (1600+) for neural network compression research (SPRKD)
Regeneron ISEF: Team Canada-ISEF Selectee
1 of 8 selected to represent Canada; Fourth Award in Robotics and Intelligent Machines
TEDx Speaker, Innovire Speaker
Mathematical Foundations of ML, AI Research (linktr.ee/AdiCMU)
Morgan Stanley/Quantbot Data Trading Competition
Sixth Place for quant trading algo w/highest SHARPE
Outstanding Research Award, WAICY 2022
For SPRKD - Saddle Point Reversion for Knowledge Distillation
3rd Place SciComm Viewpoint Challenge Winner, SCVC 2022
For viewpoint paper on leveraging loss landscape properties to enhance knowledge distillation

Projects

Expected Gradient Divergence Weighting (EGDW) for Robust Memory Updates in TITANS
Probabilistic update method for TITANS neural memory module using Markov's inequality and Jensen's inequality. Achieved lower validation cross-entropy than baseline with higher train loss, indicating better generalization. Grade: 96%.
SPRKD
SPRKD (Saddle Point Recruitment for Knowledge Distillation)
2022-2024
Novel neural network compression algorithm yielding 2-24% more accuracy in 10% training time. Awarded by NSA, ISEF, WAICY.
Born-Again Neural Networks
Born-Again Neural Networks
Implementation of Born-Again Neural Networks from scratch.
Adam Optimization
Adam Optimization From Scratch
Implementation of Adam optimizer from scratch.
Maxout Activation
Custom Maxout Activation Implementation
Custom implementation of Maxout activation function.
Autonomous Vehicle Simulator
Autonomous Vehicle Simulator
Autonomous vehicle simulator using behavioral cloning.
Symptom Diagnosis AI
Symptom Diagnosis AI
AI system for symptom diagnosis and medical assistance.
Chess AI
Chess AI
Chess-playing AI bot implementation.
Crysta
Crysta
All-in-one productivity platform for students based on non-invasive energy level tracking.

GitHub Repositories

Born-Again Neural Networks implementation
Adam optimizer from scratch
Custom Maxout activation
Behavioral cloning simulator
AI medical diagnosis system
Chess-playing AI bot
Productivity platform

Publications & Articles

Education for the Next Generation: Nurturing Effective Learning

Published on Amazon, exploring modern approaches to education and effective learning strategies.

Viewpoint Paper on Knowledge Distillation
Atherma: Solving the Energy Crisis with Nuclear
The Greatest Threat to Human Survival is Us
Why You Aren't Achieving Your Goals
Robots Care More About the Environment Than We Do
Prime Editing: The Future of Gene Editing

Talks & Presentations

Mathematical Foundations of ML, AI Research. For all talk links, visit linktr.ee/AdiCMU

TEDx Talk

Innovire Talk

News

Skills

Languages: Python, C, C++, Java, SML, SQL, HTML, CSS, JavaScript
Frameworks & Technologies: PyTorch, TensorFlow, Node.js, Flask, React.js, NumPy, Pandas, XGBoost, CUDA

Contact

Email | LinkedIn | GitHub | Twitter | Newsletter