
Anshumaan Singh

Machine Learning Systems

Interested in the full stack of machine learning, from the systems and infrastructure that run models at scale to the research and training that make those models better.

I want to work at the boundary where algorithms meet hardware constraints, where a smarter scheduler or a better architecture can make the difference. Whether that's optimizing inference pipelines, building distributed training systems, or pushing model quality through better fine-tuning — that's the work I'm drawn to.

About Me


~/about/personal-intro

Personal Introduction

I'm a Computer Science Honors student at Stony Brook University with a deep passion for Artificial Intelligence and Machine Learning.

My journey over the last two years has been a rapid cycle of learning, building, and adapting. I thrive in environments where I can dive headfirst into challenges and take ownership from start to finish.

At Mailgator.AI, I built a complete CI pipeline from scratch, accelerating QA runtimes by 26×. At Stony Brook's LUNR AI Lab, I optimized research pipelines and fine-tuned CodeLlama-7B, improving RAG accuracy and achieving 73.5% faster benchmark runtimes.

What drives me isn't just technical skill—it's a relentless work ethic and genuine curiosity. When frustrated with generic phone notifications, I taught myself Kotlin and built NotiSentry, an AI-powered notification filter. This scrappy, problem-solving approach defines how I tackle challenges.

I'm looking for opportunities where I can contribute to a larger vision, learn from talented teams, and apply my "builder, owner, partner" mindset to real-world problems. Let's connect!

~/about/skills

Skills

• Programming Languages: MIPS Assembly, Python, Java, C, Kotlin (for Android Jetpack Compose), OCaml, JavaScript, HTML/CSS, MATLAB

• Frameworks & Libraries: TensorFlow, LangChain, HuggingFace, OpenCV, Pandas, NumPy, Flask, Bootstrap

• Developer Tools & Platforms: Docker, AWS, Git, GitHub, Linux/Unix, HPC Clusters (Slurm), MySQL, SQLite3

~/about/education

Education

• Degree: BSc in Computer Science, specializing in AI and ML

• University: Stony Brook University, New York, USA

• CGPA: 3.9

• Honors: SUNY SOAR Research Fellow, URECA Fellow, Computer Science Honors, University Scholars, Dean's List (2023 – 2026)

• Awards: Academic Achievement Award, YouAreWelcomeHere Award, Global Excellence Award

• Relevant Coursework: Object-Oriented Programming, Data Structures, System Fundamentals, Linear Algebra, Probability and Statistics, Discrete Math, Calculus I & II

~/about/hobbies

Hobbies

• Photography: I love photographing landscapes, nature, and the night sky. Check out some of the pictures I have taken on my Instagram!

• Sailing: I recently started sailing with Stony Brook University's Sailing Club and I am a huge fan now. I absolutely love it!

• Sketching: I recently bought a new iPad Air and have been learning how to sketch on it. So far, for someone who had never touched a sketch pen before, I'd say I am doing pretty well.

• Video Games: When I am not taking and editing photographs, sailing, or sketching, I am probably playing video games. It is one of my favourite indoor pastimes.

Work Experience


~/work/bmo
Incoming AI & ML Engineering Intern | June 2026 – August 2026

Bank of Montreal (BMO) | Berkeley Heights, NJ


• Incoming software engineering internship focused on AI infrastructure and machine learning systems.

~/research/reliable-systems-lab
Undergraduate Research Assistant | January 2026 – Present

Reliable Systems Lab | Stony Brook University, NY


• Engineering a lightweight encoder-only attention architecture for multi-agent systems, leveraging permutation equivariance to process jagged input arrays and eliminate data sorting overhead.

• Improving UAV agent target-seeking efficiency by 34.6% by architecting a Curriculum Weight Tuning API using CMA-ES with a grouped parameter strategy.

• Enabling 100% collision-free horizontal scaling on the SeaWulf HPC cluster by building a distributed isolation framework using dynamic sandboxing to eliminate MATLAB/MEX race conditions.

~/work/mailgator
Software Engineer (Generative AI & Backend) | September 2025 – January 2026

Mailgator | Palo Alto, CA (Hybrid)


• Accelerated QA runtimes by 26×, enabling controlled testing across 700+ cases, by engineering and deploying a RESTful mock server (FastAPI) with full CRUD support on AWS EC2.

• Increased data accuracy and system reliability by resolving critical parsing bugs, enhancing OpenAI prompts for data extraction, and ensuring comprehensive handling of sender-recipient edge cases.

• Developed LLM systems for email analysis with a 5-person agile team, building prompt QA and UX test infrastructure for the ML backend (FastAPI, Node.js, React, PostgreSQL).

~/research/lunr-ai-lab
Undergraduate AI Research Assistant (Generative AI) | February 2025 – March 2026

LUNR AI Lab | Stony Brook University, NY


• Improved Coding RAG accuracy by 5.4% (MBPP Eval) and 1.6% (ODEX Eval) by fine-tuning CodeLlama-7B on a custom 460K+ sample dataset in a 4-person team, targeting two ACL 2026 publications.

• Achieved 73.5% faster benchmark runtimes against existing baselines by designing a parallelized RAG benchmark system using vLLM and multiple commercial APIs.

• Reduced LLM inference costs by up to 100% on cached requests by integrating a SQLite3 caching system into the model distillation pipeline.
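
The caching idea in the last bullet can be sketched roughly as follows. This is a minimal illustration under my own assumptions, not the lab's actual pipeline: the `CachedLLM` class, the table layout, and the `fake_model` stand-in are all hypothetical. A repeated prompt is answered from SQLite instead of triggering a new paid model call, which is where a saving of up to 100% can come from.

```python
import hashlib
import sqlite3
from typing import Callable


class CachedLLM:
    """Wraps a text-generation callable with a SQLite-backed response cache."""

    def __init__(self, generate: Callable[[str], str], db_path: str = ":memory:"):
        self.generate = generate  # the expensive call (hypothetical stand-in)
        self.calls = 0            # number of real model calls actually made
        self.conn = sqlite3.connect(db_path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS cache (key TEXT PRIMARY KEY, response TEXT)"
        )

    @staticmethod
    def _key(prompt: str) -> str:
        # Hash the prompt so arbitrarily long prompts map to a fixed-size key.
        return hashlib.sha256(prompt.encode("utf-8")).hexdigest()

    def __call__(self, prompt: str) -> str:
        key = self._key(prompt)
        row = self.conn.execute(
            "SELECT response FROM cache WHERE key = ?", (key,)
        ).fetchone()
        if row is not None:
            return row[0]  # cache hit: no model call, no cost

        # Cache miss: pay for one model call and remember the answer.
        response = self.generate(prompt)
        self.calls += 1
        self.conn.execute(
            "INSERT OR REPLACE INTO cache (key, response) VALUES (?, ?)",
            (key, response),
        )
        self.conn.commit()
        return response


def fake_model(prompt: str) -> str:
    """Stand-in for a paid LLM API call (assumption for this sketch)."""
    return prompt.upper()
```

Wrapping a model as `llm = CachedLLM(fake_model)` and calling `llm("hello")` twice makes only one real call; the second answer comes from the cache. A real pipeline would likely also key on the model name and sampling parameters, which this sketch leaves out.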

~/volunteer/humanity-unleashed
Volunteer AI Researcher & Data Engineer | November 2024 – February 2025

Humanity Unleashed (humun.org) | Remote


Volunteered at Humanity Unleashed, a self-funded, volunteer-only research organization, and contributed in the following ways:

• Built pipelines processing 700K rows of economic data, publishing 5 datasets to HuggingFace with 1,400+ downloads.

• Collaborated with two peers to develop a policy generation and summarization pipeline using LangChain to translate complex economic data into nuanced policy explanations.

~/academic/teaching-assistant
Teaching Assistant | January 2024 – December 2024

University Scholars Fellowship | Stony Brook University, NY


• Led and mentored incoming University Scholars students at Stony Brook University.

• Gained teaching experience as a Teaching Assistant in the SCH 275 pre-fellowship program.

• Assisted the University Scholars Director in teaching the class, drawing on my own life experiences and teaching style.

• Designed creative presentations, graphics & videos to make weekly student workshops engaging and fun.

~/research/igem
Software Engineer (Generative AI & Full Stack) | February 2024 – October 2024

iGEM, Stony Brook University | Stony Brook, NY


• Achieved 90% retrieval accuracy on embedded research documents by building a RAG Q&A chatbot using Transformers and LangChain, improving research wiki UX.

• Helped secure over $50K in funding by leading a 3-person team to develop a research wiki (Flask) that attracted 15+ stakeholders.

~/work/it-assistant
IT Assistant Intern | January 2024 – July 2024

Faculty Student Association at Stony Brook University | Stony Brook University, NY


• Developed and maintained databases using MySQL and Google Sheets, and automated workflows with Google Apps Script implementations of algorithms such as binary sort and binary search, improving work efficiency.

• Inventoried, maintained, and resolved issues on over 400 systems across the university campus.

• Installed and configured computer hardware & software, including Windows and Linux operating systems, in 40+ systems.

~/work/omniscience
Full Stack Web Developer Intern | April 2023 – June 2023

Omniscience Corporation | Palo Alto, CA (Remote)


• Created a life insurance Flask web app with 3000+ lines of code using HTML, CSS, JavaScript, Jinja, MySQL, and Bootstrap.

• Containerized the application using Docker and deployed it on Amazon Web Services to ensure a reliable & scalable environment.

• Improved accessibility using the Bootstrap framework, prioritizing user experience and effective communication.

Featured Projects

Highlighting my most impactful work in AI, systems programming, and full-stack development


~/projects/systems/fuzzer
Automated Systems Fuzzer

Engineered a high-performance C fuzzer utilizing Unix signals and syscalls (fork, waitpid) to stress-test executables via mutated input streams, achieving 100% process isolation and automated memory leak detection.

~/projects/tools/cmdflow
CMDFlow at HackPrinceton

Built a local-first, AI-powered command-tracking system (FastAPI, ReactJS, MongoDB) that streams shell activity (<1s latency), performs PII scrubbing, and semantically indexes commands for natural language search and automatic project-based organization.

GitHub Project | Devpost
~/projects/ml/notisentry
NotiSentry: A Smart DND with LLMs

An intelligent notification filtering and summarization app, built with the Firebase Gemini API and Jetpack Compose, that minimizes user distractions and improves focus. Optimized to under 1% battery drain over 5 hours with a 1.3 s worst-case latency.

~/projects/ml/replug-lsr
REPLUG LSR with vLLM

Refactored a research implementation of REPLUG to enable LM-Supervised Retrieval (LSR) fine-tuning for code generation tasks by architecting a high-performance training pipeline with a local vLLM server.

GitHub Project (Coming Soon)
~/projects/low-level/red-black-tree
Red-Black Tree in MIPS Assembly

Implementation of core functions for a Red-Black Tree data structure, written entirely in MIPS assembly language, focusing on low-level memory operations, register conventions, and complex balanced data structures.

GitHub Project
~/projects/systems/fs-emulator
C Based Linux Filesystem Emulator

In-memory emulation of a Linux-like filesystem written entirely in C, managing core filesystem structures like i-nodes and data blocks, handling memory allocation, and implementing a hierarchical file and directory system from the ground up.

GitHub Project
~/projects/networking/poker-server
Multi-Client Texas Hold'em Poker Server

Multi-threaded poker server built in Java supporting concurrent client connections, game state management, and real-time gameplay with robust error handling and network communication.

GitHub Project

Additional Projects

~/projects/games/skyscrapers
Skyscrapers Puzzle Game

Implementation of the single-player puzzle game "Skyscrapers" in C, including both an interactive version and an automatic solver based on logical heuristics.

GitHub Project
~/projects/security/aflent-protocol
AFLENT Protocol and Custom Encryption

Low-level data manipulation in C, implementing a custom network protocol (AFLENT) and a block cipher for data encryption/decryption with byte and bit-level computations.

GitHub Project
~/projects/vision/sobel-operator
Sobel Operator Edge Detection

Image processing program implementing the Sobel operator algorithm to detect edges of objects in images.

GitHub Project
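
For readers unfamiliar with the Sobel operator, here is a minimal pure-Python sketch of the technique. It is an illustration only and not this project's code: the `sobel` function name, the list-of-lists grayscale input, and the choice to leave border pixels at zero are my own assumptions.

```python
import math
from typing import List

# Standard 3x3 Sobel kernels for horizontal (GX) and vertical (GY) gradients.
GX = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
GY = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def sobel(image: List[List[int]]) -> List[List[float]]:
    """Return the gradient magnitude of a non-empty rectangular grayscale image.

    Border pixels are left at 0.0 because the 3x3 window does not fit there.
    """
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0
            for dy in range(3):
                for dx in range(3):
                    pixel = image[y + dy - 1][x + dx - 1]
                    gx += GX[dy][dx] * pixel
                    gy += GY[dy][dx] * pixel
            # Large magnitudes mark sharp intensity changes, i.e. edges.
            out[y][x] = math.hypot(gx, gy)
    return out
```

A flat image yields zero everywhere, while a sharp vertical boundary between dark and bright columns yields large values along that boundary.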
~/projects/nlp/text-similarity
Text Similarity Check

Java program for comparing text files and checking authorship similarity using cosine similarity.

GitHub Project
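
As a rough illustration of the cosine similarity measure named above, here is a short Python sketch. It is not the project's Java code: the bag-of-words tokenization and the function names `word_counts` and `cosine_similarity` are assumptions made for this example.

```python
import math
import re
from collections import Counter


def word_counts(text: str) -> Counter:
    """Lowercase the text and count word occurrences (a simple bag of words)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine_similarity(a: str, b: str) -> float:
    """Cosine of the angle between the word-count vectors of two texts."""
    va, vb = word_counts(a), word_counts(b)
    # Only words present in both texts contribute to the dot product.
    dot = sum(va[w] * vb[w] for w in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)
```

Scores range from 0 (no shared words) to 1 (identical word distributions), so two texts by the same author would be expected to score closer to 1 than texts by different authors.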
~/projects/algorithms/social-graph
Social Media Graph

Java simulation of a social media network using graph data structures to model followers and followings.

GitHub Project
~/projects/vision/box-blur
Box Blur Image Filter

Image processing program applying a box blur algorithm to images, optimized for images under 800×800 pixels.

GitHub Project
~/projects/vision/bw-filter
Black & White Filter

Image filter applying black and white conversion by averaging RGB values of each pixel.

GitHub Project
~/projects/security/playfair
Playfair Encryption

Java implementation of the Playfair cipher for encrypting and decrypting text.

GitHub Project
~/projects/systems/c-tracer
C Program Tracer

Java tool for tracking C code blocks and variables initialized/updated within them.

GitHub Project
~/projects/java/hiring-sim
Hiring System Simulator

Java simulation of a complete hiring system manager for job applicants.

GitHub Project
~/projects/algorithms/luhn-verifier
Credit Card Verifier

C program that validates credit card numbers using the Luhn algorithm.

GitHub Project
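
For readers unfamiliar with the Luhn check, here is a minimal Python sketch of the idea. It is an illustration only, not the project's C code, and the function name `luhn_valid` is hypothetical.

```python
def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum.

    Non-digit characters such as spaces or dashes are ignored.
    """
    digits = [int(ch) for ch in number if ch.isdigit()]
    if len(digits) < 2:
        return False

    total = 0
    # Walk from the rightmost digit and double every second digit.
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9  # same as summing the two digits of the product
        total += digit
    return total % 10 == 0
```

The check catches any single mistyped digit, which is why it is commonly used as a first-pass sanity check before a card number is sent anywhere for real validation.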
~/projects/java/food-pyramid
Food Pyramid Simulator

Java program simulating a food pyramid/food chain ecosystem.

GitHub Project

Latest Achievements & Recognition


~/systems/hpc-scaling
HPC & Distributed Systems

Enabled 100% collision-free horizontal scaling on the SeaWulf HPC cluster using dynamic sandboxing and isolation.

Reliable Systems Lab

~/ml/inference-optimization
LLM Inference Optimization

Reduced LLM inference costs by up to 100% by architecting an SQLite3 caching layer for model distillation pipelines.

LUNR AI Lab

~/systems/infrastructure
Systems Infrastructure

Reduced validation latency by 99.9% via a scalable AWS EC2 CI/CD pipeline with asynchronous integrity checks.

Mailgator

~/ml/model-fine-tuning
LLM Fine-Tuning & RAG

Improved Coding RAG accuracy by 5.4% by fine-tuning CodeLlama-7B via LoRA, and achieved 73.5% faster benchmark runtimes with a parallelized RAG benchmark system.

LUNR AI Lab

Contact


Feel free to reach out for collaborations or just a friendly chat.

Email Me | LinkedIn | GitHub