I'm an MS candidate in Natural Language Processing at UC Santa Cruz, working with Dr. Leilani Gilpin and Dr. Chenguang Wang.

I received my undergraduate degree from the University of Arizona. I also served as a Research Scholar at the University of Edinburgh, where I worked on neuro-symbolic AI.

My current research centers on reward design and verification in LLM alignment, specifically understanding when process supervision beats outcome supervision in reinforcement learning from verifiable rewards (RLVR), and how we can build more reliable training signals for reasoning models. I'm also exploring memory architectures for LLM agents, inspired by sleep-cycle consolidation, to help agents retain and organize knowledge over long horizons.

Previously, I worked on constraint-guided traffic generation for autonomous vehicles, building neuro-symbolic systems that satisfy physical validity constraints while generating realistic driving scenarios (presented at NeurIPS 2025).

I also contribute to RLLM, an open-source project advancing reinforcement learning for language models.

If you're wondering whether an AI system's reasoning can be trusted, whether it's a self-driving car explaining a split-second decision or a language model showing its work on a math proof, that's exactly the kind of problem I'm trying to solve :)

Kargi Chauhan