Anushka Pachaury

Software Engineer

San Francisco, California, United States9 mos experience

AI EnabledAI ML Practitioner

Key Highlights

Expert in Machine Learning and AI applications.
Proven track record in developing scalable systems.
Strong background in research and practical engineering.

Stackforce AI infers this person is a Machine Learning Engineer with expertise in AI applications across various industries.

Contact

Skills

Core Skills

Machine LearningArtificial Intelligence (ai)Distributed Systems

Other Skills

Python (Programming Language)Scikit-LearnNatural Language Processing (NLP)TensorFlowPyTorchPrompt EngineeringRetrieval-Augmented Generation (RAG)Google Cloud Platform (GCP)Weights & BiasesTransformersTime Series ForecastingExtract, Transform, Load (ETL)Explainable AI (SHAP, ELI5)LinuxDeep Learning

About

Hi, my name is Anushka Pachaury, I am a Computer Engineering graduate from the University of Illinois Urbana-Champaign and a current Master's student in Computer Science at Columbia University, specializing in Machine Learning. My work sits at the intersection of research and applied engineering, where I design and deploy systems that push the boundaries of AI performance. I've had the opportunity to work across academia (Columbia University), startups (Arklex AI), and industry-leading companies such as Visa, PwC, Hewlett Packard Enterprise, and Siemens. My projects have ranged from building scalable ETL pipelines and merchant churn forecasting systems to developing multi-agent AI exam simulations and automated patent drafting pipelines using OpenAI and Claude APIs. More recently, I've worked on benchmarking studies of LLMs (LLaMA, GPT, Sparse/MoE architectures) to analyze efficiency, energy usage, and model scaling trends. What excites me most is applying machine learning to solve real-world problems, whether that’s improving inference efficiency, automating technical documentation, or enhancing interactive learning systems. I enjoy working at the intersection of research and implementation, which turns ideas into deployable solutions. Outside of work, I'm passionate about mentoring, teaching, and puzzle solving, whether that is tackling Rubik's Cube challenges or finding creative ays to approach real world technical problems

Experience

9 mos

Total Experience

3 mos

Average Tenure

3 mos

Current Experience

Google

Software Engineer

Feb 2026 – Present · 3 mos · Mountain View, California, United States · On-site

Visa acceptance solutions

Machine Learning Intern

May 2025 – Aug 2025 · 3 mos · Seattle, Washington, United States · On-site

Replaced Visa's HDFS-based batch reporting with a low latency, high throughput ClickHouse pipeline, achieving 58% faster processing speeds (29,000 vs 18,000 records/sec) and 37% higher throughput
Built a scalable ETL pipeline connecting Visa's SQL server to ClickHouse to enable sub-second query performance, materialized views for pre-computed aggregations, and integration with Tableau
Leveraged ClickHouse's columnar storage and vectorized execution engine to eliminate Hadoop's multi-hop latency and ensure consistent performance across large-scale usage datasets.
Built a time series forecasting pipeline using temporal features and an LSTM-based classifier to predict merchant attrition 3-8 months in advance, enabling proactive intervention; the model achieved 96% precision for "Will stay" and 76% recall for "Will Churn" predictions
Used SHAP-based interpretability analysis, revealing the LSTM model's over-reliance on payment volume features, leading to a redesign focused on behavioral signals and multi-dimensional churn indicators
Developed an enhanced two-step XGBoost pipeline that improved churn recall to 90% and enabled month-specific churn forecasting (eg., predicting whether a merchant would churn in 3,4,5, or 6 months).

Time Series ForecastingExtract, Transform, Load (ETL)Python (Programming Language)Explainable AI (SHAP, ELI5)LinuxDeep Learning+10

Columbia university / pwc collaboration

AI Student Researcher

Jan 2025 – Nov 2025 · 10 mos · New York, New York, United States · Remote

With mentors from PwC alongside mentors from Columbia University, Built a modular benchmarking framework to measure inference latency, memory, energy and carbon emissions across sweeps of model type, quantization(fp16/int8/int4), batch size, and I/O length
Integrated Zeus GPU power monitoring with automated experiment logging and W&B visualization
Benchmarked LLaMA-2, Deepseek-Distill, SparseBERT, and Switch Transformer on Aplaca and GSM8K, capturing performance trends across dense, sparse and MOE architectures
Identified batch size as the dominant factor: increasing from 1->4 cut latency by 60% and reduced energy/token by 50%
Sparse and MoE models achieved lowest energy/token across both GPUs
A100 delivered 10x lower latency and 75% less total energy than L4 for identical workloads

Google Cloud Platform (GCP)Weights & BiasesPython (Programming Language)Scikit-LearnMachine LearningTransformers+3

Arklex ai

AI Research Intern

Dec 2024 – Feb 2026 · 1 yr 2 mos · New York, New York, United States · On-site

Built a multi-agent AI driven simulation of Cambridge KET and PET English speaking exams using the Agent-First-Organization framework, orchestrating interactions between a GPT-4o based AI Examiner, AI Student, and a real student interface
Engineered the exam flow across three stages (personal interview, topic-based discussion, and follow-up) by designing custom worker nodes and task graphs that controlled turn-taking, topic progression, and difficulty-based response generation
Incorporated OpenAI's embedding model to semantically match student responses against a curated rubric of common grammar and vocabulary errors for accurate, rubric-aligned scoring
Extended the PET system to support image-based prompts by designing visual-based interaction modules and embedding captioned diagrams, mimicking real exam visuals.
Developing an automated patent description generation system leveraging Claude and OpenAI APIs with integrated style transfer capabilities
Building end-to-end patent figure analysis pipeline leveraging multimodal AI to automatically identify identify components validate technical accuracy, and generate formal patent descriptions from technical drawings using Chain-of-Thought prompting strategies

Python (Programming Language)Scikit-LearnMachine LearningNatural Language Processing (NLP)Artificial Intelligence (AI)TensorFlow+3

Hewlett packard enterprise

Software Engineer Intern

May 2023 – Aug 2023 · 3 mos · San Jose, California, United States · On-site

Did training on Internet Key Exchange(IKE), IPsec, SD-WAN, particularly with reference to Silver Peak's EdgeConnect and Orchestrator
Added support for the EdgeConnect to be in responder-only mode during IKE processing for IPsec tunnels in SD-WAN including Command Line Interface and backend development in C
Ran multiple trials with different Strongswan configurations to evaluate potential solutions
Worked on improvements to EdgeConnect IPsec Tunnel Debuggability from the Orchestrator by adding IKE and IPsec tunnel logs with filter patterns in all stages of standard IPsec and IKEless tunnel bring up, including key exchange and datapath in C
Added anti-replay state information for tunnel troubleshoot in C language

LinuxC++C (Programming Language)DockerBashDistributed Systems

University of illinois urbana-champaign

Machine Learning Student Researcher

Sep 2022 – Aug 2024 · 1 yr 11 mos · On-site

Applied YOLOv5 and GroundingDINO for crop disease detection, improving localization accuracy by 30% on benchmark datasets, and designed a preprocessing pipeline with Gaussian filtering and resolution normalization to enhance model input quality
Optimized training pipeline through data augmentation and learning rate scheduling, reducing false positives by 25% across varied crop disease images

Python (Programming Language)C++Machine Learning

Siemens digital industries software

Software Engineer Intern

May 2022 – Oct 2022 · 5 mos · Massachusetts, United States

Worked on distributed velsyn which is a multi-process implementation of the Velsyn compiler
Designed and developed Dominator Propagation algorithm which is used for logic optimization
Created testcases using Verilog to test full implementation of the code
Programmed in C using Linux and debugged the design using Gdb in distributed systems
Redesigned a connectivity comparison Python script used to compare netlist connectivity between - sequential and distributed compile of designs to ensure correct connectivity across cores
Connectivity script takes less than 20 minutes on a design that originally did not finish for hours

Python (Programming Language)LinuxC (Programming Language)DockerBashDistributed Systems