Anushka Pachaury

Software Engineer

San Francisco, California, United States9 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in Machine Learning and AI applications.
  • Proven track record in developing scalable systems.
  • Strong background in research and practical engineering.
Stackforce AI infers this person is a Machine Learning Engineer with expertise in AI applications across various industries.

Contact

Skills

Core Skills

Machine LearningArtificial Intelligence (ai)Distributed Systems

Other Skills

Python (Programming Language)Scikit-LearnNatural Language Processing (NLP)TensorFlowPyTorchPrompt EngineeringRetrieval-Augmented Generation (RAG)Google Cloud Platform (GCP)Weights & BiasesTransformersTime Series ForecastingExtract, Transform, Load (ETL)Explainable AI (SHAP, ELI5)LinuxDeep Learning

About

Hi, my name is Anushka Pachaury, I am a Computer Engineering graduate from the University of Illinois Urbana-Champaign and a current Master's student in Computer Science at Columbia University, specializing in Machine Learning. My work sits at the intersection of research and applied engineering, where I design and deploy systems that push the boundaries of AI performance. I've had the opportunity to work across academia (Columbia University), startups (Arklex AI), and industry-leading companies such as Visa, PwC, Hewlett Packard Enterprise, and Siemens. My projects have ranged from building scalable ETL pipelines and merchant churn forecasting systems to developing multi-agent AI exam simulations and automated patent drafting pipelines using OpenAI and Claude APIs. More recently, I've worked on benchmarking studies of LLMs (LLaMA, GPT, Sparse/MoE architectures) to analyze efficiency, energy usage, and model scaling trends. What excites me most is applying machine learning to solve real-world problems, whether that’s improving inference efficiency, automating technical documentation, or enhancing interactive learning systems. I enjoy working at the intersection of research and implementation, which turns ideas into deployable solutions. Outside of work, I'm passionate about mentoring, teaching, and puzzle solving, whether that is tackling Rubik's Cube challenges or finding creative ays to approach real world technical problems

Experience

9 mos
Total Experience
3 mos
Average Tenure
3 mos
Current Experience

Google

Software Engineer

Feb 2026Present · 3 mos · Mountain View, California, United States · On-site

Visa acceptance solutions

Machine Learning Intern

May 2025Aug 2025 · 3 mos · Seattle, Washington, United States · On-site

  • Replaced Visa's HDFS-based batch reporting with a low latency, high throughput ClickHouse pipeline, achieving 58% faster processing speeds (29,000 vs 18,000 records/sec) and 37% higher throughput
  • Built a scalable ETL pipeline connecting Visa's SQL server to ClickHouse to enable sub-second query performance, materialized views for pre-computed aggregations, and integration with Tableau
  • Leveraged ClickHouse's columnar storage and vectorized execution engine to eliminate Hadoop's multi-hop latency and ensure consistent performance across large-scale usage datasets.
  • Built a time series forecasting pipeline using temporal features and an LSTM-based classifier to predict merchant attrition 3-8 months in advance, enabling proactive intervention; the model achieved 96% precision for "Will stay" and 76% recall for "Will Churn" predictions
  • Used SHAP-based interpretability analysis, revealing the LSTM model's over-reliance on payment volume features, leading to a redesign focused on behavioral signals and multi-dimensional churn indicators
  • Developed an enhanced two-step XGBoost pipeline that improved churn recall to 90% and enabled month-specific churn forecasting (eg., predicting whether a merchant would churn in 3,4,5, or 6 months).
Time Series ForecastingExtract, Transform, Load (ETL)Python (Programming Language)Explainable AI (SHAP, ELI5)LinuxDeep Learning+10

Columbia university / pwc collaboration

AI Student Researcher

Jan 2025Nov 2025 · 10 mos · New York, New York, United States · Remote

  • With mentors from PwC alongside mentors from Columbia University, Built a modular benchmarking framework to measure inference latency, memory, energy and carbon emissions across sweeps of model type, quantization(fp16/int8/int4), batch size, and I/O length
  • Integrated Zeus GPU power monitoring with automated experiment logging and W&B visualization
  • Benchmarked LLaMA-2, Deepseek-Distill, SparseBERT, and Switch Transformer on Aplaca and GSM8K, capturing performance trends across dense, sparse and MOE architectures
  • Identified batch size as the dominant factor: increasing from 1->4 cut latency by 60% and reduced energy/token by 50%
  • Sparse and MoE models achieved lowest energy/token across both GPUs
  • A100 delivered 10x lower latency and 75% less total energy than L4 for identical workloads
Google Cloud Platform (GCP)Weights & BiasesPython (Programming Language)Scikit-LearnMachine LearningTransformers+3

Arklex ai

AI Research Intern

Dec 2024Feb 2026 · 1 yr 2 mos · New York, New York, United States · On-site

  • Built a multi-agent AI driven simulation of Cambridge KET and PET English speaking exams using the Agent-First-Organization framework, orchestrating interactions between a GPT-4o based AI Examiner, AI Student, and a real student interface
  • Engineered the exam flow across three stages (personal interview, topic-based discussion, and follow-up) by designing custom worker nodes and task graphs that controlled turn-taking, topic progression, and difficulty-based response generation
  • Incorporated OpenAI's embedding model to semantically match student responses against a curated rubric of common grammar and vocabulary errors for accurate, rubric-aligned scoring
  • Extended the PET system to support image-based prompts by designing visual-based interaction modules and embedding captioned diagrams, mimicking real exam visuals.
  • Developing an automated patent description generation system leveraging Claude and OpenAI APIs with integrated style transfer capabilities
  • Building end-to-end patent figure analysis pipeline leveraging multimodal AI to automatically identify identify components validate technical accuracy, and generate formal patent descriptions from technical drawings using Chain-of-Thought prompting strategies
Python (Programming Language)Scikit-LearnMachine LearningNatural Language Processing (NLP)Artificial Intelligence (AI)TensorFlow+3

Hewlett packard enterprise

Software Engineer Intern

May 2023Aug 2023 · 3 mos · San Jose, California, United States · On-site

  • Did training on Internet Key Exchange(IKE), IPsec, SD-WAN, particularly with reference to Silver Peak's EdgeConnect and Orchestrator
  • Added support for the EdgeConnect to be in responder-only mode during IKE processing for IPsec tunnels in SD-WAN including Command Line Interface and backend development in C
  • Ran multiple trials with different Strongswan configurations to evaluate potential solutions
  • Worked on improvements to EdgeConnect IPsec Tunnel Debuggability from the Orchestrator by adding IKE and IPsec tunnel logs with filter patterns in all stages of standard IPsec and IKEless tunnel bring up, including key exchange and datapath in C
  • Added anti-replay state information for tunnel troubleshoot in C language
LinuxC++C (Programming Language)DockerBashDistributed Systems

University of illinois urbana-champaign

Machine Learning Student Researcher

Sep 2022Aug 2024 · 1 yr 11 mos · On-site

  • Applied YOLOv5 and GroundingDINO for crop disease detection, improving localization accuracy by 30% on benchmark datasets, and designed a preprocessing pipeline with Gaussian filtering and resolution normalization to enhance model input quality
  • Optimized training pipeline through data augmentation and learning rate scheduling, reducing false positives by 25% across varied crop disease images
Python (Programming Language)C++Machine Learning

Siemens digital industries software

Software Engineer Intern

May 2022Oct 2022 · 5 mos · Massachusetts, United States

  • Worked on distributed velsyn which is a multi-process implementation of the Velsyn compiler
  • Designed and developed Dominator Propagation algorithm which is used for logic optimization
  • Created testcases using Verilog to test full implementation of the code
  • Programmed in C using Linux and debugged the design using Gdb in distributed systems
  • Redesigned a connectivity comparison Python script used to compare netlist connectivity between - sequential and distributed compile of designs to ensure correct connectivity across cores
  • Connectivity script takes less than 20 minutes on a design that originally did not finish for hours
Python (Programming Language)LinuxC (Programming Language)DockerBashDistributed Systems

Education

Columbia University

Master of Science - MS — Machine Learning

Aug 2024Dec 2025

University of Illinois Urbana-Champaign

Bachelor's degree — Computer Engineering

Aug 2020May 2024

Stackforce found 100+ more professionals with Machine Learning & Artificial Intelligence (ai)

Explore similar profiles based on matching skills and experience