Amit Kumar

Director of Engineering

Sunnyvale, California, United States18 yrs 5 mos experience
Highly Stable

Key Highlights

  • Led digital transformation initiatives at Yahoo.
  • Built one of the world's largest Knowledge Graphs.
  • Achieved significant cost savings through data pipeline optimizations.
Stackforce AI infers this person is a SaaS expert with a strong focus on data infrastructure and engineering leadership.

Contact

Skills

Core Skills

Team ManagementSoftware ManagementData InfrastructureApache SparkTechnical LeadershipData MiningRuby On RailsAgile Methodologies

Other Skills

AWS LambdaAWS NeptuneAlgorithmsAmazon EC2Amazon EKSAmazon Web Services (AWS)Apache AirflowApache HudiApache OozieApache PigBig DataCloud ComputingCross-functional Team LeadershipData LakesData Pipelines

About

Accomplished Engineering Leader with 17+ years of experience building large-scale, globally distributed systems. Proven expertise in hiring, managing, and developing high-performance engineering team, fostering talent into world-class engineers. Have been spearheading the Digital transformation of Data Platform and Serving Infrastructure of the entire Consumer Research Org to the public cloud (AWS). Built and manage one of the world’s largest Knowledge Graphs driving multimillion-dollar revenue, powering several AI/ML pipelines, and enhancing user experiences across Yahoo.

Experience

18 yrs 5 mos
Total Experience
8 yrs 7 mos
Average Tenure
1 yr 3 mos
Current Experience

Socure

Senior Engineering Manager

Mar 2025Present · 1 yr 3 mos · Remote

  • Spearheading a greenfield initiative to build a comprehensive Identity Graph covering the entire U.S. population. This foundational platform will power Socure’s AI-driven digital identity verification and fraud prevention solutions, trusted by leading enterprises and government agencies to establish trust and mitigate risk across the customer lifecycle.
Software ManagementTeam ManagementCross-functional Team Leadership

Cardlytics

Principal Software Engineer

Jan 2025Mar 2025 · 2 mos · Menlo Park, California, United States · Hybrid

  • Worked on optimizing Data Pipelines feeding into the Apache Hudi based Data Lakehouse organized in a Medallion architecture. The data consisted approximately half of all card-based transactions in the U.S. and a quarter in the U.K, and powers rewards experiences in various banking and finance apps. Was able to achieve close to 50% cost and latency savings through Spark SQL query analysis and optimizations. Also Lead the effort to run spark jobs on spot EKS instances for more cost savings.
Data InfrastructureApache SparkApache HudiData LakesAmazon EKS

Yahoo

5 roles

Sr Engineering Manager

Promoted

Aug 2022Dec 2024 · 2 yrs 4 mos

  • Leadership and Growth:
  • Stepped up to lead the Yahoo Knowledge (YK) Engineering Team, expanding it threefold across the US, India, and Canada. Mentored, and successfully advocated for team members' promotions.
  • Led the Consumer Research Org's Digital Transformation (DT) to public clouds, advocating for Infrastructure-as-Code with Terraform/Terragrunt and leveraging cloud-native solutions. Collaborated with the Central Big Data Team to identify issues, build a strategic roadmap, develop, and deliver custom Terraform modules and other solutions for AWS, tailored for Yahoo-specific needs. Adopted by several teams inside Yahoo within a year.
  • Implemented best practices in software engineering in the YK Team, including enhanced documentation, unit/integration testing, Agile development, and CI/CD with infrastructure-as-code (IAC). Achieved AWS proficiency, leading the team to be among the first at Yahoo to handle live production traffic on AWS, while also reducing latency by 30%. Set up several data pipelines in AWS utilizing MWAA, EMR, Spark, ECS/Fargate, and Lambda.
  • Successfully secured funding to set up a new Production Engineering (PE) team in India to support YK and other Consumer Research Org teams. Alongside setting processes and coached PEs to ensure best devOps practices.
  • Managed external staffing by working with agencies like Infosys and Toptal to recruit and train contractors, driving DT progress.
  • Product Achievements:
  • Successfully launched the Watch-To-Watch module in collaboration with the Web Search team, receiving company-wide acclaim. Partnered with Business Development and external vendors to introduce affiliate streaming links creating new revenue streams.
Apache AirflowData StructuresKubernetesTerragruntTeam ManagementTechnical Leadership+12

Principal Software Dev Engineer

Oct 2018Aug 2022 · 3 yrs 10 mos

  • Implemented the Image Pipelines for the Yahoo Knowledge graph end to end, which involved setting the Image Acquisition pipeline(including integration with Getty Images), image batch processing, thumbnail generation, ML feature extraction, indexing, ranking, and serving. This system powers most of the image experiences on Yahoo Web Search Knowledge Cards and other modules
  • Transitioned the Yahoo Knowledge API from RestFul API to a Graphql-based API which dramatically increased the adoption and usefulness of the API within the company. Also onboarded the Knowledge Graph on AWS Neptune to unlock the power of the cloud and horizontal scaling. Also enhanced the Graphql API by connecting to different Vespa indexes and federation to other Yahoo APIs. It now powers many systems with Yahoo including Knowledge Cards, Showtimes, X-ray, Related Entities
  • Helped build the Question Answering system over the Yahoo Knowledge Graph which provides direct answer modules in Yahoo Search Search to questions of Age, Kids, net-worth Etc.
  • Spearheaded the modernization of all serving systems including containerization, and moving to Kubernetes from Bare Metals/VMs. Added open telemetry integration, performance, and integration tests and was able to reduce hardware requirements by 70-90%
ReliabilityData StructuresKubernetesApache SparkDlibAmazon EC2+14

Tech Yahoo, Senior Software Development Engineer

Promoted

Oct 2015Sep 2018 · 2 yrs 11 mos

  • Rebuilt the Yahoo Knowledge Graph using Apache Spark to scale it from ~10 million entities to hundreds of Millions of entities with Billions of facts. Worked on several modules for Web mining, structured data extraction on the Web, large-scale knowledge graph construction and management (incl. schema alignment, entity reconciliation, anomaly detection), and entity recommendation. The data is used to power several knowledge Card and experiences in Yahoo Web Search and other properties
ReliabilityData StructuresApache SparkApache PigData InfrastructureData Mining+7

Tech Yahoo, Software Development Engineer

Mar 2014Sep 2015 · 1 yr 6 mos

Data StructuresPerformance Tuning

Tech Lead

Feb 2011Feb 2014 · 3 yrs

  • Worked at Yahoo's CPG( cloud and Platform Group), handling web-scale knowledge extraction using Hadoop,Map-Reduce, PIG, Oozie, Hbase etc.
  • Setup up several critical open source data pipelines including
  • 1. Wikipedia and Freebase: Included download and processing of monthly dump, and live wiki page changes by tracking IRC Channel updates
  • 2. Expanded Yahoo's Internal GEO warehouse by gathering and merging data from Wikipedia and OpenStreetMap, greatly increasing the coverage of data including Labels in different locales

Lime labs india pvt ltd

Senior Software Engineer

Jul 2007Feb 2011 · 3 yrs 7 mos · Noida, Uttar Pradesh, India · On-site

  • Was one of the founding engineers in the team that built Limedomains.com, a domain registration and web hosting company. Worked as a full stack engineer, using Ruby on Rails, JQuery, Mysql etc, following Agile Methodologies.
  • Worked on several modules including payment processing, Reports generation, Order Management, Management console, Shopping cart and offers etc
Ruby on RailsMySQLAgile MethodologiesJavaScriptjQuery

Education

Indian Institute of Management Bangalore

Executive General Management Program — General Management

Jan 2012Jan 2013

Indian Institute of Technology, Delhi

B.Tech + M.Tech — Computer Science and Engineering

Jan 2002Jan 2007

Delhi Public School - R. K. Puram

Science

Jan 2000Jan 2002

Indian Institute of Technology, Delhi

Bachelor of Technology - BTech — Computer Science

Stackforce found 100+ more professionals with Team Management & Software Management

Explore similar profiles based on matching skills and experience