V

Vikas Thakur

Senior Software Engineer

Bengaluru, Karnataka, India10 yrs 8 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Over 11 years of experience in data platforms.
  • Expert in distributed systems and big data analytics.
  • Proven leadership in complex data migration projects.
Stackforce AI infers this person is a Senior Software Engineer specializing in SaaS and Data Engineering.

Contact

Skills

Core Skills

Data Privacy ComplianceDistributed SystemsData MigrationData Pipeline ManagementBig Data AnalyticsMachine LearningData ManagementApplication DevelopmentWeb Development

Other Skills

AWSAirflowAlgorithmsAngular JSAngularJSApache Spark StreamingArtificial IntelligenceAzkabanBig DataBigQueryCC++CDHCore JavaCosmos

About

Highly motivated and results-oriented Senior Software Engineer - Data Platform with 11+ years of experience in distributed systems, big data analytics, data platforms, cloud platforms (GCP, AWS), and deploying machine learning models. Proven ability to lead complex data migration projects, design scalable solutions, and ensure data privacy compliance. Seeking to leverage expertise in building robust data platforms and micro-services to contribute to innovative technological advancements. Skills Expertise Area : Distributed System Design and Architecture, Building Platforms, Microservices, Event-Driven Systems, GenAI Languages : Java, Scala, Python Distributed Systems : Hadoop, Spark, Hive, Kafka, HUDI, ELK, Presto, Flink, Data-lake, Delta-lake, Dimensional Modeling, lake-house GCP Services : BigQuery, Dataproc, Serverless, DPaaS, GCP, GCS, Vertex AI AWS Services : EC2, S3, EMR, Lambda, AWS Glue, Redshift Databases : Cassandra, Cosmos DB, MySQL, HBase, Elastic Search Dashboard / Workflow : Looker, Grafana, Tableau, Kibana, Airflow, Oozie, Azkaban Web Technology: Django, Spring Boot, Rest services, NGINX Other Tools / Concepts : DSA, System Design, ML(MlLib, SciKit), OOPS, HDP, Ambari, Git, Jenkins, Jira, Confluence, Docker Soft Skills : Problem-Solving, Communication, Teamwork Expertise Area: - Big Data Spark Apache Kafka and Streaming Platform Google Cloud AWS Cloud Design and Architecture Low Level Design High Level Design Data Pipeline Object Oriented Programming(OOP) Database modeling and designing (NoSql, Sql) Platform Development Data Structures and Algorithms

Experience

10 yrs 8 mos
Total Experience
2 yrs 8 mos
Average Tenure
5 yrs 4 mos
Current Experience

Walmart global tech india

Senior Software Engineer

Feb 2021Present · 5 yrs 4 mos · Bengaluru, Karnataka, India

  • In my current position, my primary responsibility is to ensure Walmart’s strict compliance with adiverse spectrum of privacy regulations
  • that encompass critical aspects like the Right to Access, Right to Opt-Out, and Right to Delete. These responsibilities extend to
  • the meticulous adherence to major legal frameworks such as GDPR, CCPA, CPRA, and VCDPA. A significant part of my role involves
  • leading a transformative initiative wherein we are introducing a novel tool, meticulously crafted from the ground up.
  • The objective of this tool is to guarantee the har- monious alignment of all data lakes, characterized by their massive
  • petabyte-scale data volumes, with the aforementioned legal and regulatory standards.
  • Tools/Technologies - Spark, BigQuery, Kafka, Java, Scala, Cosmos, Hive, GCP, DPaaS, Serverless, Airflow and Looker
SparkBigQueryKafkaJavaScalaCosmos+8

Wynk limited

2 roles

Principal Engineer - Data Platform

Promoted

Jul 2020Feb 2021 · 7 mos

  • led a team of five in migrating the data pipeline from Hortonworks infrastructure to managed AWS Glue. This project was crucial in enhancing the efficiency and reliability of our data management. I also managed daily requirements for different analytical and reporting purposes, which allowed me to demonstrate my leadership skills and ability to manage complex migration projects.
  • Tools/Technologies - Spark, AWS, Hive, HDP(Ambari), Java, Scala, Spark MlLib, Redshift, Kafka, S3, EMR, Azkaban and Jenkins
SparkAWSHiveHDP(Ambari)JavaScala+9

Senior Data Engineer

Aug 2018Jul 2020 · 1 yr 11 mos

  • At Wynk, I developed and managed the Data Flow Pipeline for Wynk Music and Airtel TV, which was designed to perform Big Data analytics. This project involved handling enormous volumes of data daily, reaching up to 3+TB/day, with a Data Warehouse exceeding 1+PB in size. Additionally, the system was built to accommodate a billion music streams every month. A significant part of my role was creating various revenue reports for our Content Partners, which was a critical part of our business model.
  • I also designed and implemented the Overture, a Machine Learning Pipeline, from scratch. This project ranged from the proof of concept to the final product. My responsibilities included developing a spark-based ML pipeline, model development, training, and writing a scalable module. The module was designed to understand more than 60 million active users' intent by crafting features from their implicit behaviour on the app, showcasing my skills in Machine Learning and User Behavior Analysis.
  • Tools/Technologies - Spark, AWS, Hive, HDP(Ambari), Java, Scala, Spark MlLib, Redshift, Kafka, S3, EMR, Azkaban and Jenkins
SparkAWSHiveHDP(Ambari)JavaScala+9

Mobileum

Software Developer

Sep 2016Jul 2018 · 1 yr 10 mos · Gurgaon, India

  • At my previous position, I developed and managed a Data Flow Pipeline that handled significant volumes of data, approximately 1TB/day, and a Data Warehouse (DWH) that had capacity exceeding 1PB. This project involved dealing with extensive amounts of information and required exceptional skills in data management and analysis.
  • Another key project I worked on was the development of an Invoice Generation and Reconciliation system. This system was designed to automatically generate invoices for interconnect partners, allowing for the recovery of costs due to billing discrepancies and duration anomalies. I delivered this scalable application from proof of concept to final product, demonstrating my ability to manage a project from its inception to completion.
  • Additionally, I worked on the Margin Simulation and Analysis application and the Generic SQL Framework. The Margin Simulation and Analysis application provided operators with the ability to perform wholesale-retail profitability analysis and optimisation. On the other hand, the Generic SQL Framework was designed to process both old and new data streams and automate various batch jobs. The objective of this framework was to facilitate the processing of new client data. Both of these projects were delivered from scratch, from proof of concept to final product, showcasing my ability to handle diverse projects successfully.
  • Tools/Technologies - Spark, HBase, Hive, CDH, Java, Scala, Hadoop, Kafka, HDFS, Presto, Oozie and MySQL
SparkHBaseHiveCDHJavaScala+8

Ht media ltd

Software Developer

Aug 2015Aug 2016 · 1 yr · Gurgaon, India

  • At HT, I had the opportunity to work on a key project, the Micro-site Generator. This project involved creating a multi-page site to showcase various aspects of the company such as products, work culture, job openings, and contacts. The site was customizable and could be used for branding and recruitment campaigns. I was part of a two-member team responsible for delivering this application from scratch. My primary responsibilities included schema design, model development, writing APIs, and ensuring backward compatibility with the existing system.
  • Another significant project I worked on was the Shine Recruiter, a platform where recruiters could post jobs online. The unique feature of this platform was its 2-way matching engine and access to the fastest growing candidate database. My role in this project was to develop restful APIs, integrate new features like OTP, manage job dashboards, admin for managing sales and packages, and bug fixing. Both these projects have equipped me with valuable skills and experience in application development and project management.
  • Tools/Technologies: Django, MySQL, Angular JS, Mongo Engine, Django Rest Framework
DjangoMySQLAngular JSMongo EngineDjango Rest FrameworkWeb Development

Nextag inc.

Software Engineer

Jan 2014Jul 2015 · 1 yr 6 mos · Gurgaon, India

  • Involved in the end to end development of Merchant Dashboard, for managing new or already registered merchants. Using registration module, upload product module and billing module merchants are able to provide product feed and bid for CPC (cost per click) respectively.
  • Crawling reviews from different websites, we use these reviews to calculate wize-rating of product and also generate the points which is best suited feature for a particular product. Python, Numpy, Sklearn, Pandas, UrlLib, NLP.
  • Internship 5 month (Jan2014-May2014)
  • Developed a tool WizeBottomline, used by internal member of company for writing expert reviews for product, using live site snapshot of product and user reviews provided by the tool.Also developed Review crawler for 3rd Party websites, in turn used to calculate wize-rating of product and pinpoint the best features of the product(Python, sklearn, Numpy, UrlLib, Machine Learning).
PythonNumpySklearnPandasUrlLibNLP

Education

Department of Computer Science - University of Delhi

Master of Computer Applications (MCA)

Jan 2011Jan 2014

Stackforce found 100+ more professionals with Data Privacy Compliance & Distributed Systems

Explore similar profiles based on matching skills and experience