Ashutosh Kumar

Data Engineer

Bihar, India15 yrs 2 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in building data lakes and warehouses.
  • Proven track record in fraud detection systems.
  • Strong background in search engine development.
Stackforce AI infers this person is a Data Engineering expert with a focus on Fintech and SaaS solutions.

Contact

Skills

Core Skills

Data EngineeringData WarehousingData Pipeline DevelopmentSearch Engine DevelopmentCredit Scoring SystemsFraud Detection SystemsDatabase ManagementDatabase DevelopmentPerformance Tuning

Other Skills

AWSApache SparkCredit ScoringData LakeData PipelineData RankingData VaultData WarehouseDatabase DesignDatabasesDimensional modelElastic SearchElasticSearchFraud DetectionFuzzy Matching

About

Worked on - > Building data lake of india's largest mobile gaming company MPL. > Building search engine project in credit bureau company which provides credit score on the basis of search output. Developed multiple core systems of search engine project which includes -Data cleansing, Data standardization, Clustering, Data mining. > Implemented multiple algorithm for text based search which include fuzzy matching and implemented very well for name and address based matching. > Worked on building antifraud system to identify credit fraud of Indian Banks Technology Stack - AWS, Elastic Search, Python, Spark, Java, Oracle, Postgres SQL, My SQL Managed team and provided end to end solution of data driven project architecture.

Experience

Mobile premier league (mpl)

2 roles

Manager, Data Engineering

Promoted

Oct 2021Present · 4 yrs 5 mos

  • Working on Data Warehouse on top of data lake. Aim is to build warehouse with clean and categorised data, which will ultimately used by analysts or data scientists for building any model without doing any data preprocessing. Using Data Vault and Dimensional model for data storage.
AWSData WarehouseData VaultDimensional modelData EngineeringData Warehousing

Sr Data Engineer

Jan 2020Oct 2021 · 1 yr 9 mos

  • Building Data Lake here which gets used by Data Analytics and Data Scientist team. Responsible for building Data Pipeline through spark, which consumes data from multiple source and stores lake data on AWS S3. Operationized code in such a way that no need to create single line of code for building any new pipeline. Improved efficiency of data engineering team and helped analysts, data scientists to create any number of pipeline without any delay.
AWSSparkData LakeData PipelineData EngineeringData Pipeline Development

Gateway technolabs pvt. ltd. (gateway group of companies)

Data Engineer

May 2018Jan 2020 · 1 yr 8 mos · Pune, Maharashtra, India

  • Worked on ChemSel(Chemical Search Engine) from scratch. Stored chemical component data on Elastic Search. Applied Elastic Custom Search analysers on data to get best search result. Able to rank best matching on top. Search result is able to give 99% accuracy considering exact and fuzzy matching. Project was for one on client ElseVier Technology.
Elastic SearchFuzzy MatchingData RankingSearch Engine DevelopmentData Engineering

Crif

2 roles

Software Architect

Promoted

Apr 2017May 2018 · 1 yr 1 mo

  • Consumer record pull for building Credit Score, used by financial institution for credit landing to customer. Worked on Application match engine, which accepts id, name, address, phone, dob and search with Cunsumer database. Database contains all credit data for all financial institutions(Bank/MFI/SHG). Fuzzy matching fetchs data from entire database and provides record within second. Responsible for building exact match as well as fuzzy match for name and address match with 99% accuracy.
Fuzzy MatchingDatabase ManagementCredit ScoringCredit Scoring SystemsData Engineering

IT Specialist

Dec 2014Apr 2017 · 2 yrs 4 mos

  • Fraud detector project which is responsible to find application fraud while credit landing from any financial institute. Worked on building fraud detection rule, database design, fraud citation.
Fraud DetectionDatabase DesignFraud Detection SystemsDatabase Management

Augmentiq data sciences

Technology Lead

Apr 2013Nov 2014 · 1 yr 7 mos · Pune

  • Worked on building database of Legal based search engine(Scorcle). Project purpose was to store all legal documents on portal and provide a social connections between Legal professionals(Lawyer, Judge, Students, Professors). stored transactional data - used MongoDB, For Social Connections data - used Neo4J) and for legal content search data - used Elastic Search.
MongoDBNeo4jElastic SearchDatabase ManagementSearch Engine Development

Crif high mark

Software Engneer(Oracle)

Jan 2011Apr 2013 · 2 yrs 3 mos · Pune

  • Worked as an oracle Software developer on the key product of highmark. Grip command of Query optimization, perform tuning, work flow design of Database. Successfully achieved business goal to enable clustering based project "LTC" over night, which is able to search 1000+ inquiry/Minute, which is more than expectation. Worked on Candidate pool which can give all possible combination of search to address personal information of candidate.
Oracle PL/SQLQuery OptimizationDatabase DevelopmentPerformance Tuning

Education

Savitribai Phule Pune University

MCA — Computer Science

Jan 2008Jan 2011

Stackforce found 100+ more professionals with Data Engineering & Data Warehousing

Explore similar profiles based on matching skills and experience