Henry Joseph

DevOps Engineer

United States6 yrs 9 mos experience
Highly StableAI ML Practitioner

Key Highlights

  • Expert in building AI-ready data platforms.
  • Proficient in cloud-based analytics solutions.
  • Strong focus on data governance and reliability.
Stackforce AI infers this person is a Data Engineering expert with a strong focus on AI and cloud solutions in the Fintech sector.

Contact

Skills

Core Skills

Data EngineeringAi/mlEtl/elt

Other Skills

DatabricksSnowflakeAzureETLCloud SolutionsSQLWindsurfOracle DatabaseJavaSoftware Development Engineer in TestQA AutomationHadoopScalaApache SparkPalantir

About

Lead Data Engineer with around 10 years of experience designing scalable data platforms, building high-performance data pipelines, and delivering cloud-based analytics solutions across financial and enterprise environments. Experienced in Databricks, Snowflake, and Azure, with a strong focus on data governance, reliability, and cost-efficient engineering. Expertise in developing end-to-end ETL/ELT pipelines using Python and SQL, with strong experience across Azure, AWS, and GCP. Skilled in building data solutions using Databricks, BigQuery, Airflow, Azure Data Factory, and dbt to support large-scale analytics, reporting, and AI/ML workflows. Experience building GenAI-ready data platforms supporting RAG pipelines, vector search, embeddings, and AI-driven analytics workflows and leveraged LLMs to enhance data engineering workflows, including schema mapping and automated documentation, and designed AI agent-based workflows to automate data validation and transformation logic. Experienced in modernizing legacy systems into cloud-native architectures, optimizing large-scale data processing, and implementing scalable data lake and warehouse solutions. Strong focus on data quality, performance optimization, and cost-efficient pipeline design. Building reliable healthcare data pipelines that turn complex data into actionable insights. Hands-on experience with distributed data processing using Spark and PySpark, along with exposure to streaming architectures (Kafka) and AI-driven pipelines, including MLflow and LLM-based data workflows and hands-on exposure to LangChain, OpenAI embeddings, AI agent orchestration, and intelligent automation for enterprise data engineering use cases. Adept at building reliable, production-grade systems using CI/CD, Terraform, and containerization. Technologies: Python • SQL • Databricks • BigQuery • Snowflake • Airflow • Azure Data Factory • dbt • PySpark • Oracle • Windsurf • AWS • Azure • GCP • Apache Flink • Java • Kafka • Palantir • MLflow • LLM Pipelines • ETL/ELT • Power BI • Data Modeling • CI/CD • Terraform • Docker • GenAI • Delta Lake • Workday • LangChain • RAG • Vector Databases • OpenAI APIs • Kubernetes • Microsoft Fabric • Azure Synapse • Terraform

Experience

6 yrs 9 mos
Total Experience
3 yrs 4 mos
Average Tenure
--
Current Experience

State street

Lead Data Engineer | AI/ML

Jan 2024Present · 2 yrs 5 mos · Austin, Texas, United States · Hybrid

DatabricksSnowflakeAzureAI/MLData Engineering

Apple

Senior Data Engineer/ETL Developer

May 2019Apr 2023 · 3 yrs 11 mos · On-site

ETLData EngineeringCloud SolutionsETL/ELT

Dell technologies

Data Engineer/SQL Developer

Jun 2016Apr 2019 · 2 yrs 10 mos · On-site

SQLData Engineering

Education

Webster University

Master of Science - MS — Information Technology

Christ University, Bangalore

Bachelor of Technology - BTech — Computer Science and Engineering

Stackforce found 100+ more professionals with Data Engineering & Ai/ml

Explore similar profiles based on matching skills and experience

Henry Joseph - DevOps Engineer | Stackforce