Haris Bin Saif

Software Engineer

Lahore District, Punjab, Pakistan3 yrs 1 mo experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in architecting compliant data infrastructures.
  • Proven track record in real-time data pipelines.
  • Skilled in transforming messy data into business-ready systems.
Stackforce AI infers this person is a Data Engineering expert in SaaS with a focus on compliance and real-time data solutions.

Contact

Skills

Core Skills

Data EngineeringData ArchitectureCloud Computing

Other Skills

Apollo GraphQLClickHouseKafkaDebeziumSpark Structured StreamingFastAPIAirflowAmazon Web Services (AWS)Azure DatabricksPostgreSQLPythonPower BIAirbyteAWSAPI Gateway

About

I’m a tech-agnostic data engineer who designs secure, compliant data and AI infrastructure. I specialize in real-time pipelines, RAG-based AI agents, and cloud-native architectures for industries where correctness and compliance matter, My work focuses on clean architecture, real-time and batch pipelines, and turning messy data into reliable, business-ready systems.

Experience

Soloinsight (cloudgate platform)

Principal Data Engineer

Oct 2024Jan 2026 · 1 yr 3 mos · On-site

  • Soloinsight is a compliance-first physical identity & access management (PIAM) platform serving global enterprises, including 14+ Fortune 100 companies. I architected and led the development of their next-generation real-time analytics and AI infrastructure.
  • Key Contributions:
  • Architected Soloinsight’s real-time analytics platform, leveraging Kafka, Debezium, ksqlDB, Spark Structured Streaming, and ClickHouse to deliver sub-second insights across enterprise clients.
  • Built a FastAPI-based Kafka Manager to simplify connector, consumer, and operational workflows for engineering teams.
  • Implemented Airflow-based ETL orchestration and delivered GraphQL APIs for internal and client-facing applications.
  • Developed a client data delivery solution, ensuring secure, reliable, and compliant data distribution across enterprise tenants.
  • Currently desiging the POC for secure, compliant AI Agents and RAG pipelines aligned with HIPAA, GDPR, and SOC 2 requirements.
Apollo GraphQLClickHouseKafkaDebeziumSpark Structured StreamingFastAPI+3

Gazelle is now part of lightcast

Data Engineer

Dec 2022Aug 2024 · 1 yr 8 mos · Remote

  • Engineered and optimized data workflows using PySpark and Databricks to power a platform identifying high-growth companies.
  • Diagnosed and resolved critical issues in data pipelines, enhancing platform stability with Python and PostgreSQL.
  • Improved data operations by implementing innovative solutions, ensuring scalability and future-proofing infrastructure.
  • Collaborated with stakeholders to align data processes with organizational objectives, contributing to strategic platform enhancements.
DatabricksPySparkPostgreSQLPythonData Engineering

Modus create

Data Engineer

Jul 2022Oct 2024 · 2 yrs 3 mos · Remote · Remote

  • Designed and deployed flexible data architectures using AWS services (API Gateway, Lambda, S3, RDS, SQS), enabling seamless data processing and reporting.
  • Transformed legacy systems by reengineering data processes with PostgreSQL and Python, significantly improving efficiency and reliability.
  • Developed scalable migration frameworks to transition enterprise systems to cloud platforms, ensuring data integrity and minimal downtime.
  • Created internal analytics platforms using Power BI, Airbyte, and custom Python scripts to empower cross-functional teams with actionable insights.
  • Simplified and optimized backend systems for modern applications, integrating LLMs and optimizing performance with Flask and React.js.
  • Spearheaded automation initiatives using Selenium and cloud-native technologies, streamlining workflows to increase operational efficiency.
  • Helped the company acquire a key client by providing solution design during sales calls, then worked as a Databricks data engineer to write complex data transformations and build efficient ETL pipelines.
Amazon Web Services (AWS)Azure DatabricksPostgreSQLPythonPower BIAirbyte+2

Afiniti

Associate Data Engineer

Jan 2021Jan 2022 · 1 yr · Lahore, Punjab, Pakistan · Remote

  • Independently managed a high-profile client, delivering tailored data solutions with Talend, PostgreSQL, and MySQL to meet complex business needs.
  • Designed, optimized, and maintained data pipelines, ensuring seamless data flow and operational efficiency.
  • Automated workflows with custom scripts, reducing manual intervention and enhancing overall performance.
  • Enhanced query performance by implementing advanced optimization techniques, improving processing times and data accessibility.
  • Collaborated with Data Analysts and Data Scientists to deliver actionable insights and support data-driven decision-making.
TalendPostgreSQLMySQLPythonSQLData Engineering

Devfactori llc.

Software Quality Assurance Engineer

Oct 2019Aug 2020 · 10 mos · Lahore, Punjab, Pakistan

  • Automated API testing and documentation using Postman and Swagger, ensuring consistency and reliability across services.
  • Developed and maintained UI automation frameworks with Cypress.io and JavaScript, reducing manual testing efforts.
  • Managed CI/CD pipelines on Azure DevOps, ensuring seamless integration and deployment processes.
  • Conducted manual testing for web and mobile applications, ensuring high-quality user experiences.
  • Worked collaboratively with cross-functional teams to enhance QA processes and align strategies with project goals.
PythonSQL

Education

National University of Computer and Emerging Sciences

Bachelor's degree — Computer Science

Jan 2015Jan 2019

Punjab College

Intermediate — Pre-Engineering

Jan 2013Jan 2015

LDA

Matriculation — Science

Jan 2011Jan 2013

Stackforce found 100+ more professionals with Data Engineering & Data Architecture

Explore similar profiles based on matching skills and experience