Kishor Bachhav

Co-Founder

Pune, Maharashtra, India · 20 yrs 9 mos experience
Key Highlights

  • Expert in architecting distributed data systems.
  • Led teams to develop high-performance data solutions.
  • Proven track record in open-source contributions.
Stackforce AI infers this person is a Backend-heavy Fullstack architect specializing in Open Source and SaaS solutions.

Skills

Core Skills

Open-source Software · Java · Systems Design · Optimization · Computer Science · Project Management · Communication

Other Skills

GitHub · Code Review · Scalability · High Availability · Hadoop · Distributed Systems · GemFire · Big Data · Multithreading · Software Development · Cloud Computing · Databases · VMware · Agile Methodologies · SQL

About

  • Architected distributed databases (HTAP databases competing with Snowflake, AWS Athena, and AWS Redshift).
  • Architected a low-latency, real-time streaming application using Redpanda and Flink for AI quantitative analysis. AI-powered development.
  • Designed and developed Change Data Replication using Kafka.
  • Developed an on-premise-to-cloud and hybrid-cloud data migration tool.
  • Developed low-latency, highly available distributed key-value data storage (Apache Geode, as an alternative to Redis and DynamoDB).
  • Designed and developed a Kafka-like distributed data streaming queue.
  • Developed a performance benchmarking tool for query plan analysis and cloud cost analysis of distributed databases.
  • Performed performance due diligence on open-source distributed databases.
  • Led a team of hard-core engineers with strategic and tactical planning.
  • Worked as a consultant at Ericsson, Space and Time, and Hyperion AI to develop distributed, data-driven products.

Expertise in Products

  • Cloud: AWS, Azure
  • Big Data: Apache Spark, Presto, Hive, Cassandra, DeltaLake, Redis, PostgreSQL, Ignite, Geode
  • Integration: Kafka, Redpanda, Flink
  • Others: DeltaLake, OpenTelemetry, Docker, K8s, GitHub Copilot, Claude

Experience

Techarchgrid

Technical Consultant

Oct 2025 - Present · 6 mos

Hyperion AI

Consultant

Dec 2024 - Aug 2025 · 8 mos

  • Built a low-latency, real-time quantitative analysis pipeline for crypto trading.

Space and Time

Principal Software Engineer

May 2022 - Jun 2024 · 2 yrs 1 mo · Pune, Maharashtra, India

  • SxT's sub-second, ZK-proven, cryptographically guaranteed, real-time onchain/offchain data analysis makes smart contracts smarter.
  • It is a decentralized data warehouse accessible by smart contracts for real-time analysis, enabling data-driven contract execution.
  • Part of the data warehouse team that built an HTAP database on top of Apache Ignite using Apache Spark.
  • Led a team of developers to implement Change Data Replication using Kafka, providing multi-region high availability.
  • Implemented gRPC services for network management.
Systems Design · Optimization · Computer Science · Code Review · Communication · Project Management

Ampool, Inc.

Technology Leader

Sep 2019 - May 2022 · 2 yrs 8 mos · Pune Area, India

  • Achieved tight integration of Presto/Trino with an Apache Geode cluster to deliver the best performance for OLAP queries.
  • TPC-DS benchmarking.
  • Performed query performance analysis to pinpoint bottlenecks in join operators, scan operators, task deserialization, and metadata initialisation.
Optimization · Computer Science · Code Review · Communication · Project Management

Akupar Innovations

Founder

Apr 2018 - Sep 2019 · 1 yr 5 mos · Pune Area, India

  • Designed and implemented a Spring Boot REST application on the cloud (https://misujami.com).
  • MiSujami is a joint family tree platform that lets a person create a family mesh (genealogy).
  • Selected the technical stack.
  • Devised an intelligent numbering system for all close relations that accounts for multiple marriages and multiple relations between two close relatives.
  • Merged graphs to build a network of all relatives.
  • Designed the database with graph merging and traversal in mind.
  • Built a team of smart developers and delivered the MVP within 8 months.
Computer Science · Communication · Project Management

SnappyData

Sr. Member of Technical Staff

Feb 2016 - Mar 2018 · 2 yrs 1 mo · Pune

  • SnappyData is a Spark database in which Apache Geode is tightly integrated with Apache Spark.
  • Wrote a paper for the VLDB and SIGMOD conferences.
  • Tuned SnappyData for best performance with large volumes of data on large clusters.
  • Designed a test framework from scratch to automate the TPC-H and TPC-DS benchmarks.
  • Tuned the overall memory usage of the product.
  • Worked with multiple GC algorithms, such as G1 and CMS, for different workloads, using GC log analyser tools and JProfiler.
  • Responsible for identifying and reporting technical gaps relative to other HTAP products: MemSQL, Spark SQL, Impala/Kudu.
Optimization · Computer Science · Code Review · Communication

Pivotal Software, Inc.

Member of Technical Staff 3

Apr 2013 - Mar 2018 · 4 yrs 11 mos · Pune

  • Developed the following Geode features:
  • AsyncEventQueue, which streams events using a sliding window protocol.
  • Multisite WAN replication framework: auto-discovery of sites, sending data serially as well as in parallel.
  • Partition-to-node affinity: a feature that provides standard partitioning along with affinity of specific data to a specific member.
Computer Science · Code Review · Communication

VMware

Member of Technical Staff 2

Jun 2010 - Mar 2018 · 7 yrs 9 mos · Pune Area, India

  • GemStone Systems was acquired by VMware in June 2010. I continued working with the GemStone division of VMware in the R&D group, working on new features of GemFire.
  • Function Execution Service on p2p as well as client-server.
  • Implemented stats collection.
  • Partition Region Rebalancing.
Computer Science · Code Review · Communication

The Apache Software Foundation

Open Source Committer

Apr 2010 - Present · 16 yrs

  • 1. SnappyData: https://github.com/TIBCOSoftware/snappydata
  • TPC-H/TPC-DS Benchmarking Implementation:
  • https://github.com/search?q=repo%3ATIBCOSoftware%2Fsnappydata+kishor&type=code
  • SnappyData is a tight integration of Apache Spark with Apache Geode (a connector mode is also available).
  • I led the implementation of the TPC-H/TPC-DS benchmarks from inception, developing a modular framework that lets the benchmarks be rerun easily so that performance issues can be observed and analysed precisely.
  • 2. Apache Geode: https://github.com/apache/geode
  • WAN Replication and Queue Implementation:
  • https://github.com/apache/geode/search?q=AsyncEventQueue
  • https://github.com/apache/geode/search?q=GatewaySender&type=
  • Design, coding, testing. GatewaySenders replicate data from one site to another, geographically separate site, either in parallel (ParallelGatewaySender) or serially (SerialGatewaySender).
  • AsyncEventQueue is a Kafka-like mechanism for event stream handling.
  • WAN auto-discovery:
  • https://github.com/apache/geode/search?p=2&q=Discovery
  • Discovery of multiple WAN sites.
  • Design, coding, tests, and data structure choices were driven by me. Java concurrency utilities are used extensively.
  • Fixed Partitioned Region:
  • https://github.com/apache/geode/search?q=Fixed
  • Maintains the data's affinity to a particular node. Critical aspects of this feature, such as bucket calculations and bucket rebalancing, were designed, coded, and unit tested by me.
  • PR Single Hop:
  • https://github.com/apache/geode/search?q=*hop
  • Avoids extra hops within the server cluster to locate the required data.
  • Partition Region Rebalancing:
  • https://github.com/apache/geode/search?q=*rebalance
  • Whenever a node goes down or new nodes join the cluster, a rebalancing operation is triggered through various mechanisms, which rebalances the (colocated) data buckets across the cluster for optimized use.
Open-Source Software · GitHub · Code Review · Java

GemStone Systems

Member Of Technical Staff

Nov 2007 - Mar 2018 · 10 yrs 4 mos

  • I joined GemStone Systems in Nov 2007.
  • GemStone Systems is the leading enterprise software company exclusively focused on in-memory data management and distribution. With our patented technology, enterprises can gain ultra-fast and reliable access to operational data while dramatically enhancing data scalability and resiliency.
  • I mainly focused on the client-server and p2p architecture of partitioned data with high availability.
Computer Science · Communication

Perot Systems

Associate

Jul 2005 - Nov 2007 · 2 yrs 4 mos

  • I joined Perot in July 2005 as a fresher.
  • I got trained on Java/J2EE technology and also completed my SCJP.
  • I worked in the Travel vertical, where Lufthansa was our client. I was involved in developing the Shipment Database and Freight Way Bill for Lufthansa Cargo, and worked with the Lufthansa team in Frankfurt to integrate our modules with their existing modules. It was quite a challenging job and a very good experience.
Computer Science · Communication

Education

Pune Institute of Computer Technology

BE — Computer

Sep 2001 - Jun 2005

K. B. H. Vidyalaya Malegaon
