Roman Grebennikov

CEO

Berlin, Berlin, Germany16 yrs 8 mos experience
Highly Stable

Key Highlights

  • Expert in ML-driven search and recommendations.
  • Led significant growth in e-commerce search systems.
  • Strong advocate for team mentorship and open-source contributions.
Stackforce AI infers this person is a Machine Learning and Big Data expert in the E-commerce and Fintech sectors.

Contact

Skills

Core Skills

Machine LearningSearchBig Data

Other Skills

ElasticsearchNLPdense retrievalembeddingsLLMsSBERTPyTorchS3 storageauto-scalingScalaJavaKubernetesKafkaKinesisPulsar

About

A software engineer with a track record of designing, building and leading large-scale data processing and ML-heavy applications. A pragmatic fan of functional programming, search, recommendations, Learn-to-Rank models and performance engineering.

Experience

16 yrs 8 mos
Total Experience
3 yrs 3 mos
Average Tenure
2 yrs 4 mos
Current Experience

Nixiesearch

Maintainer

Jan 2024Present · 2 yrs 4 mos · Berlin, Germany

  • Building a better Elasticsearch with S3 storage, auto-scaling and much simpler DevEx.
ElasticsearchS3 storageauto-scalingSearch

Delivery hero

Principal Engineer

Dec 2022Present · 3 yrs 5 mos · Berlin, Germany · Hybrid

  • Making search go brr with ML:
  • Making food search work better: Elasticsearch relevance, NLP, dense retrieval with embeddings, LLMs, fine-tuning with SBERT/PyTorch. Once built an LLM which can generate a dad joke punchline on any prompt.
  • Convincing team and management to do the right thing: writing RFC/PRD/OKR/WTF docs after figuring out what the actual customer problem we are facing, and how can it be solved step-by-step. Hands-on development with MVPs, prototypes, and the most challenging parts of the system.
  • Mentorship and team growth: organizing team/company hackathons, architecture review, paper reading club, and having 1-1 && pair-prog sessions about technical and ML excellence. Convincing colleagues to speak at conferences/meetups.
  • Moving industry forward: open-source (metarank, lightgbm4j, flink-scala), speaking on tech conferences/meetups, flame wars on linkedin/hackernews/reddit/slack. Editor-in-chief of a #memes channel on a company Slack.
ElasticsearchNLPdense retrievalembeddingsLLMsSBERT+3

Metarank labs

Maintainer

Jun 2022Present · 3 yrs 11 mos · Berlin, Germany

  • Pet project which has grown into a full-featured open-source Learn-to-Rank service.
  • Designed and implemented an open-source LTR engine with 2k+ github stars [Scala/Java, Kubernetes, Kafka/Kinesis/Pulsar, http4s, FS2, Cats-Effect].
  • Technical marketing: blog posts (top-HN #2), community management (300 members in slack), integration guides for open-source search engines.
  • Solution engineering for early adopters, conference speaking (MICES, Haystack, FlinkForward)
ScalaJavaKubernetesKafkaKinesisPulsar+1

Findify

ML Team Lead

Mar 2016May 2022 · 6 yrs 2 mos · Berlin Area, Germany

  • Leading a small team building smart search and recommendations for 2K+ merchants.
  • #2 employee, designed a ML ranking system helping company grow from 30 to 2000 e-commerce stores [LambdaMART, XGBoost, Apache Flink, Cassandra, Redis].
  • Migrated the offline LTR backend to online LTR, improving CTR +4% and CR +6% across the whole merchant base [Apache Flink, Cassandra].
  • Led the migration from ElasticSearch to a custom Lucene-based cloud-native & multi-tenant search backend, reducing AWS costs 2x [Scala, CE3, FS2, Kubernetes, Cassandra, Apache Pulsar]
  • Built an internal recommendation system for 20M products [MF/BPR, Java, HNSW/Annoy]
  • Pitched a pilot project to migrate to k8s [v1.6, 2017], and led the complete company migration into it afterwards [v1.8, 2018]. Introduced GitOps/IaaS approach with Terraform and ArgoCD.
  • Led a small team working on a real-time search analytics backend: sub-second dashboard update latency for 2k merchants and 80M MAU [Apache Flink, Clickhouse]
  • Migrated high-thoughput (50gb/day) logging stack from cloud SaaS solution to Grafana Loki, cutting costs 4X.
LambdaMARTXGBoostApache FlinkCassandraRedisMachine Learning+1

Sociohub

CTO, Big Data Engineer

May 2012Feb 2016 · 3 yrs 9 mos · Voronezh Region, Russian Federation

  • Designed and implemented a major part of the analytical pipeline from the ground up having limited resources:
  • Distributed large-scale web scraping (C++, Scala+akka, ~1.5b requests/month/server)
  • Data processing: creating automated data extraction system based on Spark/Cassandra and Hadoop/HBase.
  • Solving partial duplicate detection tasks in large databases.
  • Creating advanced client credit scoring models for banking and finance (Weka, R, Rapidminer).
  • Building fault-tolerant high-performance client API (Scala+spray, Cassandra, Elasticsearch)
  • Keywords: Hadoop, HBase, Cassandra, ElasticSearch, Solr, Puppet, Zookeeper, Giraph, Java, Scala, Jenkins, Maven, Zabbix, RapidMiner.
C++ScalaSparkCassandraHadoopHBase+1

Numbers ltd

BI Developer

Nov 2010May 2012 · 1 yr 6 mos

  • Datamining, OLAP: creating client segmentation schemes, client scoring.
  • Business research and modeling: building recommender systems.
  • Social networks: advanced statistical research, building automated web crawlers/spiders, deploying large NoSQL databases.

Lord finance

Researcher/developer

Jan 2009Nov 2010 · 1 yr 10 mos

  • Algorithmic trading research in option pricing. Creating and testing financial models in Mathematica & C++.
  • High-volume trading robot research.
  • Design and prototyping of trading robots.

Education

Voronezh State University

PhD — Computer Science

Jan 2007Jan 2011

Voronezh State University

Specialist — Computer Science

Jan 2002Jan 2007

Stackforce found 100+ more professionals with Machine Learning & Search

Explore similar profiles based on matching skills and experience