Lead Data Engineer | Enterprise Tech & Streaming Systems

Designing data platforms that stay fast, reliable, and cost-efficient at production scale.

I build real-time, low-latency data systems on AWS with a strong bias for operational clarity, dependable delivery, and measurable business impact. Over 8+ years, I have worked across Flink, Kafka/MSK, Spark, Kinesis Data Analytics, DynamoDB, S3, and modern lakehouse patterns to ship business-critical pipelines that teams can trust.

Start a conversation LinkedIn GitHub

Experience 8+ years across streaming, analytics, enterprise, and platform engineering

Current role Lead Data Engineer - Enterprise Tech at Flutter Entertainment

Primary value Exactly-once reliability, runtime tuning, and cost optimization

Impact Snapshot

Flutter Entertainment | PokerStars | Experian | Model N | Analytics Platforms

Streaming-first Production systems built with Flink, Kafka/MSK, Spark, and Kinesis Data Analytics.

AWS depth DynamoDB, S3, Glue, Athena, EMR, Redshift, Iceberg, and scalable data delivery.

Business impact Focused on cost savings, lower tail latency, better recovery behavior, and reliability.

Engineering style Practical architecture, strong observability, and systems built for real operators.

What Teams Get

Streaming systems designed with checkpointing, backpressure awareness, and failure recovery in mind.
Data platforms that balance throughput, cost, maintainability, and downstream usability.
Clear delivery habits: operational visibility, reproducible pipelines, and production discipline.

At A Glance

Current role

Lead Data Engineer - Enterprise Tech Flutter Entertainment

Focus

Flink, Kafka/MSK, Spark, DynamoDB, and AWS-scale delivery

Open to

Lead data engineering roles in streaming, platform reliability, and cloud data systems

Working style

Practical architecture, observability-first delivery, and systems designed for real operators

Exactly-once

Reliability designed into the pipeline, not added later.

Strong grounding in checkpoint alignment, idempotent sinks, state tuning, and long-lived stream behavior.

Low latency

Fast paths for event-driven systems that need predictable runtime behavior.

Built to reduce hot-path IO, improve recovery time, and keep production workloads responsive at scale.

Cost discipline

Performance work tied directly to infrastructure efficiency.

Experience optimizing storage layout, state management, data models, and compute patterns for real savings.

Core Stack

Technologies used repeatedly across streaming, analytics, platform engineering, and delivery.

Java Python Go Rust Scala C++ SQL Apache Flink Apache Spark Kafka / MSK Kinesis Data Analytics Apache Iceberg Delta Lake Parquet / ORC DynamoDB PostgreSQL Snowflake Databricks Airflow Dremio AWS Kubernetes Docker

About

Lead data engineering grounded in production behavior, scale, and operational confidence.

My work sits at the intersection of real-time systems, AWS data platforms, and practical engineering leadership. I care about architecture that performs well in production and stays understandable for the teams operating it.

What I Build

Streaming and batch platforms for critical data flows, with a focus on reliability, observability, storage efficiency, and low-latency execution.

How I Work

I favor simple, high-leverage architecture decisions, careful tuning, and delivery patterns that help teams move faster without sacrificing confidence in production.

Why It Matters

The strongest systems are not just scalable on paper. They recover predictably, stay observable under load, and keep costs under control as usage grows.

Reliable data platforms are built by sweating the runtime details that others skip.

Experience

Experience building real-time and analytics platforms across product and enterprise domains.

The through-line across these roles is consistent: design dependable pipelines, improve performance and cost behavior, and ship systems that downstream teams can operate with confidence.

Jun 2026 - Present

Lead Data Engineer - Enterprise Tech

Flutter Entertainment

Moved into Flutter Entertainment's Enterprise Tech organization to support broader company-wide data and platform initiatives.
Bringing production discipline from Streaming AI into enterprise-facing data workflows with a focus on reliability, clarity, and scalable delivery.

Enterprise Tech Data Platforms AWS Platform Engineering Streaming Systems

Jan 2026 - Jun 2026

Lead Big Data Engineer, Streaming AI

PokerStars

Promoted to lead the Streaming AI data engineering track, guiding architecture and delivery for real-time workloads on AWS.
Continued optimization of hot-path IO, storage layout, and state handling to improve runtime behavior and infrastructure efficiency.
Strengthened production readiness through exactly-once sinks, checkpoint strategy, and recovery tuning across Flink-based services.

Java Python Go Flink Spark Kafka / MSK Kinesis Data Analytics DynamoDB Apache Iceberg S3 AWS Airflow

Aug 2023 - Jan 2026

Senior Big Data Engineer, Streaming AI

PokerStars

Designed real-time processing on AWS using Flink on Kinesis Data Analytics, MSK, DynamoDB, and S3 for high-value production workloads.
Reduced hot-path IO and delivered major annual cost savings through DynamoDB modeling, payload compaction, and S3 layout tuning.
Built exactly-once sinks with checkpoint alignment and idempotent upserts while tuning RocksDB state and JVM behavior for recovery and latency.

Java Python Go Flink Spark Kafka / MSK Kinesis Data Analytics DynamoDB Apache Iceberg S3 AWS Airflow

Mar 2023 - Aug 2023

Senior Data Engineer

Experian PLC

Delivered regulated pipelines with secure ingestion, lineage, and data quality gates to improve analytics readiness and operational trust.
Standardized batch and streaming jobs with reproducible configuration, deployment discipline, and monitoring that reduced delivery friction.

Java Scala Python Go Spark Flink Kafka PostgreSQL Kubernetes Prometheus Apache Iceberg Grafana Data Quality Airflow

Jun 2021 - Mar 2023

Member of Technical Staff 3

Model N

Built event-driven analytics with Kafka, Spark, and Delta Lake and exposed downstream access through Dremio and REST services.
Improved query performance with partitioning, Z-ordering, predicate pushdown, and compaction to lower compute and storage cost.

Spark Flink Java Python Scala Kafka Delta Lake Dremio Apache Iceberg REST APIs Spring Boot Cost Optimization Airflow Kubernetes

Aug 2020 - Jun 2021

Software Engineer

Carelon Global Solutions

Integrated Medicare and Medicaid datasets with SQL and distributed data processing to improve revenue capture and reporting readiness.
Supported analytics workflows with reliable pipeline behavior across Spark, Flink, Kafka, PostgreSQL, and AWS services.

Spark Flink Java Python SQL Kafka PostgreSQL AWS Apache Iceberg Delta Lake Airflow

Jul 2018 - Aug 2020

Associate Software Engineer

Legato Health Technologies

Delivered optimized ETL on Teradata and Informatica while standardizing SLAs, validations, and delivery quality for healthcare data workflows.
Built a strong foundation in enterprise data movement, operational rigor, and quality-minded delivery.

Spark Teradata Python Java Kafka Informatica Apache Iceberg ETL Delta Lake Airflow

Expertise

Built around streaming systems, AWS data platforms, and operational reliability.

I bring hands-on depth across languages, data frameworks, cloud services, and platform design patterns, with a practical bias toward runtime behavior and production supportability.

Languages

Java, Python, Go, Rust, Scala, C++, SQL, and shell used with a practical bias toward maintainability and runtime performance.

Streaming and Batch

Apache Flink, Kafka/MSK, Spark, Kinesis Data Analytics, and Airflow across both event-driven and analytical workloads.

Storage and Infrastructure

AWS-centric delivery with DynamoDB, S3, EMR, Glue, Athena, Redshift, Kubernetes, and Docker for production-grade systems.

Specialties

Exactly-once processing, checkpointing, schema evolution, serialization, observability, and performance-plus-cost optimization.

Architecture Focus

Designing pipelines that stay understandable as they scale in complexity and traffic.
Keeping throughput, resilience, and cost efficiency aligned instead of trading one against another blindly.
Making operational behavior visible through stronger monitoring, lineage, and debugging hooks.

Delivery Strengths

Reproducible job configuration, disciplined deployment patterns, and production-minded defaults.
Hands-on tuning of storage layout, state handling, JVM/runtime behavior, and data models.
Clear collaboration with downstream analytics, platform, and product teams.

Where I Add Leverage

Greenfield streaming architecture and modernization of high-volume legacy pipelines.
Platform hardening for reliability, observability, and easier incident response.
Performance optimization efforts that translate directly into lower cloud spend.

Projects

Selected engineering work that reflects the systems I enjoy building most.

These projects mirror my interest in stream processing internals, data platform design, and practical developer-facing tooling for real-time systems.

Selected Projects

DataWizz

Local-first lakehouse and analytics workspace inspired by Databricks, Snowflake, Airflow, and Superset, with file ingestion, SQL exploration, Delta publishing, orchestration, and dashboards.

Lakehouse Platform Delta Lake BI Workspace

View repository

GoXStream

Flink-inspired stream processor in Go with operator graphs, checkpoints, and connectors for Kafka, files, and databases.

Go Streaming Engine Checkpointing

View repository

FlowCore

Rust-powered real-time stream processing engine inspired by Apache Flink, featuring event-time processing, windows, watermarks, late-event handling, checkpointing, and a live dashboard.

Rust Event Time Real-Time Dashboard

View repository

Astra Sentinel

Rust desktop malware triage application for fast local file inspection with hash matching, optional YARA scanning, recursive directory analysis, and JSON report export.

Rust Security Tooling YARA

View repository

Education

Jawaharlal Nehru Technological University, Hyderabad

B.Tech in Electrical Engineering

GPA 4.0/4.0 (2014 - 2018)

Professional Summary

Lead Data Engineer with hands-on depth in streaming systems, AWS-native pipelines, distributed runtime tuning, and production-grade observability.

Real-time streaming Low-latency systems AWS data platforms Cost optimization

Contact

Designing data platforms that stay fast, reliable, and cost-efficient at production scale.

Reliability designed into the pipeline, not added later.

Fast paths for event-driven systems that need predictable runtime behavior.

Performance work tied directly to infrastructure efficiency.

Lead data engineering grounded in production behavior, scale, and operational confidence.

Experience building real-time and analytics platforms across product and enterprise domains.

Lead Data Engineer - Enterprise Tech

Lead Big Data Engineer, Streaming AI

Senior Big Data Engineer, Streaming AI

Senior Data Engineer

Member of Technical Staff 3

Software Engineer

Associate Software Engineer

Built around streaming systems, AWS data platforms, and operational reliability.

Languages

Streaming and Batch

Storage and Infrastructure

Specialties

Selected engineering work that reflects the systems I enjoy building most.

DataWizz

GoXStream

FlowCore

Astra Sentinel

Jawaharlal Nehru Technological University, Hyderabad

Available for Lead data engineering roles focused on streaming systems, Platform reliability, and AWS-scale infrastructure.