Data Engineering · Cloud Architecture
Portrait of Syed Moiz

I build the data platforms behind modern clouds

I'm Syed Moiz — a Data Engineer & Cloud Architect designing reliable, scalable pipelines and lakehouses across Azure, AWS and GCP.

40+
Pipelines shipped
3
Cloud platforms
8
Certifications
3+
Years experience
Selected Work

Data platforms built to scale

A selection of production data engineering and cloud architecture projects spanning streaming, lakehouse, and warehouse workloads.

Realtime Fraud Detection Platform architecture
2026

Realtime Fraud Detection Platform

Sub-second fraud scoring across 2M events/day using Flink and a feature store.

FlinkKafkaFeastBigQuery
180ms p99 latency
Architecture Gallery

Diagrams from the field

Reference architectures distilled from real deployments. Click any diagram to view it in detail.

Skills Network

A connected toolset

The technologies I reach for, grouped by where they live in the data lifecycle. Hover a node to trace its connections.

IngestionTransformationStorageOrchestrationServing
Apache SparkdbtKafkaAirflowPythonAzure SQLDelta LakeSnowflakeBigQueryRedshiftAmazon S3Amazon EMRTerraformDockerKubernetesFlinkPower BILooker
Certification Wall

Credentialed across every cloud

Validated expertise spanning Azure, AWS, GCP and the modern data stack.

2025

Azure Solutions Architect Expert

Microsoft Azure

AZ-305
2024

Azure Data Engineer Associate

Microsoft Azure

DP-203
2024

AWS Data Engineer Associate

Amazon Web Services

DEA-C01
2023

AWS Solutions Architect Associate

Amazon Web Services

SAA-C03
2024

Professional Data Engineer

Google Cloud

GCP-PDE
2023

Professional Cloud Architect

Google Cloud

GCP-PCA
2024

Databricks Data Engineer Pro

Microsoft Azure

DB-DEP
2023

SnowPro Core

Amazon Web Services

SNOW-CORE
Writing

Notes on data engineering

Field notes on streaming, lakehouse architecture, and running data platforms across Azure, AWS, and GCP.

StreamingKafkaSpark

Designing Idempotent Streaming Pipelines

Exactly-once is a lie you tell your stakeholders. Here is how to build effectively-once pipelines with checkpoints, watermarks, and deterministic keys.

Sep 20258 min
LakehouseDatabricksCost

Lakehouse vs. Warehouse: A Cost Model

A practical framework for choosing between Delta Lakehouse and a cloud warehouse based on query patterns, concurrency, and storage economics.

Jul 202511 min
dbtMulti-cloudSQL

Running dbt at Scale Across Three Clouds

How we standardized transformation logic across Synapse, Redshift, and BigQuery with a single dbt project and provider-aware macros.

Apr 20259 min