About

PipelinePulse is a hands-on data engineering blog written by a working practitioner, not a marketing team.

I build and maintain production data pipelines every day — SQL-based ETL, scheduled workflows, Delta tables, data quality checks, and everything in between. This blog exists because most data engineering content online is either too theoretical or written by people who’ve never debugged a failing MERGE at 2am.

What you’ll find here:

• Practical tutorials on SQL optimization, ETL architecture, and pipeline scheduling

• Real-world troubleshooting guides (the kind of problems that actually break production)

• Data quality strategies and testing patterns

• How AI tools are changing data engineering workflows

• Honest tool reviews and comparisons

What makes this different: Every article comes from real production experience. When I write about fixing duplicate records in Delta tables, it’s because I’ve actually debugged an incomplete MERGE key that caused deduplication issues in a live workspace. When I cover null value handling, it’s from patching real booking data pipelines.

I’m a data engineer based in Malaysia with almost 10 years of experience building pipelines at scale using Databricks/SAS/Informatica/MSSQL, SQL, Python, and modern data stack tools.

Get in touch: No DMs about “quick consulting calls” — everything I know is in the articles.