Real-Time Fraud Detection ML Pipeline with SageMaker & Databricks

Summary

A fintech startup in Singapore, needed to move beyond legacy fraud detection based on hourly batch jobs. I worked remotely with their ML team to build a real-time machine learning pipeline that detects fraudulent mobile wallet transactions under 300ms using streaming, serverless, and cloud-native tools.

Business Challenge

As scaled across Southeast Asia, their fraud risk increased—especially during promotions. Their existing fraud detection system, based on hourly batch processing, had significant limitations:

The goal was to replace batch detection with an ML-powered scoring system capable of handling real-time transaction streams with high precision and low latency.

Solution and Architecture

We designed a real-time fraud detection pipeline using AWS and Databricks components. Transactions streamed via Kafka were enriched with user behavior and blacklist checks, then scored via a SageMaker ML endpoint—all within milliseconds.

Fraud Detection ML Pipeline Architecture Diagram

Steps Taken

Impact and Results

This solution combined low-latency architecture with advanced ML modeling to enable PaySure to respond to fraud threats in real time—improving both security and customer trust.