Me

My name is Ernesto Lora Gonzalez, bachelor's degree in Physics. Data Engineer with 2+ years of experience developing high-performance data systems in Snowflake, Databricks, and AWS. Experienced in building data pipelines to clean data efficiently, modernizing legacy data pipelines, and contributing to the development of fraud detection systems. My work also involves ensuring data governance through testing and version control, and enabling efficient data-driven decisions.

inventory Tracking

ML Fraud Detection Pipeline with SageMaker & Databricks

I worked with a Singapore-based fintech startup that needed to move away from batch fraud detection and toward a real-time ML system.

inventory Tracking

Real-Time Inventory Tracking for E-Commerce Using Kafka & Databricks

E-commerce company that was having a major issue with overselling.

Real-Time Personalization Engine Using Kafka, AWS Lambda & DynamoDB

I partnered with an e-commerce company in the US that wanted to upgrade their product recommendation engine from batch to real-time.

data lake

Small Business Data Lake for Customer Insights

A small grocery chain in Lima, PerĂº, struggled to understand the impact of their sales promotions. Their sales, loyalty, and invoice data were siloed across multiple systems, making it nearly impossible to track customer behavior or campaign effectiveness.

traffic disruptions

Logistics Twitter Hashtag Crisis Alerts

Logistics company was losing revenue due to delivery delays caused by spontaneous protests and blockades.

Fintech Weather Impact on Loan Delays

A fintech lender suspects weather affects loan repayments in rural areas. They need to correlate weather data with repayment delays (but lack centralized data).

E-Commerce CSV-to-Parquet Optimization

Supplier inventory CSVs were causing slow analytics and schema inconsistencies.

Spam Detection

Image Classification Cifar 10

We have implemented a CNN with a Cutout regularization technique to the Cifar 10 image dataset.

Spam Detection

Spam detection

A dataset of spam messages and legitimate messages.

Students

Predict Students Drop Out

A dataset from a higher education institution that includes students' social, demographic, and academic background information. Each student is classified as either a graduate, currently enrolled, or a dropout.

Air Quality

Air Quality Calibration

A dataset that includes the registration of various pollutants from a sensor and data from a certified analyzer.

Credit Card

Predict default of credit card clients

A dataset that includes the credit history and additional information of customers, classified based on whether they defaulted or paid on time.