William Kau.
AI & Data Engineer.

M2 engineering student at ESILV, building AI applications around LLMs and synthetic data. Currently a Fullstack / AI Solution intern at Aubay Solutec, with prior experience at BPCE SI and Manaos.

Based in: Paris, FR
Studying: ESILV — M2
Now: Aubay Solutec
Focus: LLMs · Data

See my work Contact me

scroll

◆Python

◆SQL

◆C#

◆Java

◆FastAPI

◆Angular

◆React

◆Next.js

◆PyTorch

◆TensorFlow

◆Hugging Face

◆LangChain

◆SDV

◆SynthCity

◆PostgreSQL

◆Oracle SQL

◆MinIO

◆GCP

◆Docker

◆Git

◆Power BI

◆Streamlit

◆Python

◆SQL

◆C#

◆Java

◆FastAPI

◆Angular

◆React

◆Next.js

◆PyTorch

◆TensorFlow

◆Hugging Face

◆LangChain

◆SDV

◆SynthCity

◆PostgreSQL

◆Oracle SQL

◆MinIO

◆GCP

◆Docker

◆Git

◆Power BI

◆Streamlit

About

Engineer driven by data and curiosity.

I'm William, a final-year engineering student in Data Science & AI at ESILV Paris La Défense. Over the past few years I've been moving back and forth between data engineering, machine learning, and fullstack development — chasing the projects where those three intersect.

My focus today is on LLMs and synthetic data: how we can train models on data that doesn't exist yet, generate it responsibly, and make AI systems people can actually trust and inspect.

Outside the screen, I'm a setter on a competitive volleyball team, I run trails, climb, and shoot photographs — interests that, more often than I expected, seep back into the way I think about code.

Years coding

Internships

1st

ESILV PI2 prize

975

TOEIC / 990

Experience

Where I've been.

Three internships at the intersection of data engineering, AI and product — each one a step deeper into building things that ship.

Feb 2026 — Present
Fullstack Developer — AI Solution @ Aubay Solutec
Paris, France
Building a fullstack synthetic data generation platform: a micro-services architecture (FastAPI / Angular) that selects and tunes generative algorithms (GAN, VAE, ARGN, DDPM) based on input data. Storage on MinIO and PostgreSQL hosted on Nexus.
FastAPIAngularPyTorchGANVAEDDPMMinIOPostgreSQL
Apr 2025 — Aug 2025
Data Analyst IT — Internship @ BPCE SI
Paris 13e, France
Optimised data pipelines through performant SQL on Oracle databases via SQL Developer. Delivered BI reports tailored to business needs, and contributed to migrating the data heritage to Google Cloud Platform.
Oracle SQLPower BIGCP
Sept 2024 — Apr 2025
Data & AI Developer — School Project @ MANAOS — BNP Paribas
Paris 8e, France
★ 1st prize — ESILV PI2 2024-2025
Built an ESG data management application in Python / Streamlit (team of 6) with an integrated open-source LLM (Hugging Face) to query the data in natural language.
PythonStreamlitHugging FaceLLMESG

Selected work

Projects I'm proud of.

A mix of school, internship and personal work — usually somewhere between AI research and shipping software.

2025

Photographic Composition Analysis

Computer vision for visual aesthetics

A vision model that scores photographs on composition rules — rule of thirds, leading lines, depth of field — with explainability via Grad-CAM. Trained on a personal dataset I annotated from my own photographs.

PythonPyTorchComputer VisionGrad-CAM

2025

Financial & ESG RAG

Retrieval-augmented Q&A with source traceability

A retrieval-augmented system for querying financial and ESG reports in natural language. Semantic retrieval pipeline with source citation and a hallucination-detection layer.

PythonLangChainRAGLLM

2024-25

MANAOS — ESG Data Platform

1st prize, ESILV PI2 2024-2025

Built with a team of 6 inside BNP Paribas' MANAOS subsidiary: an ESG data management app in Python / Streamlit with an integrated open-source LLM (Hugging Face) for natural-language queries over the data.

PythonStreamlitHugging FaceLLM

2026

Synthetic Data Generation Platform

Generative models, productionised

Ongoing at Aubay Solutec: a fullstack micro-services platform that picks and tunes generative algorithms (GAN, VAE, ARGN, DDPM) based on the input dataset. FastAPI + Angular, MinIO and PostgreSQL on Nexus.

FastAPIAngularGANVAEDDPMPostgreSQL

Off-screen

What keeps me sharp.

The hours I spend away from a keyboard — and why they end up shaping my engineering work more than I expected.

Volleyball

Setter for a competitive amateur team in Lognes. The position taught me a lot about anticipating, reading patterns, and making quick calls under pressure.

Running & Trail

Long runs and trail outings — the rhythm I rely on to think through hard problems, away from any screen.

Climbing

On the wall I get to optimise something different: balance, route reading, body tension. Debugging a route is a lot like debugging code.

Photography

I shoot mostly in available light. Photography is what got me into computer vision — and the source of the dataset for my composition-analysis project.

View gallery →

William Kau.AI & Data Engineer.