William Kau.
AI & Data Engineer.

M2 engineering student at ESILV, building AI applications around LLMs and synthetic data. Currently a Fullstack / AI Solution intern at Aubay Solutec, with prior experience at BPCE SI and Manaos.

Based in
Paris, FR
Studying
ESILV — M2
Now
Aubay Solutec
Focus
LLMs · Data
Python
SQL
C#
Java
FastAPI
Angular
React
Next.js
PyTorch
TensorFlow
Hugging Face
LangChain
SDV
SynthCity
PostgreSQL
Oracle SQL
MinIO
GCP
Docker
Git
Power BI
Streamlit
Python
SQL
C#
Java
FastAPI
Angular
React
Next.js
PyTorch
TensorFlow
Hugging Face
LangChain
SDV
SynthCity
PostgreSQL
Oracle SQL
MinIO
GCP
Docker
Git
Power BI
Streamlit
About

Engineer driven by data and curiosity.

Portrait of William Kau

I'm William, a final-year engineering student in Data Science & AI at ESILV Paris La Défense. Over the past few years I've been moving back and forth between data engineering, machine learning, and fullstack development — chasing the projects where those three intersect.

My focus today is on LLMs and synthetic data: how we can train models on data that doesn't exist yet, generate it responsibly, and make AI systems people can actually trust and inspect.

Outside the screen, I'm a setter on a competitive volleyball team, I run trails, climb, and shoot photographs — interests that, more often than I expected, seep back into the way I think about code.

5+
Years coding
3
Internships
1st
ESILV PI2 prize
975
TOEIC / 990
Experience

Where I've been.

Three internships at the intersection of data engineering, AI and product — each one a step deeper into building things that ship.

  1. Feb 2026 — Present

    Fullstack Developer — AI Solution @ Aubay Solutec

    Paris, France

    Building a fullstack synthetic data generation platform: a micro-services architecture (FastAPI / Angular) that selects and tunes generative algorithms (GAN, VAE, ARGN, DDPM) based on input data. Storage on MinIO and PostgreSQL hosted on Nexus.

    FastAPIAngularPyTorchGANVAEDDPMMinIOPostgreSQL
  2. Apr 2025 — Aug 2025

    Data Analyst IT — Internship @ BPCE SI

    Paris 13e, France

    Optimised data pipelines through performant SQL on Oracle databases via SQL Developer. Delivered BI reports tailored to business needs, and contributed to migrating the data heritage to Google Cloud Platform.

    Oracle SQLPower BIGCP
  3. Sept 2024 — Apr 2025

    Data & AI Developer — School Project @ MANAOS — BNP Paribas

    Paris 8e, France
    1st prize — ESILV PI2 2024-2025

    Built an ESG data management application in Python / Streamlit (team of 6) with an integrated open-source LLM (Hugging Face) to query the data in natural language.

    PythonStreamlitHugging FaceLLMESG
Selected work

Projects I'm proud of.

A mix of school, internship and personal work — usually somewhere between AI research and shipping software.

2025

Photographic Composition Analysis

Computer vision for visual aesthetics

A vision model that scores photographs on composition rules — rule of thirds, leading lines, depth of field — with explainability via Grad-CAM. Trained on a personal dataset I annotated from my own photographs.

PythonPyTorchComputer VisionGrad-CAM
2025

Financial & ESG RAG

Retrieval-augmented Q&A with source traceability

A retrieval-augmented system for querying financial and ESG reports in natural language. Semantic retrieval pipeline with source citation and a hallucination-detection layer.

PythonLangChainRAGLLM
2024-25

MANAOS — ESG Data Platform

1st prize, ESILV PI2 2024-2025

Built with a team of 6 inside BNP Paribas' MANAOS subsidiary: an ESG data management app in Python / Streamlit with an integrated open-source LLM (Hugging Face) for natural-language queries over the data.

PythonStreamlitHugging FaceLLM
2026

Synthetic Data Generation Platform

Generative models, productionised

Ongoing at Aubay Solutec: a fullstack micro-services platform that picks and tunes generative algorithms (GAN, VAE, ARGN, DDPM) based on the input dataset. FastAPI + Angular, MinIO and PostgreSQL on Nexus.

FastAPIAngularGANVAEDDPMPostgreSQL
Off-screen

What keeps me sharp.

The hours I spend away from a keyboard — and why they end up shaping my engineering work more than I expected.

Volleyball

Setter for a competitive amateur team in Lognes. The position taught me a lot about anticipating, reading patterns, and making quick calls under pressure.

Running & Trail

Long runs and trail outings — the rhythm I rely on to think through hard problems, away from any screen.

Climbing

On the wall I get to optimise something different: balance, route reading, body tension. Debugging a route is a lot like debugging code.

Photography

I shoot mostly in available light. Photography is what got me into computer vision — and the source of the dataset for my composition-analysis project.

View gallery