Tharushan Uthayakumar

Data Science Student at Université Paris-Saclay, Institut Polytechnique de Paris & HEC Paris

About Me

I am a Data Science student in the first cohort of the prestigious CPES (Cycle Pluridisciplinaire d'Études Supérieures) program, a joint initiative between Université Paris-Saclay, Institut Polytechnique de Paris, and HEC Paris. This three-year program emphasizes cultural openness, collaborative project-based learning, innovation, and an introduction to research, with applications in health and society.

With a strong foundation in mathematics and computer science, I bridge technical expertise in data modeling with strategic thinking and societal impact. My diverse experiences, from statistical analysis at the French Ministry of Higher Education to building AI solutions for startups, reflect my commitment to leveraging data for meaningful innovation.

  • Ranked 4th out of 450 at École 42 Piscine
  • Ranked 14th out of 634 in QRT Data Challenge
  • Best Scientific Approach Award at Hi! PARIS Data Boot Camp

Education

Sept 2023 – July 2026 (expected)

Université Paris-Saclay × IP Paris × HEC Paris

CPES Data Science, Society and Health

Prestigious three-year interdisciplinary program emphasizing data science applications in health and society, with increasing specialization and research introduction.

  • Data Science: AI, Machine Learning, Data Visualization (Python), Computational Sociology
  • Mathematics: Statistics, Markov Chains, Measure & Integration, Probability Theory, ODEs
  • Computer Science: Algorithms, Language Interpretation, R Programming, Bioinformatics
  • Business: Entrepreneurial Methods, Labor Law, Data Modeling with Excel, Capstone Project
Nov 2024 – Dec 2025

École 42 Paris

Digital Technology Architect | Level: Minishell Completed

Followed in parallel with the second year of CPES.
Peer-learning based computer science program focusing on C programming, Shell scripting, algorithms, and system architecture. Completed intensive Piscine bootcamp (4 weeks, full-time) ranked 4th out of 450 candidates.

Sept 2020 – July 2023

Lycée Rocroy Saint-Vincent de Paul (Paris)

French Baccalaureate with Highest Honours

Specializations: Mathematics and Computer Science (NSI)
Options: Advanced Mathematics, Italian, German, Ancient Greek

Professional Experience

Data Scientist - HEC Capstone Project

MonCab (HEC-incubated startup) | Station F / HEC Paris

September 2025 – April 2026

  • Capstone project with 4 CPES classmates (12h/week) for MonCab's medical office platform
  • Built a Python pipeline to extract, clean, and consolidate national healthcare datasets
  • Delivered an interactive geomarketing map (ZIP/ZAC/FRR, density, amenities) for B2B and B2C decisions
  • Prepared normalized data for the RAG assistant and supported UX/SEO-driven acquisition improvements

Statistical Studies Officer (Intern)

French Ministry of Higher Education and Research | Paris

June – August 2025

  • Conducted data analysis for 2024 scientific employment survey integrated with national R&D survey
  • Developed R script to automate statistical production chain: control modules, pipeline redesign
  • Analyzed Junior Professor Chairs (CPJ) for 2021-2025 campaigns using Excel
  • Drafted internal report and policy brief highlighting key findings

Fellow

baby vc

March 2026 – Present

  • Selected for the Spring 2026 France cohort of the baby vc bootcamp, an intensive 10-week venture capital training program
  • Participating in weekly masterclasses led by investors from top VC funds to master the startup investment lifecycle
  • Developing practical skills in startup sourcing, screening, due diligence, and deal closing
  • Engaging with an exclusive community of over 1,000 fellows, operators, and entrepreneurs to deepen knowledge of the European tech ecosystem

Key Projects & Achievements

QRT Data Challenge 2025 - Ranked 14th/634

ENS Paris, Institut Louis Bachelier, Collège de France & Institut Gustave Roussy

Developed predictive models for overall survival in myeloid leukemia patients using clinical and genomic data. Applied survival analysis techniques, feature engineering, and rigorous model validation on real-world biomedical data.

Python Survival Analysis ML
View on GitHub

Hi!ckathon #6 - AI & Education

Hi! PARIS × Corporate Partners (L'Oréal, Capgemini, TotalEnergies, VINCI and Schneider Electric)

3-day intensive sprint using PISA dataset (1.7M students, 300+ variables). Developed XGBoost regression pipeline (RMSE 82.17) and designed AI-powered mental health solution connecting students, teachers, and counselors.

XGBoost GridSearchCV Frugal AI
View on GitHub

Turbofan Predictive Maintenance

Hi! PARIS Data Boot Camp | Best Scientific Approach Award

Predicted Remaining Useful Life (RUL) of aircraft engines using NASA C-MAPSS time-series sensor data. Combined data exploration, feature engineering, ML/DL techniques, and explainability methods (SHAP).

Python Deep Learning SHAP
View on GitHub

MonCab - Data Infrastructure for AI Assistant

HEC Capstone Project | Station F

Built a documented Python pipeline to consolidate national healthcare datasets for MonCab. Delivered a geomarketing map and normalized data for the RAG assistant, along with UX/SEO analysis to support B2B acquisition.

Python RAG LLM

MentaLIPPS - Mental Health Initiative

2-Year Solidarity Project | LIPPS High School

Developed sustainable initiative to reduce academic stress using data from national surveys. Organized annual forums facilitating peer discussions and alumni exchanges on orientation and mental well-being.

Design Thinking Data Analysis Social Impact

42 School Projects (Minishell & more)

École 42 Paris

Built shell interpreter, memory management systems, and algorithmic solutions in C. All projects peer-reviewed and tested by 3 students consecutively. Focus on low-level programming and system architecture.

C Shell Algorithms

Skills & Languages

Technical Skills

Python

Data Science & ML

C

System Programming

SQL

Database Management

R

Statistical Analysis

Excel

Data Modeling

Git

Version Control

Shell/Bash

Scripting & Automation

Machine Learning

Deep Learning, Neural Networks, NLP, Computer Vision

Languages

French

Native

English

C2 - IELTS 8.5/9

Tamil

Bilingual

Italian

Limited Professional

German

Limited Professional

Ancient Greek

Basic

Volunteering & Community Impact

Laureate - Autumn 2025 Promotion

Institut de l'Engagement

October 2025 – Present

Selected as laureate for demonstrated commitment to social engagement and community impact.

Student Advisor & Community Moderator

Article 1 & ASTCommunity

September 2022 – Present (3+ years)

  • Student Advisor: Mentoring students on academic orientation and higher education opportunities through Article 1 association
  • ASTCommunity Moderator: Managing Discord server with 12,000+ members helping students join French Grandes Ecoles via AST (Admissions Sur Titre) pathway
  • Arway Startup Mentor: Answering high school and university students' questions about academic paths and career guidance
  • Mission: Making educational information more accessible and reducing information asymmetry for students from all backgrounds

Founding General Secretary

BDE CPES Paris-Saclay (Student Union)

July 2024 – January 2026

  • Created the association: drafted statutes and submitted them to the prefecture for official recognition
  • Coordinated student events to strengthen campus cohesion and engagement
  • Managed the administrative and organizational aspects of the student office, including planning activities and budget management

Get In Touch

Send a Message