Data science pipelines

Last updated on 1 Jun 2023 BSc, MSc, PhD

A typical data science workflow involves a complex sequence of processes to transform raw data into actionable insights. These involve various tasks including data cleaning, exploratory data analysis, missing data imputation, feature extraction, model selection and training. Our research focusses on the challenges and opportunities presented by automated and AI-assisted machine learning pipelines.

Possible topics

Adversarial data analysis
AI-assisted Bayesian data analysis workflows
An automated visualization grammar for exploratory data analysis
Automated survey analysis pipelines
Data preparation multiverse analysis
Discovery and synthesis of data science pipelines via agent self-experimentation
Forensic data archaeology for scientific replicability
Preprocessing strategies for spatio-temporal event data

Data science pipelines

Possible topics

Further reading

Sergey Redyuk

Senior Researcher

David Antony Selby

Senior Researcher