PhD Position in Data Engineering: The DEEM Lab at Technische Universität Berlin is seeking a PhD student to work on responsible data engineering. This research will focus on enhancing data preparation and data pipelines for complex machine learning systems, addressing challenges related to correctness, reliability, and compliance with regulatory frameworks.
PhD Position in Responsible Data Engineering
Designation
PhD Position in Responsible Data Engineering
Detail | Information |
---|---|
Research Area | Responsible Data Engineering with emphasis on data-centric machine learning |
Location | Technische Universität Berlin, Germany |
Eligibility/Qualification | – Master’s degree in Computer Science, Artificial Intelligence, or equivalent – Strong programming skills in Python and additional languages (Java/Rust/C++) – Knowledge of data processing with dataflow systems, relational databases, and/or dataframe libraries (e.g., Apache Spark, DuckDB, pandas) – Basic understanding of machine learning libraries (e.g., pandas, sklearn, pytorch, SparkML) |
Desirable Qualifications | – Experience with real-world data processing systems or ML deployments – Contributions to open-source projects |
Job Description
The successful candidate will join a cross-organisational research group focused on responsible data management and the democratization of data science technologies. Research will encompass:
- Designing and implementing data-centric methods to control personal data usage in ML systems, in line with regulations such as GDPR and the European AI Act.
- Developing novel declarative methods for managing datasets in ML applications.
- Assisting non-expert users with data-centric tasks related to evaluating the robustness of ML pipelines.
- Contributing to open-source libraries based on research findings.
How to Apply
Interested candidates are invited to send their application including a CV, cover letter, and academic transcripts via email to Prof. Dr. Sebastian Schelter at schelter[at]tu-berlin[dot]de, quoting the reference number IV-22/25.
Last Date to Apply
February 14, 2025
Join us at the DEEM Lab to contribute to cutting-edge research and make a significant impact in responsible data engineering!