Description
Data Engineer (m/f/div) – CTC
About us
As a data engineer at the Center for the Transformation of Chemistry (CTC) you will play a pivotal role in designing, implementing, and maintaining the center’s data infrastructure to support FAIR data management and enable scalable and reliable (big) data processing, storage, and streamlined analytics. In this context you will contribute to the development of state-of-the-art research data management strategies that, by leveraging the data lake and data warehouse concepts, foster data integration and seamless analysis workflows.
You will be joining the CTC in an exciting phase of its development, where you will have not only the unique chance to contribute to shape the vision, strategy, and processes by which the center acquires, stores, curates, and manages heterogeneous research data, but also ample opportunities for skills improvement and professional growth.
You will be part of the ambitious Data Management Team of the CTC, which strives to be an exemplary in data management in the context of the chemistry research community. The team is deeply involved in all the CTC’s activities connected with the knowledge generation process and aims at supporting the scientific community of the center with state-of-the-art data handling solutions.
The CTC scientific team is located approximately 30 minutes from Leipzig and Halle (Saale) in Leuna.
Your key responsibilities
- contribute to the design and development of the CTC’s data architecture strategy
- build and optimize state-of-the-art, cost effective, scalable, and high-performance data storage and processing systems that integrate clouds and on-premise platforms and leverage big data technologies
- contribute to design, develop, and maintain data pipelines and ETL processes to ingest, transform, and load heterogeneous research data from various sources, in the context of a data warehouse / data lake infrastructure
- define data standards and best practices and implement data governance and best practices policies to support data FAIRness and ensure compliance with regulatory requirements (e.g., GDPR) across the whole CTC
- interact with the CTC’s scientists and with the IT group to define the users’ requirements and develop tailored solutions according to the peculiar features of the generated data
- contribute proactively to the continuous optimization of data processes through the enhancement of the data quality and the improvement of the efficiency of the data workflows
- keep the pace with the emerging technologies and best practices in data architecture and data engineering and contribute proactively to the continuous improvement of our data architecture and infrastructure
Qualifications
- bachelor’s or master’s degree in computer science or in a data science related field
- PhD or equivalent working experience (minimum three years) in computer science or in a data science related field
- proven experience as a data engineer or a similar role, with a strong background in designing and implementing data architecture solutions, data pipelines, and ETL processes: previous experience in the scientific research data domain would be a strong plus
- familiarity with data modeling techniques, data warehousing concepts (e.g., dimensional modeling), and data governance frameworks
- in-depth knowledge of database technologies, data modeling techniques, and ETL processes, including either relational databases (e.g., SQL Server, Oracle) or NoSQL databases (e.g., PostgreSQL, MySQL, MongoDB)
- proficiency in programming languages such as Python, Java, Scala, or C++
- hands-on experience with cloud platforms and services, big data technologies, such as, e.g., Hadoop and Spark and containerization technologies, such as, e.g., Singularity, Docker, Kubernetes
- familiarity with data governance frameworks, data security standards (e.g., GDPR, HIPAA), and data handling best practices
- strong analytical and problem-solving skills, with a passion for leveraging efficient data management practices to support and drive scientific insights and innovation
- excellent communication and collaboration skills, with the ability to interact effectively with technical and non-technical users
- the opportunity to shape the data management structures and policies of what will be the largest research center for chemical research in Europe
- meet and work with top experts from academia and industry
- competitive compensation according to TVöD Bund E 13 or the candidate’s qualifications
- partial compensation of job commute ticket
- comprehensive social benefits including 30 days of vacation, annual special payment, and additional pension scheme (VBL)
- term-limited position until December 31, 2025, allowing for fresh perspectives in terms of contract extension and promotion opportunities after 2025
The Max Planck Society and the CTC strive for gender equality and diversity. We welcome applications from all backgrounds. We are committed to increasing the number of individuals with disabilities in its workforce and therefore encourages applications from such qualified individuals. Furthermore, the Max Planck Society and the CTC seek to increase the number of women in those areas where they are underrepresented and therefore explicitly encourages women to apply.
The position is available starting immediately after the offer is awarded.