Description

Data Engineer (m/f/div) – CTC

About us

As a data engineer at the Center for the Trans­formation of Chemistry (CTC) you will play a pivotal role in designing, implemen­ting, and main­taining the center’s data infra­structure to sup­port FAIR data manage­ment and enable scalable and reliable (big) data pro­cessing, storage, and stream­lined analytics. In this context you will contri­bute to the develop­ment of state-of-the-art research data manage­ment strategies that, by leveraging the data lake and data ware­house concepts, foster data inte­gration and seamless analysis workflows.

You will be joining the CTC in an exciting phase of its develop­ment, where you will have not only the unique chance to contri­bute to shape the vision, strategy, and proces­ses by which the center acquires, stores, curates, and manages hetero­geneous research data, but also ample oppor­tunities for skills improve­ment and professional growth.

You will be part of the ambitious Data Management Team of the CTC, which strives to be an exemplary in data manage­ment in the context of the chemistry research com­munity. The team is deeply in­volved in all the CTC’s activities connected with the know­ledge generation process and aims at suppor­ting the scientific commu­nity of the center with state-of-the-art data handling solutions.

The CTC scientific team is located approximately 30 minutes from Leipzig and Halle (Saale) in Leuna.

Your key responsibilities

  • contribute to the design and development of the CTC’s data architecture strategy
  • build and optimize state-of-the-art, cost effective, scalable, and high-perfor­mance data storage and processing systems that integrate clouds and on-premise plat­forms and leverage big data techno­logies
  • contribute to design, develop, and maintain data pipelines and ETL pro­cesses to ingest, transform, and load heterogeneous research data from various sources, in the con­text of a data ware­house / data lake infrastructure
  • define data standards and best practices and implement data governance and best practices policies to support data FAIRness and ensure compliance with regulatory requirements (e.g., GDPR) across the whole CTC
  • interact with the CTC’s scientists and with the IT group to define the users’ requirements and develop tailored solutions according to the peculiar features of the generated data
  • contribute proactively to the continuous optimi­zation of data pro­cesses through the enhance­ment of the data quality and the improvement of the efficiency of the data workflows
  • keep the pace with the emerging techno­logies and best practices in data architecture and data engineering and contribute proactively to the conti­nuous improvement of our data architecture and infrastructure

Qualifications

  • bachelor’s or master’s degree in computer science or in a data science related field
  • PhD or equivalent working experience (minimum three years) in computer science or in a data science related field
  • proven experience as a data engineer or a similar role, with a strong back­ground in designing and implementing data archi­tecture solutions, data pipelines, and ETL processes: pre­vious experience in the scientific research data domain would be a strong plus
  • familiarity with data modeling techniques, data ware­housing concepts (e.g., dimensio­nal modeling), and data governance frameworks
  • in-depth knowledge of database technologies, data modeling techniques, and ETL processes, inclu­ding either relational databases (e.g., SQL Server, Oracle) or NoSQL data­bases (e.g., PostgreSQL, MySQL, MongoDB)
  • proficiency in programming languages such as Python, Java, Scala, or C++
  • hands-on experience with cloud platforms and services, big data technologies, such as, e.g., Hadoop and Spark and containerization technologies, such as, e.g., Singularity, Docker, Kubernetes
  • familiarity with data governance frameworks, data security standards (e.g., GDPR, HIPAA), and data handling best practices
  • strong analytical and problem-solving skills, with a passion for leveraging efficient data management practices to support and drive scientific insights and innovation
  • excellent communication and collaboration skills, with the ability to interact effectively with technical and non-technical users

  • the opportunity to shape the data management structures and policies of what will be the largest research center for chemical research in Europe
  • meet and work with top experts from academia and industry
  • competitive compensation according to TVöD Bund E 13 or the candidate’s qualifications
  • partial compensation of job commute ticket
  • comprehensive social benefits including 30 days of vacation, annual special payment, and additional pension scheme (VBL)
  • term-limited position until December 31, 2025, allowing for fresh perspectives in terms of contract extension and promotion opportunities after 2025

The Max Planck Society and the CTC strive for gender equality and diversity. We welcome applications from all backgrounds. We are committed to increasing the number of individuals with disabilities in its workforce and therefore encourages applications from such qualified individuals. Furthermore, the Max Planck Society and the CTC seek to increase the number of women in those areas where they are underrepresented and therefore explicitly encourages women to apply.

The position is available starting immediately after the offer is awarded.