Data Engineering | Machine Learning | Big Data Architectures
Updated on 04.07.2024
Profile
Freelancer / Self-employed
Remote work
Available from: 01.08.2024
Availability: 75%
of which on-site: 10%
Apache Spark
Apache Kafka
Data Architecture
Data Engineering
Data Science
AWS
Docker
Kubernetes
Java
Scala
Python
Neo4j
TigerGraph
Airflow
Apache Iceberg
Databricks
Hadoop
Hive
HBase
SQL
Machine Learning
German
Native speaker
English
Business fluent
Spanish
Basic knowledge

Work locations

München (+500 km)
Germany, Switzerland, Austria
possible

Projects

4 months
2023-09 - 2023-12

Implement LLM-Based Tax-Automation Tool

AI Engineer
  • Implemented a command-line tax automation and PDF processing tool
  • Fine-tuned the Llama 2 LLM on custom-generated training data to enable custom information retrieval from PDF documents
  • Used a student-teacher approach to generate training data from ChatGPT results

Python, PyTorch, Pandas, Google Colab, Llama, ChatGPT, langchain
Self-Owned
München
1 year 1 month
2022-09 - 2023-09

Re-Design of a Large-Scale Streaming Application

Freelance Data Engineering Consultant
  • Conceptualized functional and non-functional requirements, consulted with the business requirements and technical specification teams, and performed technical reviews of specifications
  • Developed the technical design for a real-time streaming application processing several billion messages per day
  • Implemented multiple Spark Structured Streaming applications, including custom outer-join operations, HBase access and complex data structures
  • Conceptualized and implemented data migration routines from a classical DWH to an event-stream-based representation

Spark (SQL Structured Streaming), Kafka (Avro), HBase, Oracle DB, Informatica, HDFS, Zeppelin, YARN, Apache Hive
Deutsche Börse
München
8 months
2021-12 - 2022-07

Advanced Analytics in a Large-Scale Knowledge Graph

Freelance Graph Data Science Engineer
  • Helped the client with stability and performance challenges in a large-scale single-node Neo4j database
  • Implemented a concurrent ETL process to load a time-sorted, two-dimensional grid of linked nodes into Neo4j
  • Set up a monitoring system for Neo4j
  • Implemented an automated process that backs up the Neo4j database nightly and stores the archives

Neo4j, Python, AWS, Grafana, Graphite, PostgreSQL, HDFS, Kafka
DB Systel
München
1 year 8 months
2019-09 - 2021-04

Advanced Graph Analytics on a Multi-Billion Node Knowledge Graph

Senior Graph Data Science Engineer
  • Designed, implemented and operated a large-scale, stream-fed, multi-billion-node knowledge graph (TigerGraph)
  • Handled requirements analysis, design, implementation and operation of a graph-based real-time recommendation engine for news articles
  • Modeled and validated the schema of a knowledge graph built from multiple data streams

TigerGraph, Kafka, AWS, Docker, Kubernetes, Java, Scala, Go, Grafana, Prometheus, Terraform, Helm
Ippen Digital GmbH
5 months
2019-03 - 2019-07

Implementation of Real-Time Analytics Applications

Data Engineering Consultant
  • Designed and implemented real-time streaming applications
  • Gave an internal workshop on Apache Kafka

Spark, Kafka, Kafka Streams, Scala, Docker, Kubernetes, AWS, Akka Streams
Telefonica
München
5 months
2018-09 - 2019-01

Design & Implementation of an Advanced Analytics Streaming Application

Data Engineering Consultant
  • Designed and implemented a proof of concept for an event-driven data analytics application
  • Developed an ingestion pipeline to transform relational data and feed it into a graph database

Spark, Kafka, Java, Scala, Docker, Neo4j, git
Allianz SE
München
7 months
2018-01 - 2018-07

Feature Implementation & Performance Optimizations of Document Mining Application

Big Data Application Engineering Consultant
  • Optimized a large-scale text analytics pipeline for scalability and performance
  • Implemented knowledge discovery use cases using state-of-the-art NLP and ML approaches
  • Researched and applied efficient algorithms for analyzing large data sets
  • Trained the in-house development team in big data software development and data mining

Spark, Java, Scala, Docker, AWS, CI, Spring, Elasticsearch, SQL, Sonar, git, Grafana, Graphite
Ayfie GmbH
München
4 months
2017-06 - 2017-09

Machine Learning for Knowledge Discovery in Food Recipes

Machine Learning Application Engineer
  • Project: automatic calculation of nutritional values for food recipes
  • Consulted on project planning and the solution approach
  • Solution design: POC for a calculation pipeline based on multiple heterogeneous data sources and various machine learning approaches
  • Implemented the data processing pipeline that calculates the nutritional values
  • Deployed the pipeline on the client's infrastructure (AWS)

Spark, Scala, AWS, Docker, Python, git, MySQL, CouchDB, Requirements, Software Design
EatSmarter GmbH
München
7 months
2016-11 - 2017-05

Master's Thesis: Knowledge Discovery in Unstructured Data

Machine Learning Developer
  • Title: Knowledge Discovery in textual Databases for enhancing the automatic Calculation of nutritional Values for online-based Food Recipes
  • The thesis applies various machine learning and NLP approaches to extract information from unstructured text and determine the nutritional content of food recipes
  • Theoretical approaches: tokenization, part-of-speech tagging, stemming, neural networks, logistic regression, word embeddings

Apache Spark, Scala, Docker, Python, Stanford-NLP, Research, Presentation
TUM
München
7 months
2016-11 - 2017-05

Team Lead of Software Development at TUM Student Group "Roboy"

Lead Software Developer
  • As a team of interdisciplinary TUM students, we developed a child-sized humanoid robot and promoted our work at various events around the world
  • As leader of the software development group, I was responsible for the group's design, implementation and software engineering processes

C++, ROS (Robot Operating System), CMake, Unix, Team Lead, Presentation, Software Design, Software Architecture, Robotics
TUM
4 months
2016-06 - 2016-09

Machine Learning for Automatic Classification of Food Recipes

Machine Learning Application Engineer
  • Project: provide an indicator of the healthiness of food recipes based on expert ratings
  • Consulted on management and design thinking to derive useful, innovative use cases from the client's existing data
  • Proof of concept: supervised the generation of the training data set and analyzed data quality
  • Implemented a supervised classification system based on selected features and various approaches
  • Implemented and validated the final solution

Spark, Scala, AWS, SQL, git, Docker, Python, Software Design, Requirements
EatSmarter GmbH
München
6 months
2014-10 - 2015-03

Research & Development: Unmanned Aerial Vehicle

Embedded Software Developer
  • Bachelor's thesis: design and prototypical implementation in C++ of a dynamic mission planner for integration into the mission planning software of an unmanned aerial vehicle
  • For the thesis, I researched multiple route-planning approaches for an unmanned helicopter (UMAT) exploring a predefined area for hazardous gas
  • I implemented the routing algorithm in C++ for use by the mission planning software developed by ESG

C++, Qt, CMake, Unix, Embedded Software, Research, Software Engineering, Software Architecture
ESG
Fürstenfeldbruck
1 year 5 months
2013-05 - 2014-09

Software Engineering & Development: Aerosystems Avionics

Embedded Software Developer
  • I was part of the team developing a time- and safety-critical distributed middleware in C
  • The middleware was deployed on a highly modular avionics platform developed by ESG
  • The platform was composed of multiple Unix-like modules, which were interchangeable on the fly

C, CMake, Unix, Software Engineering, Software Architecture, Embedded Software
ESG
Fürstenfeldbruck

Education and Training

1 year 8 months
2015-10 - 2017-05

M. Sc. Computer Science

M. Sc., Technical University Munich (TUM)

  • Machine Learning
  • Artificial Intelligence
  • Big Data Analytics
  • Entrepreneurship

3 years 6 months
2011-10 - 2015-03

B. Sc. Computer Science

B. Sc., University of Applied Sciences Munich (FHM)

  • Mathematics, Statistics
  • Algorithms & Data Structures
  • Software Engineering
  • Software Architecture
  • Theoretical Computer Science

Position

Freelance Data Engineer & Machine Learning Consultant, Content Creator, Speaker & Trainer

As a passionate freelance big data consultant, I build high-quality, scalable, data-driven applications tailored to my clients' requirements. I focus on production-ready software solutions that combine current machine learning techniques with robust big data architectures. I work with in-house software development teams as well as non-technical stakeholders, and I favor practical problem-solving and clear perspectives. I also enjoy running customized workshops that give teams the knowledge and skills to develop scalable big data applications independently.

Skills

Top skills

Apache Spark, Apache Kafka, Machine Learning, Data Architecture, Data Engineering, Data Science, AWS, Docker, Kubernetes, Java, Scala, Python, Neo4j, TigerGraph, Airflow, Apache Iceberg, Databricks, Hadoop, Hive, HBase, SQL

Products / Standards / Experience / Methods

Big Data Architecture: Expert
Apache Spark: Expert
Apache Kafka: Expert
Scala: Expert
Python: Expert
Java: Expert
Software Engineering: Expert
Machine Learning: Advanced
Neo4j: Expert
Amazon Web Services: Advanced
Algorithms: Advanced
Kubernetes: Advanced
Docker: Advanced
Elasticsearch: Advanced
Hadoop: Advanced
Hive: Advanced
NLP: Advanced
SQL: Expert
Statistics: Advanced
Airflow: Advanced
Databricks: Expert
Software Architecture: Expert
Clean Code: Expert
PyTorch: Advanced
Requirements Analysis: Advanced
Content Creation: Expert
Snowflake: Advanced
MySQL: Advanced
PostgreSQL: Advanced


Industries

  • Insurance
  • LegalTech
  • FinTech
  • Transportation
  • Online Publishing
  • Telecommunication
