Machine Learning Engineer / Data Scientist / Data Engineer / Data Science Project Manager
Aktualisiert am 25.11.2024
Profil
Freiberufler / Selbstständiger
Remote-Arbeit
Verfügbar ab: 26.11.2024
Verfügbar zu: 100%
davon vor Ort: 100%
Machine Learning & Deep Learning Expertise (NLP/Computer Vision)
Python Programming (PyTorch/TensorFlow/scikit-learn)
Cloud Platforms & Big Data Technologies (AWS/GCP/Apache Spark/BigQuery)
Natural Language Processing (NER/Topic Modeling/Summarization/Language Detection)
Computer Vision (Image Classification/Object Detection/Autoencoders)
Data Engineering & ETL Processes (Data Modeling/Migration/SQL)
Big Data Tools (Apache Spark/PySpark/Hadoop)
Machine Learning Libraries (Keras/XGBoost/Transformers)
Cloud Services (AWS SageMaker/EMR/EC2/S3; GCP BigQuery/BigTable)
Data Analysis Libraries (NumPy/pandas/Matplotlib/plotly)
DevOps & Containerization (Docker/Kubernetes/Terraform)
Version Control & CI/CD Tools (Git/Bitbucket/GitHub/GitLab/Jenkins)
ML Lifecycle & Experiment Tracking (MLflow/SageMaker/Vertex AI)
Web Technologies & APIs (FastAPI/RESTful APIs/Microservices)
Search Technologies (Solr/Elasticsearch)
Project Management & Agile (Jira/Confluence/Scrum/Kanban)
Monitoring & Logging Tools (Grafana/Prometheus/Kibana)
Testing & QA (pytest/unittest/Code Reviews/Clean Code/Pylint/Ruff)
Video Processing & Analysis (FFmpeg/Video Models)
Large Language Models & Transformers (Hugging Face/OpenAI API/GPT/LangChain)
Data Visualization (Matplotlib/plotly/seaborn)
Time Series Analysis & Anomaly Detection
Team Leadership/Coaching/Mentoring
Deutsch
Verhandlungssicher
Englisch
Verhandlungssicher
Ungarisch
Verhandlungssicher

Einsatzorte

Einsatzorte

Deutschland, Schweiz, Österreich


möglich

Projekte

Projekte

1 Jahr 4 Monate
2023-08 - heute

Use of the latest machine learning techniques

Machine Learning Engineer / Data Scientist
Machine Learning Engineer / Data Scientist

  • As a machine learning engineer and data scientist in the search team at OTTO, my main task is to use state of the art machine learning techniques to improve the search experience for our customers.
  • The Solr search engine, which processes 1.000 queries per second and supports around 20 million product variants 24/7, is central to OTTO's e-commerce platform.
  • All improvements are extensively tested and validated through online experiments.


Learning to Select:

  • Improved query precision by filtering out irrelevant results through comprehensive data-driven solutions on clickstream data. 
  • Also identified and removed fraudulent and bot-generated queries to improve model performance and data integrity.


Hybrid Search:

  • Collaborated with two teams to develop a system that integrates both lexical and
  • semantic search approaches to provide more relevant search results.


Advanced Spell Check:

  • Designed, implemented, validated and brought to production a leading-edge spell checking system. 
  • This solution not only corrects customer spelling errors but also guides them towards the most relevant products.


Query Intent Detection:

  • I also led the development of a customer query intent detection approach to identify non-product and navigation queries, and to recognize brand names and their context within search queries (Named entity recognition and classification).

AWS GCP BigQuery Clickstream Data FastText Huggingface Transformers MLflow OpenAI API SageMaker AirFlow Docker Jenkins Terraform Grafana Prometheus Elasticsearch Kibana Confluence Jira Miro Agile/Scrum FastAPI Poetry Python PyTorch GitHub Online Experiments/Testing Solr Pair Programming
OTTO
Hamburg
6 Monate
2023-02 - 2023-07

Large Language Model (LLM) for Start-Ups

Consultant
Consultant
  • As an external consultant, I helped startups to use GPT and other large language models (LLMs).
  • I provided training, evaluated use cases, assessed limitations such as security, performance, accuracy and explored options/alternatives to the OpenAI API.
Haystack Hugging Face Models LangChain Ollama OpenAI API Python
Remote
1 Jahr 5 Monate
2021-09 - 2023-01

Content Understanding / Metadata Creation from Video, Audio and Text

Data Product Owner & Solution Architect / Machine Learning Consultant at RTL Deutschland Python Scrum Agile
Data Product Owner & Solution Architect / Machine Learning Consultant at RTL Deutschland

  • As a freelance consultant and expert in machine learning applications for content understanding, I supported the RTL Data team in building the next generation multi purpose platform "RTL+" in cooperation with Deezer, using visual (video), audio and text data. An integral part of my role was to manage and balance the needs and expectations of the various stakeholders involved in the project.
  • The primary goal of this project is to derive and provide additional metadata from the raw content that can be used by downstream applications such as search, recommendation, and personalization. The key challenge is to establish a clean, reliable, scalable, and production-ready state-of-the-art solution for a large number of building blocks and to create an efficient execution pipeline on top of it.

Video based models:

  • Aesthetic Ranking
  • Dominant Color Extraction
  • End Credits Detection
  • Face Detection
  • Image Quality Detection
  • Logo Detection
  • Mood Detection
  • Object detection and Recognition
  • Place Prediction
  • Scene and Shot-Boundary Detection
  • Shot Type Detection by using and optimizing pre-trained and self-trained models


Audio based models and solutions:

  • Speech-to-Text transcriptions using Google?s Speech-to- Text API and Whisper from Open-AI on Podcasts and other audio sources


NLP solutions:

  • language detection (fastText), festivity detection, kids content detection, adult content detection, topic modeling (BERTopic), keyword extraction (KeyBERT) and text summarization

Google Cloud Platform (GCP) Gitlab CI/CD Google BigQuery SQL Terraform Hugging Face models Google Data Studio MLflow Argo Workflows Elasticsearch FFmpeg JIRA Confluence Scrum Python PyTorch TensorFlow pandas NumPy Poetry Jupyter LLM Transfomer Argo Workflow Docker FastAPI Grafana Atlassian JIRA Kafka Kibana Kubernetes spaCy Streamlit XGBoost
Python Scrum Agile
RTL Deutschland
Köln
3 Jahre 11 Monate
2017-09 - 2021-07

Consumer Insights and Data Science

Machine Learning Engineer / Data Scientist / Deep Learning Expert at adidas
Machine Learning Engineer / Data Scientist / Deep Learning Expert at adidas

  • As a freelance consultant and expert in machine learning, data science and deep learning, I specialized in fraud detection, product recommendation systems, image recognition/classification, anomaly detection, time series analysis and NLP. I led agile projects from conception to production and maintenance & optimization.
  • I focused on various e-commerce solutions that leveraged consumer data, product master data & descriptions, product images and sales transactions.


Product Similarity:

  • in order to increase of the downstream system's performance this solution will help to find similar or related products for a particular product which can be used then as a benchmark or replacement
  • The similarity will be determined by various modalities: 
    • visual similarity (image autoencoder), consumer behavior (clickstream data) and product descriptions (NLP transformers)


Skills:

Python, Jupyter, PySpark, TensorFlow, Jira, Bitbucket


Dynamic Pricing:

  • the main goal for this project is to identify poor performing products in an early stage, uncover possible product issues and determine the right actions e.g. optimal price change to boost performance
  • The overarching goal was to gradually replace the existing solution


Skills:

Python, Jupyter, PySpark, XGBoost, matplotlib, TensorFlow


Consumer Lifetime Value:

  • conception, implementation and maintenance for the historical and future monetary value attributed to an individual consumer. Regular extensions and adaptations for e.g. new markets / brands and deep dive into the model's most important features
  • The models are based on consumer behavior data and are running fully in production and will be updated on a weekly basis for all consumers. The results (KPIs) are intensively used in downstream systems and for marketing campaigns.


Skills:

Python, XGBoost, SHAP, matplotlib, Exasol, Jira, Bitbucket


Visual Product Embeddings:

  • conception and implementation of a variational autoencoder based on product images
  • The source images are being filtered, downscaled and prepared for a convolutional neural network (VAE) where the embeddings will be generated
  • These embeddings are able to capture design elements of a product image which can be used to find similar products but also will be fed into downstream models to improve any productbased model
  • The solution is running in production and will be updated with new images on a weekly basis


Skills:

Python, Keras/TensorFlow, PySpark, SageMaker, OpenCV


Purchase Propensity Scores: 

  • conception, implementation and maintenance for modelling the consumer's purchase intention
  • The solution is running very stable in production for a few years already and the results provide a high contribution to the marketing channels


Skills:

Python, XGBoost, SHAP, matplotlib, Exasol, Jira, Bitbucket

Python R Exasol JIRA Confluence Bitbucket XGBoost TensorFlow Keras Spark PySpark AWS SageMaker XGBoost Exasol
Herzogenaurach
5 Monate
2018-11 - 2019-03

Kaggle Challenge

Data Scientist / Machine Learning Expert Python Pytorch
Data Scientist / Machine Learning Expert

  • participating the "Histopathology Cancer Detection" competition
  • The goal was to identify metastatic cancer in medical images
  • My role was to bring state of the art computer vision techniques to the team and to implement an ensemble of models for the submission
  • We have reached #26 from 1.149 competitors using advanced (high-speed) training techniques and heavy image augmentations

Python Jupyter PyTorch plot.ly GitHub
Python Pytorch
3 Monate
2017-04 - 2017-06

Product Image Classification

Deep Learning / Machine Learning Expert Pyhton TensorFlow Keras
Deep Learning / Machine Learning Expert

  • The goal of the project was to build an MVP for a product image classification system to support annotators' workflows and to identify outliers/broken images within the image pool
  • My role was also to educate the team on the latest deep learning/computer vision possibilities and find additional business cases to implement

Pyhton TensorFlow Keras JIRA Confluence Git
Pyhton TensorFlow Keras
Karlsruhe

Aus- und Weiterbildung

Aus- und Weiterbildung

  • Studied computer science at the Friedrich-Alexander University in Erlangen
  • Electrical engineering (focus on data technology) studies at the Georg-Simon-Ohm University of Applied Sciences in Nuremberg

Professional Training:


  • Databricks to Local LLMs - Duke University (Coursera)
  • Python Essentials for MLOps (Coursera)
  • Microsoft Azure Databricks for Data Engineering (Coursera)
  • deeplearning.ai - Machine Learning Engineering for Production (MLOps)
  • deeplearning.ai - Natural Language Processing Specialization
  • Machine Learning Engineer Nanodegree at Udacity
  • Deep Learning Nanodegree Foundation at Udacity
  • Neural Networks and Deep Learning by deeplearning.ai on Coursera
  • Neural Networks for Machine Learning by University of Toronto on Coursera
  • Machine Learning: Clustering & Retrieval by University of Washington on Coursera
  • Machine Learning: Classification by University of Washington on Coursera
  • Machine Learning With Big Data (2015) by University of California, San Diego on Coursera
  • Machine Learning by Stanford University on Coursera
  • Machine Learning: Regression by University of Washington on Coursera
  • Introduction to Big Data Analytics (2015) by University of California, San Diego
  • Hadoop Platform and Application Framework by University of California, San Diego
  • Machine Learning Foundations: A Case Study Approach by University of Washington
  • iSAQB® Domain-Driven Design (DDD) Workshop
  • iSAQB® Certified Professional for Software Architecture
  • Certified Scrum-Master
  • Sun Certified Java Programmer

Position

Position

Deep Learning / Machine Learning / Data Science Expert

Kompetenzen

Kompetenzen

Top-Skills

Machine Learning & Deep Learning Expertise (NLP/Computer Vision) Python Programming (PyTorch/TensorFlow/scikit-learn) Cloud Platforms & Big Data Technologies (AWS/GCP/Apache Spark/BigQuery) Natural Language Processing (NER/Topic Modeling/Summarization/Language Detection) Computer Vision (Image Classification/Object Detection/Autoencoders) Data Engineering & ETL Processes (Data Modeling/Migration/SQL) Big Data Tools (Apache Spark/PySpark/Hadoop) Machine Learning Libraries (Keras/XGBoost/Transformers) Cloud Services (AWS SageMaker/EMR/EC2/S3; GCP BigQuery/BigTable) Data Analysis Libraries (NumPy/pandas/Matplotlib/plotly) DevOps & Containerization (Docker/Kubernetes/Terraform) Version Control & CI/CD Tools (Git/Bitbucket/GitHub/GitLab/Jenkins) ML Lifecycle & Experiment Tracking (MLflow/SageMaker/Vertex AI) Web Technologies & APIs (FastAPI/RESTful APIs/Microservices) Search Technologies (Solr/Elasticsearch) Project Management & Agile (Jira/Confluence/Scrum/Kanban) Monitoring & Logging Tools (Grafana/Prometheus/Kibana) Testing & QA (pytest/unittest/Code Reviews/Clean Code/Pylint/Ruff) Video Processing & Analysis (FFmpeg/Video Models) Large Language Models & Transformers (Hugging Face/OpenAI API/GPT/LangChain) Data Visualization (Matplotlib/plotly/seaborn) Time Series Analysis & Anomaly Detection Team Leadership/Coaching/Mentoring

Produkte / Standards / Erfahrungen / Methoden

AWS
Bitbucket
Confluence
Git
JIRA
Keras
SageMaker
Scrum
TensorFlow
XGBoost
PyTorch
GCP

Profil

  • Highly skilled and experienced freelance machine learning engineer/consultant with a deep business understanding specialized in state of the art deep learning, machine learning and data science with a proven track record of delivering high-quality results in a fast-paced and production-ready environment.
  • I have worked on projects for various clients in different industries, using my expertise to help the organisation improve efficiency, reduce costs, and increase revenue through the use of data-driven solutions.


Special skills and core competencies:

  • Teamplay: 
    • Team development
    • coaching
    • team motivation
    • agile values 
  • Main tasks: 
    • data science and machine learning (AI)
    • data engineering
    • project management
    • software architecture, analysis and implementation of requirements
    • data modelling
    • data migration
    • performance optimization
    • test automation
    • agile software development
    • enabling high performance teams
    • infrastructure evaluation and modernization
  • Professional software-development (20+ years experience)


Machine Learning / Deep Learning:

  • TensorFlow, Keras, PyTorch, XGBoost, Transformers (NLP & vision) , LLMs
  • Python (numpy, pandas, scikit-learn, matplotlib, plot.ly)


Big Data:

  • Amazon AWS
  • EMR
  • SageMaker
  • GCP
  • Hadoop
  • PySpark


Web-Technologies:

  • HTML
  • CSS
  • XML/XSLT
  • JavaScript/AJAX 
  • SOAP
  • REST
  • Micro-Services 
  • Google Analytics
  • Adobe Analytics


Software development tools and techniques: 

  • JIRA and Confluence 
  • Bitbucket
  • Git
  • Jenkins
  • GitLab 
  • Codeception
  • JUnit
  • PHPUnit
  • SoapUI
  • PyTest
  • Unittest 
  • SonarQube
  • Selenium
  • Pylint, Clean Code
  • Code Reviews 
  • Agile: 
    • Scrum
    • Kanban

Betriebssysteme

Microsoft Windows
Ubuntu
macOS
 

Programmiersprachen

Assembler
C
C++
Java
MATLAB
PHP
PySpark
Python
Sehr gute Kenntnisse
R
 

Datenbanken

Data modelling
data migration
ETL processes
Optimization
performance tuning
strong SQL skills
MS SQL
Oracle Database
MySQL
Exasol
Elasticsearch
BigQuery

Berechnung / Simulation / Versuch / Validierung

SHAP

Branchen

Branchen

  • Healthcare
  • E-Commerce
  • Fashion
  • Media & Television

Einsatzorte

Einsatzorte

Deutschland, Schweiz, Österreich


möglich

Projekte

Projekte

1 Jahr 4 Monate
2023-08 - heute

Use of the latest machine learning techniques

Machine Learning Engineer / Data Scientist
Machine Learning Engineer / Data Scientist

  • As a machine learning engineer and data scientist in the search team at OTTO, my main task is to use state of the art machine learning techniques to improve the search experience for our customers.
  • The Solr search engine, which processes 1.000 queries per second and supports around 20 million product variants 24/7, is central to OTTO's e-commerce platform.
  • All improvements are extensively tested and validated through online experiments.


Learning to Select:

  • Improved query precision by filtering out irrelevant results through comprehensive data-driven solutions on clickstream data. 
  • Also identified and removed fraudulent and bot-generated queries to improve model performance and data integrity.


Hybrid Search:

  • Collaborated with two teams to develop a system that integrates both lexical and
  • semantic search approaches to provide more relevant search results.


Advanced Spell Check:

  • Designed, implemented, validated and brought to production a leading-edge spell checking system. 
  • This solution not only corrects customer spelling errors but also guides them towards the most relevant products.


Query Intent Detection:

  • I also led the development of a customer query intent detection approach to identify non-product and navigation queries, and to recognize brand names and their context within search queries (Named entity recognition and classification).

AWS GCP BigQuery Clickstream Data FastText Huggingface Transformers MLflow OpenAI API SageMaker AirFlow Docker Jenkins Terraform Grafana Prometheus Elasticsearch Kibana Confluence Jira Miro Agile/Scrum FastAPI Poetry Python PyTorch GitHub Online Experiments/Testing Solr Pair Programming
OTTO
Hamburg
6 Monate
2023-02 - 2023-07

Large Language Model (LLM) for Start-Ups

Consultant
Consultant
  • As an external consultant, I helped startups to use GPT and other large language models (LLMs).
  • I provided training, evaluated use cases, assessed limitations such as security, performance, accuracy and explored options/alternatives to the OpenAI API.
Haystack Hugging Face Models LangChain Ollama OpenAI API Python
Remote
1 Jahr 5 Monate
2021-09 - 2023-01

Content Understanding / Metadata Creation from Video, Audio and Text

Data Product Owner & Solution Architect / Machine Learning Consultant at RTL Deutschland Python Scrum Agile
Data Product Owner & Solution Architect / Machine Learning Consultant at RTL Deutschland

  • As a freelance consultant and expert in machine learning applications for content understanding, I supported the RTL Data team in building the next generation multi purpose platform "RTL+" in cooperation with Deezer, using visual (video), audio and text data. An integral part of my role was to manage and balance the needs and expectations of the various stakeholders involved in the project.
  • The primary goal of this project is to derive and provide additional metadata from the raw content that can be used by downstream applications such as search, recommendation, and personalization. The key challenge is to establish a clean, reliable, scalable, and production-ready state-of-the-art solution for a large number of building blocks and to create an efficient execution pipeline on top of it.

Video based models:

  • Aesthetic Ranking
  • Dominant Color Extraction
  • End Credits Detection
  • Face Detection
  • Image Quality Detection
  • Logo Detection
  • Mood Detection
  • Object detection and Recognition
  • Place Prediction
  • Scene and Shot-Boundary Detection
  • Shot Type Detection by using and optimizing pre-trained and self-trained models


Audio based models and solutions:

  • Speech-to-Text transcriptions using Google?s Speech-to- Text API and Whisper from Open-AI on Podcasts and other audio sources


NLP solutions:

  • language detection (fastText), festivity detection, kids content detection, adult content detection, topic modeling (BERTopic), keyword extraction (KeyBERT) and text summarization

Google Cloud Platform (GCP) Gitlab CI/CD Google BigQuery SQL Terraform Hugging Face models Google Data Studio MLflow Argo Workflows Elasticsearch FFmpeg JIRA Confluence Scrum Python PyTorch TensorFlow pandas NumPy Poetry Jupyter LLM Transfomer Argo Workflow Docker FastAPI Grafana Atlassian JIRA Kafka Kibana Kubernetes spaCy Streamlit XGBoost
Python Scrum Agile
RTL Deutschland
Köln
3 Jahre 11 Monate
2017-09 - 2021-07

Consumer Insights and Data Science

Machine Learning Engineer / Data Scientist / Deep Learning Expert at adidas
Machine Learning Engineer / Data Scientist / Deep Learning Expert at adidas

  • As a freelance consultant and expert in machine learning, data science and deep learning, I specialized in fraud detection, product recommendation systems, image recognition/classification, anomaly detection, time series analysis and NLP. I led agile projects from conception to production and maintenance & optimization.
  • I focused on various e-commerce solutions that leveraged consumer data, product master data & descriptions, product images and sales transactions.


Product Similarity:

  • in order to increase of the downstream system's performance this solution will help to find similar or related products for a particular product which can be used then as a benchmark or replacement
  • The similarity will be determined by various modalities: 
    • visual similarity (image autoencoder), consumer behavior (clickstream data) and product descriptions (NLP transformers)


Skills:

Python, Jupyter, PySpark, TensorFlow, Jira, Bitbucket


Dynamic Pricing:

  • the main goal for this project is to identify poor performing products in an early stage, uncover possible product issues and determine the right actions e.g. optimal price change to boost performance
  • The overarching goal was to gradually replace the existing solution


Skills:

Python, Jupyter, PySpark, XGBoost, matplotlib, TensorFlow


Consumer Lifetime Value:

  • conception, implementation and maintenance for the historical and future monetary value attributed to an individual consumer. Regular extensions and adaptations for e.g. new markets / brands and deep dive into the model's most important features
  • The models are based on consumer behavior data and are running fully in production and will be updated on a weekly basis for all consumers. The results (KPIs) are intensively used in downstream systems and for marketing campaigns.


Skills:

Python, XGBoost, SHAP, matplotlib, Exasol, Jira, Bitbucket


Visual Product Embeddings:

  • conception and implementation of a variational autoencoder based on product images
  • The source images are being filtered, downscaled and prepared for a convolutional neural network (VAE) where the embeddings will be generated
  • These embeddings are able to capture design elements of a product image which can be used to find similar products but also will be fed into downstream models to improve any productbased model
  • The solution is running in production and will be updated with new images on a weekly basis


Skills:

Python, Keras/TensorFlow, PySpark, SageMaker, OpenCV


Purchase Propensity Scores: 

  • conception, implementation and maintenance for modelling the consumer's purchase intention
  • The solution is running very stable in production for a few years already and the results provide a high contribution to the marketing channels


Skills:

Python, XGBoost, SHAP, matplotlib, Exasol, Jira, Bitbucket

Python R Exasol JIRA Confluence Bitbucket XGBoost TensorFlow Keras Spark PySpark AWS SageMaker XGBoost Exasol
Herzogenaurach
5 Monate
2018-11 - 2019-03

Kaggle Challenge

Data Scientist / Machine Learning Expert Python Pytorch
Data Scientist / Machine Learning Expert

  • participating the "Histopathology Cancer Detection" competition
  • The goal was to identify metastatic cancer in medical images
  • My role was to bring state of the art computer vision techniques to the team and to implement an ensemble of models for the submission
  • We have reached #26 from 1.149 competitors using advanced (high-speed) training techniques and heavy image augmentations

Python Jupyter PyTorch plot.ly GitHub
Python Pytorch
3 Monate
2017-04 - 2017-06

Product Image Classification

Deep Learning / Machine Learning Expert Pyhton TensorFlow Keras
Deep Learning / Machine Learning Expert

  • The goal of the project was to build an MVP for a product image classification system to support annotators' workflows and to identify outliers/broken images within the image pool
  • My role was also to educate the team on the latest deep learning/computer vision possibilities and find additional business cases to implement

Pyhton TensorFlow Keras JIRA Confluence Git
Pyhton TensorFlow Keras
Karlsruhe

Aus- und Weiterbildung

Aus- und Weiterbildung

  • Studied computer science at the Friedrich-Alexander University in Erlangen
  • Electrical engineering (focus on data technology) studies at the Georg-Simon-Ohm University of Applied Sciences in Nuremberg

Professional Training:


  • Databricks to Local LLMs - Duke University (Coursera)
  • Python Essentials for MLOps (Coursera)
  • Microsoft Azure Databricks for Data Engineering (Coursera)
  • deeplearning.ai - Machine Learning Engineering for Production (MLOps)
  • deeplearning.ai - Natural Language Processing Specialization
  • Machine Learning Engineer Nanodegree at Udacity
  • Deep Learning Nanodegree Foundation at Udacity
  • Neural Networks and Deep Learning by deeplearning.ai on Coursera
  • Neural Networks for Machine Learning by University of Toronto on Coursera
  • Machine Learning: Clustering & Retrieval by University of Washington on Coursera
  • Machine Learning: Classification by University of Washington on Coursera
  • Machine Learning With Big Data (2015) by University of California, San Diego on Coursera
  • Machine Learning by Stanford University on Coursera
  • Machine Learning: Regression by University of Washington on Coursera
  • Introduction to Big Data Analytics (2015) by University of California, San Diego
  • Hadoop Platform and Application Framework by University of California, San Diego
  • Machine Learning Foundations: A Case Study Approach by University of Washington
  • iSAQB® Domain-Driven Design (DDD) Workshop
  • iSAQB® Certified Professional for Software Architecture
  • Certified Scrum-Master
  • Sun Certified Java Programmer

Position

Position

Deep Learning / Machine Learning / Data Science Expert

Kompetenzen

Kompetenzen

Top-Skills

Machine Learning & Deep Learning Expertise (NLP/Computer Vision) Python Programming (PyTorch/TensorFlow/scikit-learn) Cloud Platforms & Big Data Technologies (AWS/GCP/Apache Spark/BigQuery) Natural Language Processing (NER/Topic Modeling/Summarization/Language Detection) Computer Vision (Image Classification/Object Detection/Autoencoders) Data Engineering & ETL Processes (Data Modeling/Migration/SQL) Big Data Tools (Apache Spark/PySpark/Hadoop) Machine Learning Libraries (Keras/XGBoost/Transformers) Cloud Services (AWS SageMaker/EMR/EC2/S3; GCP BigQuery/BigTable) Data Analysis Libraries (NumPy/pandas/Matplotlib/plotly) DevOps & Containerization (Docker/Kubernetes/Terraform) Version Control & CI/CD Tools (Git/Bitbucket/GitHub/GitLab/Jenkins) ML Lifecycle & Experiment Tracking (MLflow/SageMaker/Vertex AI) Web Technologies & APIs (FastAPI/RESTful APIs/Microservices) Search Technologies (Solr/Elasticsearch) Project Management & Agile (Jira/Confluence/Scrum/Kanban) Monitoring & Logging Tools (Grafana/Prometheus/Kibana) Testing & QA (pytest/unittest/Code Reviews/Clean Code/Pylint/Ruff) Video Processing & Analysis (FFmpeg/Video Models) Large Language Models & Transformers (Hugging Face/OpenAI API/GPT/LangChain) Data Visualization (Matplotlib/plotly/seaborn) Time Series Analysis & Anomaly Detection Team Leadership/Coaching/Mentoring

Produkte / Standards / Erfahrungen / Methoden

AWS
Bitbucket
Confluence
Git
JIRA
Keras
SageMaker
Scrum
TensorFlow
XGBoost
PyTorch
GCP

Profil

  • Highly skilled and experienced freelance machine learning engineer/consultant with a deep business understanding specialized in state of the art deep learning, machine learning and data science with a proven track record of delivering high-quality results in a fast-paced and production-ready environment.
  • I have worked on projects for various clients in different industries, using my expertise to help the organisation improve efficiency, reduce costs, and increase revenue through the use of data-driven solutions.


Special skills and core competencies:

  • Teamplay: 
    • Team development
    • coaching
    • team motivation
    • agile values 
  • Main tasks: 
    • data science and machine learning (AI)
    • data engineering
    • project management
    • software architecture, analysis and implementation of requirements
    • data modelling
    • data migration
    • performance optimization
    • test automation
    • agile software development
    • enabling high performance teams
    • infrastructure evaluation and modernization
  • Professional software-development (20+ years experience)


Machine Learning / Deep Learning:

  • TensorFlow, Keras, PyTorch, XGBoost, Transformers (NLP & vision) , LLMs
  • Python (numpy, pandas, scikit-learn, matplotlib, plot.ly)


Big Data:

  • Amazon AWS
  • EMR
  • SageMaker
  • GCP
  • Hadoop
  • PySpark


Web-Technologies:

  • HTML
  • CSS
  • XML/XSLT
  • JavaScript/AJAX 
  • SOAP
  • REST
  • Micro-Services 
  • Google Analytics
  • Adobe Analytics


Software development tools and techniques: 

  • JIRA and Confluence 
  • Bitbucket
  • Git
  • Jenkins
  • GitLab 
  • Codeception
  • JUnit
  • PHPUnit
  • SoapUI
  • PyTest
  • Unittest 
  • SonarQube
  • Selenium
  • Pylint, Clean Code
  • Code Reviews 
  • Agile: 
    • Scrum
    • Kanban

Betriebssysteme

Microsoft Windows
Ubuntu
macOS
 

Programmiersprachen

Assembler
C
C++
Java
MATLAB
PHP
PySpark
Python
Sehr gute Kenntnisse
R
 

Datenbanken

Data modelling
data migration
ETL processes
Optimization
performance tuning
strong SQL skills
MS SQL
Oracle Database
MySQL
Exasol
Elasticsearch
BigQuery

Berechnung / Simulation / Versuch / Validierung

SHAP

Branchen

Branchen

  • Healthcare
  • E-Commerce
  • Fashion
  • Media & Television

Vertrauen Sie auf Randstad

Im Bereich Freelancing
Im Bereich Arbeitnehmerüberlassung / Personalvermittlung

Fragen?

Rufen Sie uns an +49 89 500316-300 oder schreiben Sie uns:

Das Freelancer-Portal

Direktester geht's nicht! Ganz einfach Freelancer finden und direkt Kontakt aufnehmen.