
German Election Programs Analysis
Analysis of election programs with Natural Language Processing and development of a local LLM-based chatbot.
NLP, RAG, spaCy, LangChain, Vectorstore
Reports | GitHub repo

I'm a weekend data analyst from Germany. I'm mostly interested in getting valuable insights from data using the simplest means rather than the fanciest algorithm. As you can see below, I like working with data from various fields and applying it in different machine learning and data analytics projects.
Most of my data science projects use Python and its associated libraries, and I'm deepening my skills in SQL and Tableau.
I studied computer science and information systems with a focus on data science in my bachelor's and master's degrees. I have been working as a Business Architect at Hewlett Packard Enterprise (HPE) for more than five years. My major topics have been data platforms, data strategy and Trustworthy AI.

Data visualization of official car license data in Germany with a focus on alternative drives.
Data visualization, Tableau, Open Data, Data Storytelling
Tableau Dashboard

Gathering and clustering open U.S. chocolate data, because customer segmentation is boring...
KMeans, DBSCAN, unsupervised, REST API, PostgreSQL
Source Code here

Data augmentation with Python and forecasting using the Prophet algorithm, with the model explained through Shapley values.
Time-series forecast, Prophet, Streamlit, SHAP
Source Code here | Streamlit Web App

Data exploration of bike count stations in Berlin. Preparing data and views with SQL and visualizing them with Tableau.
Data Exploration, SQL, Tableau
Source Code here | Tableau Dashboard

Combine and Preprocess Your Heterogeneous Data for Analytics with Apache Flink
Data Pipelining, Apache Flink, real-time
Read Medium article

Predict whether telco customers will leave the company: Exploratory Data Analysis, Machine Learning and Deep Learning
Classification, Scikit-Learn, Keras, Pandas
See source code | Learn more soon

CRISP-DM Data Science Project in Python: Exploratory Data Analysis and Machine Learning
Regression, Scikit-Learn, Pandas
source code here