Projects

Road Accidents in France

Titelbild von road-accidents-fr

Motivation

Each year, about 1.3 million people die from road accidents (source: WHO). The goal of this Data Science project is to investigate the influences on the severity and occurrence of traffic accidents. The knowledge gained should help political decision-makers to take measures which may prevent lethal traffic accidents.

Info

This is the repository for my personal data science project on road accidents in France. This is a work in progress; have a look at the repository to check the current status and more information.

The data is provided by the French government.

My Work

  • import raw data and clean it
  • visualize a number of relationships between variables
  • predict the severity of road accidents with means of
    • conventional machine learning (XGBoost, Random Forest with Bayesian Search), and
    • neural networks (Tensorflow Keras)
  • interpret the applied models (Permutation feature importance) .
  • organize the project with git, using GitHub actions to automate linting with Pylint (ensure compliance with the PEP 8 style guide) and testing (with papermill).

Backlog

  • Interpretation of Artificial Neural Networks (with permutation feature importance)
  • Analysis of the occurence of accidents with means of time series analysis (statsmodels)

Findings

Feature Importance

The permutation feature importance for XGBoost shows, that the severity (3 classes: unharmed, injured, killed) of accidents is largely affected by variables of place and time […]

Used Libraries and Tools (Selection)

View Jupyter Notebooks

Badges

Links

Kaggle Projects

Ames Housing Prices

My submission was ranked #764 of 3939 and awarded a bronze medal.

Gemstone Price Prediction

My submission was ranked #54 of 132

Other Coding Projects (not ML-Related)

  • Matlab tool for the evaluation of geometric deviations at the inner and outer arc of a bent component

    B.Sc. thesis

  • OpenMP parallel implementation of fast polynomial transformations in Fortran

    M.Sc. thesis

  • Bending line curvature and torsion analysis with Frenet-Serret equations

    Matlab/Python

  • Algorithm for geometric comparison of target and actual geometry of bending components

    Matlab/Python

Kay Langhammer

Researcher

Kay Langhammer 2024