Project Details

The Challenge | Chasers of the Lost Data

Help find ways to improve the performance of machine learning and predictive models by filling in gaps in the datasets prior to model training. This entails finding methods to computationally recover or approximate data that is missing due to sensor issues or signal noise that compromises experimental data collection. This work is inspired by data collection during additive manufacturing (AM) processes where sensors capture build characteristics in-situ, but it has applications across many NASA domains.

Finding Missing Data

This projects develops a python library for imputation of missing data. We have functions other open sources have and do not have!

FakeTiger

We propose a few methods for imputing missing data and create a python library.

The library includes a few basic and advanced methods for data imputation.

(What's new on us!)

Find if two columns have same pattern and do imputation accordingly.

Impute based on elements on the same COLUMN as well as same ROW.

We experiment on NASA meteorites and landslides, and find out doing imputation will improve prediction accuracy.

Our GitHub page for this project: https://github.com/efzfish/lostData