Team Null (Chasers of the Lost Data)| Chasers of the Lost Data

Project Details

The Challenge | Chasers of the Lost Data

Help find ways to improve the performance of machine learning and predictive models by filling in gaps in the datasets prior to model training. This entails finding methods to computationally recover or approximate data that is missing due to sensor issues or signal noise that compromises experimental data collection. This work is inspired by data collection during additive manufacturing (AM) processes where sensors capture build characteristics in-situ, but it has applications across many NASA domains.

Filling In Unknown Data with the SPACE Method (Super-Powered Advanced Combined Ensemble)

Past work in null value imputation has focused on one method, or weighting many equally. Researchers note it is hard to tell which might be best without trying it. SPACE weights each algorithm on its strength on the strategically modified target dataset.

Team Null (Chasers of the Lost Data)

We decided to apply our work on a cancer dataset and a NASA Global Landslide dataset.

To strategically modify the dataset, for a randomly selected portion of the rows with missing data, we took another row which had more data and removed it to get down to the data in the other rows to create a validation dataset.This helped judge the algorithms more realistically on the actual data that they would be filling in.