
When I read the description of this challenge I got idea to try to use existing Sudoku puzzle solver libraries to fill missing values in data. This is not possible because dataset with missing values do not follow easily described rules of Sudoku so correct values cen't be directly reasoned.
This got me thinking about training machine learning model to work in place of ruleset for every feature column in dataset. Dataset might contain multiple types of data so I also added unsupervised clustering of data before training the column models.
Code for the experiment is available at https://github.com/mikeful/nasa_2019_autofiller