I’ve been always passionate about the idea of algae blooms and wondered what are the reasons behind the occurrence of this natural phenomena. In order to find the answer to this question, I moved further and decided to use my data science and machine learning skills to find them.
We decided to find connections and hidden patterns in data on concentration of chlorophyll (which equals to concentration of algae in our case) and geophysical data in order to find, how different parameters influence the growth of algae. However, it would be a teadous task to analyze the whole data for each lake and sea on Earth, therefore we decided to take Ontario and Erie lakes as test-subjects for our research. Unlucky we haven’t had much time to prepare and develop our solution, but we managed to use approximate analysis on chlorophyll concentration, available for photosynthesis radiation, biologically-available carbon-sources and other parameters. After research on data (primary one), we came to conclusion that available for photosynthesis radiation, landscape data, biologically available carbon sources and temperature influence the growth of algae the most. The degree of influence is the next:
In our research we used the next NASA resources:
Sadly we haven't used any SpaceApps offers in our project, but we are going to explore and use them next year.
We are definitely going to continue work on our project, namely we are going to apply different machine learning techniques (Polynomial regression, clusterfication, et cetera) in order to evaluate the numerical and exact value for influence and build a tool able to predict the growth of algae and the chance of occurence of algae bloom in specific area.
We used Panoply (written on Java) as the main tool for exploring the data together with data provided by Ocean Color from MODIC satellites. Graphics were made in Paint.NET, free graphics designer and Figma. In the future development we are going to use Python as the main data analysis tool.
You can try to detect those patterns by yourself by looking at graphs which we used in our research. The album with them can be found here: https://imgur.com/a/EwL6tZ2
#algae_blooms #algae #machine_learning #nasa #panoply #modis #ocean_color #geophysical_data #chlorophyll