Using Machine Learning for Categorizing Satellite Data (Apprentice Position)
Summary
Cyclostationary signals are a special class of signals whose statistical properties do change with time in a periodic or close-to-periodic way. (https://cyclostationary.blog/)
There is a lot of research that needs to be done to learn about satellite data mainly about cyclostationary signals (Gardner, 1994) and research on analyzing these data for further processing. Normally, these data needs to be manually analyzed and interesting features needs to be plotted and extracted. In this work, we want to use machine learning on the available data to extract these interesting features automatically. Unsupervised machine learning models help to analyze data and categorize data automatically based on their features. It will help group the data into various plots based on their common features. After that more work can be done including application of various other machine learning models to analyze the data such as learning about cyclostaionary data, and their extraction methods.
References:
Gardner, W. A. (1994). An introduction to cyclostationary signals. In Cyclostationarity in communications and signal processing (pp. 1-90). New York: IEEE press.
Job Description
Students will develop machine learning models to analyze the raw satellite data and analyze cyclostaionary data. Students will be doing data preprocessing, primary data analysis and visualization and then developing models for extracting important features from the data. A few examples of feature analysis using machine learning: -Is significant signals at non-zero cycle frequencies -If so, how to characterize the shape -At which cycle frequencies signals are present -At which radio frequencies do the signals peak -Are they present in both polarizations -Is there any differences between the polarizations
Students will be exploring to implement various machine learning models that would work properly with the available dataset. The developed machine learning model should be able to predict the presence and categorize the cyclostaionary signals in new input data.
Computational Resources
Students will be using the computational resource in XSEDE. Students will be responsible for exploring various resources available in XSEDE that can be used for machine learning computation. Students will be using Python libraries for data processing such as numpy, pandas and dataframes. Pytorch and Tensorflow libraries will be used.
Contribution to Community
The work done by the students will be shared to XSEDE community as reports. This work will contribute to machine learning and satellite data research domain.
Position Type
Apprentice
Training Plan
Student has experience with basic machine learning concepts and worked with satellite data last summer. Student will continue their work and will be provided the necessary support by the faculty.