Compressive sensing techniques for NASA's PACE (Plankton, Aerosol, Cloud, ocean Ecosystem) mission satellite data analysis. This project implements and compares various machine learning frameworks for spectral data compression and reconstruction.
This repository contains five main analysis notebooks:
- Preprocessing_Granules.ipynb - Data preprocessing pipeline for PACE OCI L1B granules, including time-series search, scan line extraction, and S3 storage
- frameworks.ipynb - Comprehensive comparison of compressive sensing frameworks including PCA, autoencoders, concrete autoencoders, and feature selection methods
- hyperparameter_tuning.ipynb - Hyperparameter optimization for Concrete Autoencoder (CAE) and Feature Selection Autoencoder (FSAE) architectures
- data_saturation.ipynb - Analysis of training data requirements and performance scaling from 10% to 100% dataset utilization
- analysis.ipynb - Consolidated analysis and comparison of all experimental results, including model performance metrics and critical wavelength analysis
- Python 3.11 or higher
- Virtual environment (recommended)
-
Clone this repository:
git clone git@github.com:hilahersz/pace-cs.git cd pace-cs -
Create and activate a virtual environment:
python3.11 -m venv venv source venv/bin/activate -
Install required dependencies:
pip install -r requirements.txt
The project dependencies include:
earthaccess- NASA Earthdata authentication and data accesscartopy- Geospatial data processingboto3- AWS S3 integrationscikit-learn- Machine learning frameworkstensorflow- Deep learning modelsxarray,pandas,numpy- Data manipulationpyarrow- Parquet file handling