NASA SBIR 2016 Solicitation


PROPOSAL NUMBER: 16-2 S5.03-7927
SUBTOPIC TITLE: Enabling NASA Science through Large-Scale Data Processing and Analysis
PROPOSAL TITLE: Open Source Parallel Image Analysis and Machine Learning Pipeline

SMALL BUSINESS CONCERN (Firm Name, Mail Address, City/State/Zip, Phone)
Continuum Analytics, Inc.
221 West 6th Street, Suite 1550
Austin, TX 78701 - 7870
(512) 222-5440

PRINCIPAL INVESTIGATOR/PROJECT MANAGER (Name, E-mail, Mail Address, City/State/Zip, Phone)
Peter Steinberg
1223 NE 94th Street
Seattle, WA 98115 - 7870
(206) 715-4492

CORPORATE/BUSINESS OFFICIAL (Name, E-mail, Mail Address, City/State/Zip, Phone)
Hunt Sparra
221 West 6th Street, Suite 1550
Austin, TX 78701 - 7870
(512) 776-1094

Estimated Technology Readiness Level (TRL) at beginning and end of contract:
Begin: 4
End: 6

Technology Available (TAV) Subtopics
Enabling NASA Science through Large-Scale Data Processing and Analysis is a Technology Available (TAV) subtopic that includes NASA Intellectual Property (IP). Do you plan to use the NASA IP under the award?

TECHNICAL ABSTRACT (Limit 2000 characters, approximately 200 words)
Today, NASA researchers must create, debug, and tune custom workflows for each analysis. Creation and modification of custom workflows is fragile, non-portable and consumes time that could be better spent on advancing scientific discovery. The Phase I open source software Ensemble Learning Models (ELM) provides composable, portable, reproducible, and extensible machine learning pipelines with easy-to-configure parallelization, with tools specifically for satellite data processing, weather and climate data processing, and machine learning and prediction. This is a major advancement over the current state-of-the-art because of reduced workflow creation time, parallelization, portability of deployment and use, extensibility, and robustness. Phase II will extend the Phase I work with more options useful to NASA missions, such as advanced ensemble fitting and prediction tools, feature engineering options for 3-D and 4-D arrays, and a web-based map user interface. Phase II will also harden and extend ELM to make ELM's easy-to-use large data ensemble methods accessible to industry outside of NASA, increasing the potential user base in a variety of domains.

POTENTIAL NASA COMMERCIAL APPLICATIONS (Limit 1500 characters, approximately 150 words)
Continuum Analytics sees direct usage applications in any NASA project deriving analytical value from multidimensional climate data arrays and hyper- or multispectral remote sensing data, such as climate reanalysis, landscape change analysis, land cover mapping, or drought or vegetation indices. In Phase I the team provided a number of scientific data loading tools for formats common in NASA remote sensing and climate science missions. Phase I work also created parallel ensemble fitting and prediction methods for a variety of unsupervised and supervised machine learning models. Phase II will provide tools for NASA data formats, additional tools for feature engineering in multidimensional climate data arrays, and advanced options for ensemble fitting and prediction, like hierarchical modeling and vote count ensemble averaging. Phase II will also include work on a web-based map interface and demonstrations of how ELM MLT may be useful in NASA missions like climate reanalysis and land cover classification.

POTENTIAL NON-NASA COMMERCIAL APPLICATIONS (Limit 1500 characters, approximately 150 words)
The team sees direct usage application of the image analysis and machine learning pipeline outside of NASA, such as: - NOAA mission-related research to predict changes in climate, weather, oceans and coast, and conserving and managing coasting and marine ecosystems and resources. - DOD/IC - foreign defense and homeland security applications - Commercial infrastructure and engineering, disaster management and mitigation analysis, natural resource monitoring, energy-related exploration and operational management. - Flood and floodplain mapping for insurance adjustments, bridge construction projects, FEMA floodplain definitions, river habitat and restoration projects, and emergency planning at local, state, and federal agencies - Forest disease and insect damage density identification for large commercial forest owners - Snow and ice cover and recession analysis useful in climate change and water management planning at federal, state, and local agencies - Developing spectral identifiers of agricultural crops in healthy versus water and nutrient stressed conditions - Classifying parking lots and roads based on the number of vehicles evidently in the image, an indicator of economic activity and also potentially useful in federal security applications - Mapping ecologically sensitive and geotechnically unstable areas, such as wetlands and mass wasting events, useful for reducing the cost of development review in local, state, and federal environmental agencies, remote asset trackin

TECHNOLOGY TAXONOMY MAPPING (NASA's technology taxonomy has been developed by the SBIR-STTR program to disseminate awareness of proposed and awarded R/R&D in the agency. It is a listing of over 100 technologies, sorted into broad categories, of interest to NASA.)
Data Fusion
Data Processing
Image Analysis

Form Generated on 03-07-17 15:43