An Interactive Tool For Semi-automated Statistical Prediction Using Earth Observations and Models

Back to Search Results  Start New Search

First Author:
Fisseha Berhane, Johns Hopkins University, Baltimore,MD United States

Fisseha Berhane, Johns Hopkins University Baltimore,MD United States

Abstract ID:

Abstract Body:

We developed a semi-automated statistical prediction tool applicable to concurrent analysis or seasonal prediction of any time series variable in any geographic location. The tool was developed using Shiny, JavaScript, HTML and CSS. A user can extract a predictand by drawing a polygon over a region of interest on the provided user interface (global map). The user can select the Climatic Research Unit (CRU) precipitation or Climate Hazards Group InfraRed Precipitation with Station data (CHIRPS) as predictand. They can also upload their own predictand time series. Predictors can be extracted from sea surface temperature, sea level pressure, winds at different pressure levels, air temperature at various pressure levels, and geopotential height at different pressure levels. By default, reanalysis fields are applied as predictors, but the user can also upload their own predictors, including a wide range of compatible satellite-derived datasets. The package generates correlations of the variables selected with the predictand. The user also has the option to generate composites of the variables based on the predictand. Next, the user can extract predictors by drawing polygons over the regions that show strong correlations (composites). Then, the user can select some or all of the statistical prediction models provided. Provided models include Linear Regression models (GLM, SGLM), Tree-based models (bagging, random forest, boosting), Artificial Neural Network, and other non-linear models such as Generalized Additive Model (GAM) and Multivariate Adaptive Regression Splines (MARS). Finally, the user can download the analysis steps they used, such as the region they selected, the time period they specified, the predictand and predictors they chose and preprocessing options they used, and the model results in PDF or HTML format.
Key words: Semi-automated prediction, Shiny, R, GLM, ANN, RF, GAM, MARS

Proposed Session:
IN018: Enabling Scientific Analysis, Data Reuse, and Open Science through Free and Open Source Software

Proposed Section/Focus Group:
Earth and Space Science Informatics