Synopsis "Time Series Geospatial Big Data Analysis Using Array Database"
Synopsis "Time Series Geospatial Big Data Analysis Using Array Database"
FOR
BACHELOR OF ENGINEERING
IN
COMPUTER SCIENCE AND ENGINEERING
SUBMITTED BY
Jayati Gandhi
Literature survey:-
1) “Dynamic Object-Oriented Model and its Applications for Digital Earth” by Yangming Jiang and Siwen Bi, Digital Earth Summit on Geoinformatics, Germany, Nov. 12-14, 2008.
This paper proposes a dynamic object-oriented model that is used to track changes in the Digital Earth. The model treats a spatiotemporal class as the base class of four derived classes: ZeroTObject (ZTO), OneTObject (OTO), TwoTObject (TTO), and ThreeTObject (THTO), where ZTO is a temporal node, OTO is a temporal arc, TTO is a temporal polygon, and THTO is a temporal cube.
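A minimal sketch of what such a class hierarchy could look like in code, with hypothetical attribute names; the paper describes the model conceptually and does not prescribe this implementation.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Tuple


@dataclass
class SpatioTemporalObject:
    """Hypothetical base class: every object carries a lifespan."""
    object_id: str
    valid_from: datetime
    valid_to: datetime


@dataclass
class ZeroTObject(SpatioTemporalObject):
    """ZTO: a temporal node (0-D point with a lifespan)."""
    point: Tuple[float, float] = (0.0, 0.0)


@dataclass
class OneTObject(SpatioTemporalObject):
    """OTO: a temporal arc (sequence of vertices with a lifespan)."""
    vertices: List[Tuple[float, float]] = field(default_factory=list)


@dataclass
class TwoTObject(SpatioTemporalObject):
    """TTO: a temporal polygon (closed ring with a lifespan)."""
    ring: List[Tuple[float, float]] = field(default_factory=list)


@dataclass
class ThreeTObject(SpatioTemporalObject):
    """THTO: a temporal cube (2-D extent plus height, with a lifespan)."""
    ring: List[Tuple[float, float]] = field(default_factory=list)
    height: float = 0.0
```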
2) “An Approach for Assessing Array DBMSs for Geospatial Raster Data” by Janne Kovanen, Ville Makinen, and Tapani Sarjakoski, GEOProcessing 2018: The Tenth International Conference on Advanced Geographic Information Systems, Applications, and Services.
This paper presents an approach for assessing the capabilities of Array Database Management Systems (DBMSs) for managing and processing raster data. It describes a framework for comparing the functionality of Array DBMSs and benchmarking them. The main feature of the framework is that functionality is assessed using both targeted test cases and benchmarking; the experience gained is then used to assess non-functional characteristics against existing quality models. The framework can be extended with further DBMSs, benchmarks, and additional hardware resources. The assessment was first carried out for the community editions of SciDB and Rasdaman, and the study reports some key initial observations about these particular Array DBMSs.
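As an illustration of the framework's "targeted test cases plus benchmarking" idea, a minimal timing harness could look like the sketch below; the test-case names, the rasql-like query strings, and the run_query callable are placeholders added for illustration, not the paper's actual framework.

```python
import statistics
import time
from typing import Callable, Dict, List

# Hypothetical test cases: name -> query string. The rasql-like syntax is
# illustrative only; a real framework would define its own cases per DBMS.
TEST_CASES: Dict[str, str] = {
    "subset_2d": "select c[0:999, 0:999] from water_coll as c",
    "band_mean": "select avg_cells(c) from water_coll as c",
}


def benchmark(run_query: Callable[[str], None], repeats: int = 5) -> Dict[str, float]:
    """Run each test case `repeats` times and report the median wall time."""
    results: Dict[str, float] = {}
    for name, query in TEST_CASES.items():
        timings: List[float] = []
        for _ in range(repeats):
            start = time.perf_counter()
            run_query(query)  # e.g. submit the query to SciDB or Rasdaman
            timings.append(time.perf_counter() - start)
        results[name] = statistics.median(timings)
    return results
```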
4) “Evaluating the Open Source Data Containers for Handling Big Geospatial Raster
Data” by Fei Hu and Mengchao Xu.
This paper provides a comprehensive evaluation of six popular data containers (i.e., Rasdaman, SciDB, Spark, ClimateSpark, Hive, and MongoDB) for handling multi-dimensional, array-based geospatial raster datasets. Their architectures, technologies, capabilities, and performance are compared and evaluated from two perspectives: (a) system design and architecture (distributed architecture, logical data model, physical data model, and data operations); and (b) practical use experience and performance (data preprocessing, data uploading, query speed, and resource consumption). Four major conclusions are offered: (1) no data container, except ClimateSpark, has good support for the HDF data format used in this paper, so time- and resource-consuming data preprocessing is required to load data; (2) SciDB, Rasdaman, and MongoDB handle small/medium volumes of data query well, whereas Spark and ClimateSpark can handle large volumes of data with stable resource consumption; (3) SciDB and Rasdaman provide mature array-based data operation and analytical functions, while the others lack these functions for users; and (4) SciDB, Spark, and Hive have better support for user-defined functions (UDFs) to extend the system capability.
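To illustrate conclusion (4), the sketch below registers a simple user-defined function through Spark's Python API; the per-pixel NDWI computation, the column names, and the toy data frame are assumptions added for illustration, not material from the paper.

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import udf
from pyspark.sql.types import DoubleType

spark = SparkSession.builder.appName("udf-sketch").getOrCreate()

# Hypothetical table of per-pixel band values; in practice these would come
# from ingested raster tiles rather than a hand-written list.
df = spark.createDataFrame([(0.20, 0.05), (0.10, 0.30)], ["green", "nir"])


def ndwi(green: float, nir: float) -> float:
    """Normalized Difference Water Index for a single pixel."""
    return (green - nir) / (green + nir) if (green + nir) else 0.0


# Wrap the plain Python function as a Spark UDF and apply it column-wise.
ndwi_udf = udf(ndwi, DoubleType())
df.withColumn("ndwi", ndwi_udf(df["green"], df["nir"])).show()
```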
5) “The Australian Geoscience Data Cube — Foundations and lessons learned” by Adam Lewis and Simon Oliver.
The Australian Geoscience Data Cube (AGDC) aims to realize the full potential of Earth
observation data holdings by addressing the Big Data challenges of volume, velocity, and
variety that otherwise limit the usefulness of Earth observation data. There have been
several iterations and AGDC version 2 is a major advance on previous work. The
foundations and core components of the AGDC are: (1) data preparation, including
geometric and radiometric corrections to Earth observation data to produce standardized
surface reflectance measurements that support time-series analysis, and
collection management systems which track the provenance of each Data Cube product
and formalize re-processing decisions; (2) the software environment used to manage and
interact with the data; and (3) the supporting high performance computing environment
provided by the Australian National Computational Infrastructure (NCI).
A growing number of examples demonstrate that the data cube approach allows analysts to extract rich new information from Earth observation time series, including through new methods that draw on the full spatial and temporal coverage of the Earth observation archives. To enable easy uptake of the AGDC, and to facilitate future cooperative development, the code is developed under the open-source Apache License, Version 2.0. This open-source approach is enabling other organizations, including the Committee on Earth Observation Satellites (CEOS), to explore the use of similar data cubes in developing countries.
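The open-sourced AGDC code base continues today as the Open Data Cube Python API. The following is a brief, hedged sketch of how it is commonly used to load an analysis-ready time series; the product name, spatial extent, and measurement names are placeholders that depend on the local deployment and its ingested collections.

```python
import datacube

# Connect to a locally configured Data Cube index.
dc = datacube.Datacube(app="water-change-demo")

# Load a decade of green and NIR surface reflectance over a small area.
# Product name, extent, and measurements are placeholders.
dataset = dc.load(
    product="ls8_nbar_albers",          # placeholder Landsat 8 product name
    x=(149.0, 149.2),                   # longitude range (degrees)
    y=(-35.3, -35.1),                   # latitude range (degrees)
    time=("2013-01-01", "2023-01-01"),
    measurements=["green", "nir"],
)

# `dataset` is an xarray.Dataset with (time, y, x) dimensions, which is what
# makes per-pixel time-series analysis straightforward.
print(dataset)
```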
Problem identification:-
Traditional storage for EO data uses various kinds of files, such as Network Common Data Form (NetCDF) for atmospheric and hydrological sciences, GeoTIFF, and Hierarchical Data Format (HDF) for remote sensing images. These specially designed data formats work quite well when the amount of data is not very large. However, issues start to arise as data volumes increase: the most obvious problem is that it is not easy to retrieve and query the information needed. To solve this problem, an array database is designed and implemented as a common database service offering flexible and scalable storage and retrieval of large volumes of multidimensional array data, such as sensor, image, simulation, or statistics data. It has attracted extensive attention from academic and industrial data scientists.
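For concreteness, retrieving a spatial subset that would otherwise mean opening and cropping many individual files can be expressed as a single declarative array query. The sketch below shells out to Rasdaman's rasql command-line client from Python; the collection name, subset indices, and output options are assumptions about a local setup and should be checked against the installed rasql version.

```python
import subprocess

# Hypothetical collection name, created when the rasters were ingested.
COLLECTION = "water_extent"

# Select a 500x500 spatial window and encode it as GeoTIFF. The format string
# and the rasql flags (-q, --out, --outfile) follow common rasql usage but may
# differ between Rasdaman versions.
query = f'select encode(c[0:499, 0:499], "GTiff") from {COLLECTION} as c'

subprocess.run(
    ["rasql", "-q", query, "--out", "file", "--outfile", "subset"],
    check=True,
)
```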
Methodology:-
The main aim of this project is to identify how the area of a water body has changed during the last 10 years. Using an open-source tool such as Rasdaman, a platform for geospatial data management and analysis is configured. After configuration, the time-series satellite data for water detection is downloaded and ingested into the database, and rasql queries are executed against it. Proper metadata is prepared for the images and stored in the database. Images are taken from Landsat 8, the American Earth observation satellite, which provides 11 spectral bands. Each band has different applications, such as coastal and aerosol studies, peak vegetation detection, detection of cloud contamination, and water detection. The main purpose of the bands is to monitor the Earth and keep track of changes on the planet’s surface. After this, different algorithms are implemented for extracting information from the time-series data, and finally a web application is developed to query and visualize the results.
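A minimal sketch of the water-detection step, assuming the time series has already been ingested and loaded as (time, y, x) arrays of green and near-infrared reflectance (Landsat 8 bands 3 and 5). The NDWI threshold of 0.0 and the nominal 30 m pixel size are stated assumptions, not project results.

```python
import numpy as np

PIXEL_AREA_M2 = 30 * 30  # Landsat 8 multispectral pixels are nominally 30 m


def water_area_series(green: np.ndarray, nir: np.ndarray) -> np.ndarray:
    """Estimate water surface area (km^2) per time step.

    green, nir: arrays of shape (time, y, x) with surface reflectance from
    Landsat 8 band 3 (green) and band 5 (NIR).
    """
    ndwi = (green - nir) / (green + nir + 1e-9)  # McFeeters NDWI
    water_mask = ndwi > 0.0                      # assumed threshold
    pixels_per_step = water_mask.sum(axis=(1, 2))
    return pixels_per_step * PIXEL_AREA_M2 / 1e6  # m^2 -> km^2


# Tiny synthetic example: 2 time steps of a 4x4 scene.
green = np.random.rand(2, 4, 4).astype(np.float32)
nir = np.random.rand(2, 4, 4).astype(np.float32)
print(water_area_series(green, nir))
```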
Tools/Software Used:-
Rasdaman (open-source array database) and its rasql query language for data management and analysis; Landsat 8 time-series imagery as the input data; and a web application for querying and visualizing the results.
References:-
1) Yangming Jiang and Siwen Bi, “Dynamic Object-Oriented Model and its Applications for Digital Earth”, Digital Earth Summit on Geoinformatics, Germany, Nov. 12-14, 2008.
2) Janne Kovanen, Ville Makinen, and Tapani Sarjakoski, “An Approach for Assessing Array DBMSs for Geospatial Raster Data”, GEOProcessing 2018: The Tenth International Conference on Advanced Geographic Information Systems, Applications, and Services.
3) C. Kamali and Gethsiyal Augasta, “Geo-Spatial Big Data Analysis: An Overview”.
4) Fei Hu and Mengchao Xu, “Evaluating the Open Source Data Containers for Handling Big Geospatial Raster Data”.
5) Adam Lewis and Simon Oliver, “The Australian Geoscience Data Cube — Foundations and lessons learned”.