0% found this document useful (0 votes)
9 views

CBS Databases&SQL MinorDegree EVEN SEM

Uploaded by

dhanabal.cse
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views

CBS Databases&SQL MinorDegree EVEN SEM

Uploaded by

dhanabal.cse
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

Hindusthan College of Engineering and Technology

An Autonomous Institution, Approved by AICTE, New Delhi, Affiliated to Anna University, Chennai
Accredited by NBA (AERO, AUTO, CIVIL, CSE, ECE, EEE, IT, MECH, MCTS)
Accredited by NAAC ‘A++’ Grade with CGPA of 3.69 out of 4 in Cycle 2
Valley Campus, Coimbatore – 641 032, Tamil Nadu, INDIA
Tel: +91 422 4242424 www.hicet.ac.in

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CONTENT BEYOND THE SYLLABUS

Name of the Faculty : Dr.S.DHANABAL


Subject code / Name : 21CS6601- DATABASES AND SQL
Year & Semester : II & IV
Academic Year : 2023-2024 & EVEN
Topic : EMERGING DATABASE TECHNOLOGIES

Course Outcomes:
CO1: Understand the functional components of DBMS and datamodels

CO2: Able to write SQL queries

CO3: Analyze a system and design ER diagram and Relational Schema

CO4: Analyze a system and design ER diagram and Relational Schema

CO5: Illustrate the concepts for transaction processing, concurrency control


and recovery procedures for RDBMS.

Mapping of Content beyond the Syllabus with the POs and PSOs:

PO PO PO PO PO PO PO PO PO PO1 PO1 PO1 PSO PSO


1 2 3 4 5 6 7 8 9 0 1 2 1 2

CO 2 2 2 3 3 2
3 2
1

CO 3 2 2 2 2 3 2
2
2

CO 2 2 2 2 2 3 2
2 2
3

CO 3 2 2 2 3 2
2
4
CO 3 2 2 2 1 3 2
1
5

3 High level 2 Moderate level 1 Low level


EMERGING DATABASE TECHNOLOGIES
1. MonetDB
MonetDB is an open-source column-oriented database management system developed at
the Centrum Wiskunde & Informatica (CWI) in the Netherlands. It was designed to
provide high performance on complex queries against large databases, such as combining
tables with hundreds of columns and millions of rows. MonetDB has been applied in high
performance applications for online analytical processing, data mining, geographic
information system (GIS), Resource Description Framework (RDF), scientific applications,
text retrieval and sequence alignment processing.
1.1Components of MonetDB:
A number of extensions exist for MonetDB that extend the functionality of the database
engine. Due to the three-layer architecture, top-level query interfaces can benefit from
optimizations done in the backend and kernel layers.
1. SQL: MonetDB/SQL is a top-level extension, which provides complete support for
transactions in compliance with the SQL:2003 standard.
2. GIS: MonetDB/GIS is an extension to MonetDB/SQL with support for the Simple Features
Access standard of Open Geospatial Consortium (OGC).
3. SciQL: SciQL an SQL-based query language for science applications with arrays as first
class citizens. SciQL allows MonetDB to effectively function as an array database. SciQL is
used in the European Union PlanetData and TELEIOS project, together with the Data Vault
technology, providing transparent access to large scientific data repositories. Data Vaults
map the data from the distributed repositories to SciQL arrays, allowing for improved
handling of spatio-temporal data in MonetDB. SciQL will be further extended for the Human
Brain Project.
4. Data Vaults: Data Vault is a database-attached external file repository for MonetDB,
similar to the SQL/MED standard. The Data Vault technology allows for transparent
integration with distributed/remote file repositories. It is designed for scientific data data
exploration and mining, specifically for remote sensing data.
5. R integration: MonetDB/R module allows for user defined functions (UDFs) written in R
to be executed in the SQL layer of the system. This is done using the native R support for
running embedded in another application, inside the RDBMS in this case. Previously the
MonetDB.R connector allowed the using MonetDB data. sources and process them in an R
session. The newer R integration feature of MonetDB does not require data to be transferred
between the RDBMS and the R session, reducing overhead and improving performance. The
feature is intended to give users access to functions of the R statistical software for in-line
analysis of data stored in the RDBMS. It complements the existing support for C UDFs and is
intended to be used for in-database processing. 6.Python integration: Similarly to the
embedded R UDFs in MonetDB, the database now has support for UDFs written in
Python/NumPy. The implementation uses Numpy arrays (themselves Python wrappers for C
arrays), as a result there is limited overhead - providing a functional Python integration with
speed matching native SQL functions. The Embedded Python functions also support mapped
operations, allowing user to execute Python functions in parallel within SQL queries.
7.MonetDBLite: Following the release of remote driver for R (MonetDB.R) and R UDFs in
MonetDB (MonetDB/R), the authors created an embedded version of MonetDB in R called
MonetDBLite. It is distributed as an R package, removing the need to manage a database
server, required for the previous R integrations. The DBMS runs within the R process itself,
eliminating socket communication and serialisation overhead - greatly improving efficiency.
The idea behind it is to deliver an SQLite-like package for R, with the performance of an in-
memory optimized columnar store.
1.2 SciDB
SciDB is a column-oriented database management system (DBMS) designed for
multidimensional data management and analytics common to scientific, geospatial, financial,
and industrial applications. It is developed by Paradigm4 and co-created by Turing Award
winner Michael Stonebraker.
1.2.1 Requirements and Features of SciDB
A summary of the requirements of SciDB are as follows:
1. A data model based on multidimensional arrays, not sets of tuples
2. A storage model based on versions and not update in place
3. Built-in support for provenance (lineage), workflows, and uncertainty
4. Scalability to 100s of petabytes and 1,000s of nodes with high degrees of tolerance to
failures
5. Support for "external" data objects so that data sets can be queried and manipulated
without ever having to be loaded into the database
6. Open source in order to foster a community of contributors and to ensure that data is
never locked up — a critical requirement for scientists.

The SciDB team identifies as key features of their eventual product its array-oriented data
model, its support for versions, provenance, and time table, its architecture to allow
massively parallel computations, scalable on commodity hardware, grids, and clouds, its first-
class support for userdefined functions (UDFs), and its native support for uncertainty. The
SciDB data model supports nested multi-dimensional arrays—often a natural representation
for spatially or temporally ordered data. Array cells can be tuples, or other arrays, and the
type system is extensible. Sparse array representation and operations are supported, with
user-definable handling of null or missing data. SciDB allows arrays to be ―chunked‖ (in
multiple dimensions) in storage, with chunks partitioned across a collection of nodes. Each
node has processing and storage capabilities. Overlaps‖ are definable so that certain
neighborhood operations are possible without communication among nodes. The underlying
architectural conception is of a shared-nothing cluster of tens to thousands of nodes on
commodity hardware, with a single runtime supervisor dispatching queries and coordinating
execution among the nodes‘ local executors and storage managers. An array query language
is defined, and refers to arrays as though they were not distributed. A query planner
optimizes queries for efficient data access and processing, with a query plan running on a
node‘s local executor/storage manager, and a runtime supervisor coordinating execution.
The Array Query Language (AQL) is a declarative SQL-like language with array extensions.
There are a number of array-specific operators, and linear algebraic and matrix operations
are provided. The language is extensible with Postgres-style user-defined functions, and
interfaces to other packages (Matlab, R, etc.) will be provided.

You might also like