Apache Drill: SQL for NoSQL

Apache Drill is a SQL query engine that allows querying of non-relational data sources like Hive, MongoDB, and HBase using SQL. It is based on Google's Dremel and has ODBC/JDBC drivers, allowing other tools to connect to it like a relational database. Drill allows fast SQL analysis of disparate data without transforming or loading it first by internally representing data as JSON without a fixed schema.

Uploaded by

Bora Yüret

APACHE DRILL

SQL for noSQL


What is Apache Drill?

■ A SQL query engine for a variety of non-relational databases and data files
– Hive, MongoDB, HBase
– Even flat JSON or Parquet files on HDFS, S3, Azure, Google Cloud, or the local file system
■ Based on Google’s Dremel
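To illustrate querying a flat file in place, here is a minimal Python sketch that builds the kind of SQL Drill accepts for a file path. It assumes the default `dfs` storage plugin is enabled; the helper `file_query` and the path `/data/events.json` are hypothetical names chosen for illustration, not part of Drill.

```python
def file_query(path: str, workspace: str = "dfs", limit: int = 10) -> str:
    """Build a SELECT over a file path (hypothetical helper).

    Assumption: Drill addresses files as <plugin>.`<path>`, with the
    path wrapped in backticks because it contains slashes and dots.
    """
    if "`" in path:
        raise ValueError("path must not contain a backtick")
    return f"SELECT * FROM {workspace}.`{path}` LIMIT {int(limit)}"


query = file_query("/data/events.json", limit=5)
print(query)  # prints: SELECT * FROM dfs.`/data/events.json` LIMIT 5
```

The resulting string can be pasted into any Drill client; no table definition or load step is needed beforehand.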

It’s real SQL

■ Not SQL-Like
■ And it has an ODBC/JDBC driver, so other tools can connect to it just like any relational database
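As a rough sketch of what a JDBC-capable tool needs in order to connect, the Python helper below assembles a connection URL. The two URL forms (direct to a drillbit, or through ZooKeeper) and the default ports and cluster path reflect my understanding of the Drill documentation and should be checked against your installation; `drill_jdbc_url` is a hypothetical helper, not a Drill API.

```python
def drill_jdbc_url(host: str = "localhost",
                   port: int = 31010,
                   via_zookeeper: bool = False,
                   zk_path: str = "drill/drillbits1") -> str:
    """Assemble a Drill JDBC URL (hypothetical helper).

    Assumptions: 31010 is the drillbit user port, and
    'drill/drillbits1' is the ZooKeeper path of a default cluster.
    """
    if via_zookeeper:
        # Let ZooKeeper pick a drillbit in the cluster.
        return f"jdbc:drill:zk={host}:{port}/{zk_path}"
    # Connect straight to a single drillbit.
    return f"jdbc:drill:drillbit={host}:{port}"


direct_url = drill_jdbc_url()
cluster_url = drill_jdbc_url("zk1.example.com", 2181, via_zookeeper=True)
```

A BI tool or SQL client is then pointed at one of these URLs in the same way it would be pointed at a relational database.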

It’s fast and pretty easy to set up

■ But remember, these are still non-relational databases under the hood!
■ Allows SQL analysis of disparate data sources without having to transform and load them first
– Internally, data is represented as JSON, so it has no fixed schema

You can even do joins across different database technologies

■ Or with flat JSON files that are just sitting around
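A cross-technology join might look like the sketch below, written in Python so the query can be packaged for submission. The database, table, and column names (`movielens`, `ratings`, `users`, `user_id`, `occupation`) are hypothetical, and the query assumes storage plugins named `hive` and `mongo` are enabled. The payload shape follows Drill's REST query endpoint as I understand it (a POST to `/query.json`, by default on port 8047); the code only builds the payload and does not send it.

```python
import json

# Hypothetical schema: ratings live in Hive, user profiles in MongoDB.
JOIN_SQL = """
SELECT u.occupation, COUNT(*) AS rating_count
FROM hive.movielens.ratings r
JOIN mongo.movielens.users u ON r.user_id = u.user_id
GROUP BY u.occupation
""".strip()


def rest_payload(sql: str) -> str:
    """Serialize a query as the JSON body for Drill's REST API.

    Assumption: the endpoint expects {"queryType": "SQL", "query": ...}.
    """
    return json.dumps({"queryType": "SQL", "query": sql})


body = rest_payload(JOIN_SQL)
```

Each side of the join is qualified by its storage plugin, which is how a single statement can span two different back ends.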

Think of it as SQL for your entire ecosystem

Let’s drill

■ We’ll import data into Hive and MongoDB
■ Set up Drill on top of both
■ And do some queries!