MapR redefines SQL-on-Hadoop with Apache Drill

September 17, 2014 technuter 2 min read

New Delhi, India, September 17, 2014:Â MapR Technologies, provider of the top-ranked distribution for Apache Hadoop, today announced the addition of Apache Drill 0.5 to the MapR Distribution including Hadoop. Bringing next-generation ANSI SQL to Hadoop, Apache Drill provides instant, self-service data exploration across multiple data sources including modern applications.

â€œOrganizations want to provide access to data stored in Hadoop and NoSQL databases to a broader set of users with existing SQL analysis skills,â€ said Matt Aslett, research director, data platforms and analytics, 451 Research. “Apache Drill’s ability to provide access to data in Hadoop without the need for centralized schemas and also to NoSQL datasets withÂ complex data structures including nested and repeated fields differentiates it fromÂ traditional approaches to SQL-on-Hadoop.”Â

Apache Drill provides the flexibility to immediately query complex data in native formats, such as schemaless data, nested data, and data with rapidly-evolving schemas, with minimal IT involvement. Because SQL queries can run directly on various file formats, live data can be explored as it is coming in, versus spending weeks preparing and managing schemas and setting up ETL tasks. Additionally, Apache Drill supports ANSI SQL so users can easily leverage their SQL skills and existing investments in business intelligence (BI) tools.

â€œThe vision and innovation that the Apache Drill community has brought to the marketplace heralds a new era of data exploration,â€ said John Schroeder, CEO and cofounder of MapR Technologies.Â â€œThe agility to directly query self-describing data and the flexibility to process complex data types push the envelope in big data analysis and insight. We are extremely excited by the potential of Drill to transform data-driven companies.â€Â

Organizations that use Apache Drill benefit from:

High-performance analysis of data in its native format includingself-describing datasuch as Parquet, JSON files and HBase tables
Direct querying of data in HBase tables without defining and maintaining a parallel/overlay schema in the Hive metastore
Intuitive SQL extensions to query and work with semi-structured/nested data, such as data from NoSQL stores like MongoDBand online REST APIs
Queries that simultaneously combine different Hadoop data sources such as files, HBase tables, and Hive tablesÂ

Developers and analysts can leverage existing SQL skillsets and BI tools to:

Minimize switching costs and the learning curve for users viathe familiar ANSI SQL syntax
Continue using familiar BI/analytics tools such as Excel, Tableau and a host of others using standard ODBC/JDBC drivers
Enable ad-hoc/low-latency queries on existing Hive tables. Reuse Hivemetadata, hundreds of file formats and user defined functions (UDFs) out of the boxÂ

Availability

Apache Drill 0.5 with the MapR Distribution including Hadoop is currently available.

MapR redefines SQL-on-Hadoop with Apache Drill

Organizations that use Apache Drill benefit from:

Developers and analysts can leverage existing SQL skillsets and BI tools to:

Availability

Â© Technuter.com News Service

Leave a Reply Cancel reply

Organizations that use Apache Drill benefit from:

Developers and analysts can leverage existing SQL skillsets and BI tools to:

Availability

Â© Technuter.com News Service

You May Also Like

Leave a Reply Cancel reply