www-announce mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sally Khudairi ...@apache.org>
Subject [Announce] The Apache Software Foundation Announces Apache Sqoop as a Top-Level Project
Date Mon, 02 Apr 2012 13:02:16 GMT
[this announcement is also available online at http://s.apache.org/mU]

Open Source big data tool used for efficient bulk transfer between Apache Hadoop and structured

Forest Hill, MD --The Apache Software Foundation (ASF), the all-volunteer developers, stewards,
and incubators of nearly 150 Open Source projects and initiatives, today announced that Apache
Sqoop has graduated from the Apache Incubator to become a Top-Level Project (TLP), signifying
that the Project’s community and products have been well-governed under the ASF's meritocratic
process and principles.

Designed to efficiently transfer bulk data between Apache Hadoop and structured datastores
such as relational databases, Apache Sqoop allows the import of data from external datastores
and enterprise data warehouses into Hadoop Distributed File System or related systems like
Apache Hive and HBase.

"The Sqoop Project has demonstrated its maturity by graduating from the Apache Incubator,"
explained Arvind Prabhakar, Vice President of Apache Sqoop. "With jobs transferring data on
the order of billions of rows, Sqoop is proving its value as a critical component of production

Building on the Hadoop infrastructure, Sqoop parallelizes data transfer for fast performance
and best utilization of system and network resources. In addition, Sqoop allows fast copying
of data from external systems to Hadoop to make data analysis more efficient and mitigates
the risk of excessive load to external systems. 

"Connectivity to other databases and warehouses is a critical component for the evolution
of Hadoop as an enterprise solution, and that's where Sqoop plays a very important role" said
Deepak Reddy, Hadoop Manager at Coupons.com. "We use Sqoop extensively to store and exchange
data between Hadoop and other warehouses like Netezza. The power of Sqoop also comes in the
ability to write free-form queries against structured databases and pull that data into Hadoop."

"Sqoop has been an integral part of our production data pipeline" said Bohan Chen, Director
of the Hadoop Development and Operations team at Apollo Group. "It provides a reliable and
scalable way to import data from relational databases and export the aggregation results to
relational databases."

Since entering the Apache Incubator in June 2011, Sqoop was quickly embraced as an ideal SQL-to-Hadoop
data transfer solution. The Project provides connectors for popular systems such as MySQL,
PostgreSQL, Oracle, SQL Server and DB2, and also allows for the development of drop-in connectors
that provide high speed connectivity with specialized systems like enterprise data warehouses.

Craig Ling, Director of Business Systems at Tsavo Media, said "We adopted the use of Sqoop
to transfer data into and out of Hadoop with our other systems over a year ago. It is straight
forward and easy to use, which has opened the door to allow team members to start consuming
data autonomously, maximizing the analytical value of our data repositories."

Availability and Oversight
Apache Sqoop software is released under the Apache License v2.0, and is overseen by a self-selected
team of active contributors to the project. A Project Management Committee (PMC) guides the
Project's day-to-day operations, including community development and product releases. Apache
Sqoop source code, documentation, mailing lists, and related resources are available at http://sqoop.apache.org/.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees nearly one hundred fifty leading
Open Source projects, including Apache HTTP Server — the world's most popular Web server
software. Through the ASF's meritocratic process known as "The Apache Way," more than 350
individual Members and 3,000 Committers successfully collaborate to develop freely available
enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions
are distributed under the Apache License; and the community actively participates in ASF mailing
lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings,
and expo. The ASF is a US 501(3)(c) not-for-profit charity, funded by individual donations
and corporate sponsors including AMD, Basis Technology, Cloudera, Facebook, Google, IBM, HP,
Hortonworks, Matt Mullenweg, Microsoft, PSW Group, SpringSource/VMware, and Yahoo!. For more
information, visit

"Apache", "Apache Sqoop", and "ApacheCon" are trademarks of The Apache Software Foundation.
All other brands and trademarks are the property of their respective owners.

#  #  #
= = = = = 
Boston +1 617 921 8656
New York +1 917 725 2133
London +44 (0) 20 3239 9686
skype sallykhudairi

View raw message