Now a Big Data Appliance from Oracle

The Hadoop-based appliance is claimed to require minimal configuration and setup

Oracle forged a partnership with pioneering Apache Hadoop interface/tools provider Cloudera to create a specific distribution for the new Big Data Appliance.

Even though Oracle launched its 12-petabyte data-storage Exadata server in 2008 and its cloud database/analytics Exalogic server to equally high visibility two years later, it still believes it needs a product that is actually named "big data" so there's no mistaking its purpose.

So on Jan. 10 it launched a new data center machine called the Oracle Big Data Appliance, which follows in the footsteps of many Oracle products: Engineered-together hardware and software, previously tested and configured, and stamped with the Oracle and partners "seal of approval" (though not literally). It's all but a turnkey workhorse to provide analyses of large batches of data.

Naturally, there must be specialized software to go with it. So, also on Jan. 10, Oracle announced that it has forged a partnership with pioneering Hadoop analytics software provider Cloudera to create a specific Apache Hadoop distribution and tools for the Big Data Appliance.

"A good number of our customers want to just 'load and go' when it comes to big-data batch (warehousing and analytics) use cases," Cetin Ozbutun, Oracle vice president of Data Warehousing Products, told eWEEK (which has a content partnership with IT Next). "There is minimal configuration and setup with this appliance, and we've found that customers can save weeks or even months on getting their deployments up and running."

The Big Data Appliance is designed to provision a highly available and scalable system for managing and analyzing massive amounts of data in Hadoop and using R (the open-source programming language) on raw data sources, Ozbutun said.

"It also simplifies and controls costs for the big data analytics process by re-integrating all hardware and software components into a single big data package that complements (existing) enterprise data warehouses," Ozbutun said.

Cloudera: A Hadoop Interface Pioneer
Cloudera's was the first commercial implementation of the open-source data analytics package that came out of Yahoo's R&D division in 2006. Cloudera is an equal-opportunity Hadoop developer; last August, the company signed up to provide Oracle competitor Dell with a distribution of its own.

Hadoop is complicated software to deploy and utilize, and it lacked a user-friendly front end until Cloudera and others came in to add their interface expertise. Using Cloudera, Oracle's Big Data Appliance gives users a single source to deploy, manage and scale this new Apache Hadoop-based stack.

On the hardware side, the Big Data Appliance comes in a full-rack configuration of 18 Oracle Sun servers with a total of 864GB main memory; 216 CPU cores; 648TB of raw disk storage; 40G bps InfiniBand connectivity between nodes and other Oracle engineered systems; and 10G bps Ethernet data center connectivity.

The new engineered-together system scales by connecting multiple racks together via an InfiniBand network--like Exadata and Exalogic--which enables it to acquire, organize and analyze extreme-size data volumes consisting of machine- and/or human-created data streams, Ozbutun said.

Included in the Big Data Appliance rack is Cloudera Manager and the Oracle NoSQL Database, which can scale horizontally to hundreds of nodes with high availability. The appliance can run both Oracle NoSQL Database Community and Enterprise Editions, Ozbutun said.

The appliance, which was announced at Oracle OpenWorld last September and costs $450,000, includes Oracle's Linux operating system to run the 18 Oracle Sun servers. Oracle's Big Data Connectors to other data warehouse cost $2,000 per processor license.

Jordan Jumpman Pro AJ12.5


Add new comment