5 Big Data Moves At Hadoop World4:00 PM EST Thu. Oct. 25, 2012
Organizations are quickly recognizing that the Apache Hadoop software programming framework can meet the growing need to manage the processing of massive application datasets of big data, and the explosive growth of the cloud creating large hosting centers is adding to that need.
At the O'Reilly Strata Conference and Hadoop World 2012 taking place this week in New York, vendors are rushing to announce their support for Hadoop.
Continue on and see five announcements furthering Hadoop adoption.
Microsoft jumped into the big data world, announcing a beta release of Hadoop for the Windows operating system.
The company unveiled Hadoop previews for Windows Azure, called Windows Azure HDInsight Service, and for Windows Server, named Microsoft HDInsight Server for Windows.
Microsoft said the previews will enable Apache Hadoop compatibility for the enterprise and simplify deployment of Hadoop-based programs.
These Hadoop data management capabilities on the Windows Server and Azure platforms will help customers use Microsoft applications such as Excel, PowerPivot for Excel and Power View to pull insights from the data.
Microsoft is also expanding its partnership with commercial Hadoop vendor Hortonworks to distribute an enterprise version of Hadoop.
VMware unveiled an update to Project Serengeti, its software for running Hadoop in virtualized environments.
Serengeti allows deployment of standard Apache Hadoop clusters on a virtual platform, with no need to buy additional hardware or software.
The company said the new version supports vSphere, VMware's cloud computing virtualization operating system. The project is endorsed by all major Hadoop distributions including Cloudera, Greenplum, Hortonworks and MapR.
The new version of Serengeti offers Dynamic Elastic Scaling, a community contributed user interface and the ability to configure infrastructure topologies.
Cloudera released a real-time query engine for Hadoop, called Impala. The engine will allow Cloudera's platform for managing massive data processing, Cloudera Enterprise, to handle real-time as well as batch operations on any type of data in one system.
Impala is an Apache-licensed engine for managing data stored in Hadoop Distributed File System (HDFS) and Hbase. Cloudera Enterprise RTQ (Real-time Query) offers management and support capabilities needed to run Cloudera Impala in production environments.
SAP released a software suite integrating Hadoop with its database and analytics products, creating a "Big Data bundle."
The bundle includes the SAP HANA in-memory database technology platform and combines the SAP Sybase IQ server, SAP Data Integrator software and SAP BusinessObjects business intelligence (BI) solutions.
These packages will be offered with SAP partners including HP, Cloudera, Hitachi Data Systems, Hortonworks and IBM.
SAP said customers can integrate Hadoop into their existing BI and data warehousing environments in multiple ways to fit their needs.
SGI, a high-performance computing company and Hadoop big data provider, and MarkLogic, an enterprise NoSQL database vendor, unveiled SGI DataRaptor appliance with MarkLogic database.
SGI DataRaptor is an integrated and optimized hardware and software solution aimed at helping businesses to generate reliable, scalable and secure big data results.
The SGI DataRaptor appliance integrates the MarkLogic NoSQL database and SGI systems to allow companies to capture and manage large amounts of data.
The SGI DataRaptor comes in two configurations for either maximum performance or maximum capacity needs. Servers come with more than 300 processing cores and over 2600 GB of memory and storage ranging from 300 to over 500 TB of data capacity. SGI DataRaptor is available for order now.