On Sun, Oct 18, 2009 at 10:31 PM, Sanjay Sharma wrote:
I do not work for Vertica, but they were my neighbors in Hadoop World NYC
where they did talk about a Hadoop MapReduce connector.
http://www.vertica.com/company/news/vertica-analytic-database-broadens-reach-with-flexstore-
"With version 3.5, Vertica also introduces native support for MapReduce via
connectivity to the standard Hadoop framework. The Vertica 3.5 interface to
Hadoop gives MapReduce developers the ability to perform highly scalable
in-database analytics by making it easy to store and retrieve data from the
equally scalable Vertica Analytic Database. This unique combination of MPP
analytic database and MPP compute framework gives enterprises the
flexibility to process large sets of structured and unstructured data and
make them available to business users at Web speeds."
BTW, Greenplum, Aster Data and Vertica all claim some kind of interface
with Hadoop now.
Also, IMHO, their focus is certainly on working ALONG with Hadoop
MapReduce. The symbiosis can be seen as Hadoop MapReduce handling the
unstructured data, with the structured part handled by these
column-based databases over a rich SQL interface.
It would be interesting to know how HBase and Hive/Pig figure in their plans.
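To make the division of labour described above concrete, here is a minimal sketch in plain Python of a map step that pulls a field out of unstructured log lines and a reduce step that turns the result into structured rows that could then be loaded into a SQL database. It only mimics the MapReduce pattern in memory; it is not the Vertica connector or any vendor's API, and the log format, the regular expression, and the function names (map_line, reduce_counts, to_rows) are assumptions made for illustration.

```python
import re
from collections import defaultdict

# Assumed log format: any line containing a quoted HTTP GET request.
GET_REQUEST = re.compile(r'"GET (?P<path>\S+)')

def map_line(line):
    """Map step: emit (path, 1) for each line that contains a GET request."""
    match = GET_REQUEST.search(line)
    if match:
        yield (match.group("path"), 1)

def reduce_counts(pairs):
    """Reduce step: sum the values per key and return sorted (key, total) rows."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return sorted(totals.items())

def to_rows(lines):
    """Run map then reduce over raw lines, producing structured rows."""
    pairs = (pair for line in lines for pair in map_line(line))
    return reduce_counts(pairs)

# Hypothetical sample input, including one line that does not parse.
sample_lines = [
    '1.2.3.4 - - [17/Oct/2009] "GET /index.html HTTP/1.0" 200',
    '5.6.7.8 - - [17/Oct/2009] "GET /about HTTP/1.0" 200',
    '1.2.3.4 - - [18/Oct/2009] "GET /index.html HTTP/1.0" 200',
    'corrupted line with no request',
]
rows = to_rows(sample_lines)
```

Each resulting row is a (path, hits) tuple, which is the kind of tabular output a column-oriented database would then serve over SQL.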
From: Arijit Mukherjee
Sent: Monday, October 19, 2009 10:18 AM
Subject: Re: Hive vs. Vertica
I've been in touch with Vertica for the past year. The main concept
behind Vertica is column orientation, which in turn allows a high degree
of compression and faster query processing (mainly when retrieving large
data sets), as it reads only the columns required instead of fetching
the entire row and applying a projection to it. It's Mike Stonebraker's
brainchild and builds on the original C-Store.
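As a rough illustration of the column-orientation idea, the sketch below stores the same toy table row-wise and column-wise, answers a query by reading a single column, and run-length encodes a column with repeated values. It is a simplified model under stated assumptions, not how Vertica or C-Store is actually implemented; the table contents, column names, and helper functions are all made up for the example.

```python
from itertools import groupby

# Hypothetical toy table; column names and values are illustrative only.
COLUMNS = ("day", "country", "clicks")
rows = [
    ("2009-10-17", "US", 10),
    ("2009-10-17", "US", 12),
    ("2009-10-17", "UK", 7),
    ("2009-10-18", "UK", 3),
]

def to_columns(rows):
    """Pivot a list of row tuples into one list per column."""
    return {name: [row[i] for row in rows] for i, name in enumerate(COLUMNS)}

def run_length_encode(values):
    """Collapse runs of equal adjacent values into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]

def sum_clicks_row_store(rows):
    # Row layout: every full row is visited, then projected to one field.
    return sum(row[2] for row in rows)

def sum_clicks_column_store(columns):
    # Column layout: only the "clicks" column is read.
    return sum(columns["clicks"])

columns = to_columns(rows)
day_rle = run_length_encode(columns["day"])
```

Both sum functions return the same total, but the column version never touches "day" or "country". Values within one column tend to repeat, which is why simple schemes such as run-length encoding compress them well.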
Performance is very impressive on not-so-high-end hardware - loading
is very fast, as are the queries. But, as of now, it does not support
map-reduce. This is supported in Greenplum (another DW datastore) -
but my experience with Greenplum was not so good - performance-wise -
and it needs quite high-end machines.
2009/10/17 Edward Capriolo <email@example.com>:
On Sat, Oct 17, 2009 at 1:11 AM, Ryan LeCompte wrote:
I am trying to understand Vertica and how it applies to the Hadoop
this basically a way to store large amounts of data and run SQL-like
on it that also result in map/reduce jobs Hadoop/Hive? Or am I trying
compare apples and oranges? If not, are Vertica queries faster in