Vertica
Vertica is an analytic database management software company.[1][2] Vertica was founded in 2005 by the database researcher Michael Stonebraker with Andrew Palmer as the founding CEO. Ralph Breslauer and Christopher P. Lynch served as CEOs later on.
Industry | Database management & Data warehousing |
---|---|
Founded | 2005 |
Founder | Andrew Palmer and Michael Stonebraker |
Headquarters | Cambridge, MA, United States |
Key people | Mark Barrenechea (CEO and CTO) |
Products | Vertica Analytics Database, Vertica SQL on Data Lake |
Parent | OpenText |
Website | www |
Lynch joined as Chairman and CEO in 2010 and was responsible for Vertica's acquisition by Hewlett Packard in March 2011.[3][4] The acquisition expanded the HP Software portfolio for enterprise companies and the public sector group.[5] As part of the merger of Micro Focus and the Software division of Hewlett Packard Enterprise, Vertica joined Micro Focus in September 2017.[6] As part of OpenText acquisition of Micro Focus, Vertica joined OpenText in January 2023.
Products
The column-oriented Vertica Analytics Database was designed to manage large, fast-growing volumes of data and with fast query performance for data warehouses and other query-intensive applications. The product claims to greatly improve query performance over traditional relational database systems, and to provide high availability and exabyte scalability on commodity enterprise servers. Vertica runs on multiple cloud computing systems as well as on Hadoop nodes. Vertica's Eon Mode separates compute from storage, using S3 object storage and dynamic allocation of compute notes.[7]
Vertica's design features include:
- Column-oriented storage organization, which increases performance of sequential record access at the expense of common transactional operations such as single record retrieval, updates, and deletes.[8]
- Massively parallel processing (MPP) architecture to distribute queries on independent nodes and scale performance linearly.
- Standard SQL interface with many analytics capabilities built-in, such as time series gap filling/interpolation, event-based windowing and sessionization, pattern matching, event series joins, statistical computation (e.g., regression analysis), and geospatial analysis.
- In-database machine learning including categorization, fitting and prediction without down-sampling and data movement. Vertica offers a variety of in-database algorithms, including linear regression, logistic regression, k-means clustering, Naive Bayes classification, random forest decision trees, XGBoost, and support vector machine regression and classification. It also allows deployment of ML models to multiple clusters.
- High compression, possible because columns of homogeneous datatype are stored together and because updates to the main store are batched.[9]
- Automated workload management, data replication, server recovery, query optimization, and storage optimization.
- Native integration with open source big data technologies like Apache Kafka and Apache Spark.
- Support for standard programming interfaces, including ODBC, JDBC, ADO.NET, and OLEDB.
- High-performance and parallel data transfer to statistical tools and built-in machine learning algorithms.[10][11]
Vertica's specialized approach aims to significantly increase query performance in data warehouses, while reducing hardware costs.[12]
Since 2011, Vertica has offered a limited-capacity community edition for free.[13]
In July, 2021, Vertica announced an SaaS offering, Vertica Accelerator, running on Amazon AWS.[14]
Optimizations
Vertica originated as the C-Store column-oriented database, an open source research project at MIT and other universities, published in 2005.[15][16]
Vertica runs on clusters of commodity servers or on commercial clouds. It integrates with Hadoop,[17] using HDFS.
In 2018, Vertica introduced Vertica in Eon Mode, a separation of compute and storage architecture. The Eon architecture allows for elastic increase and decrease in compute capability as needed for workload elasticity. It also allows instantiation of multiple isolated sub-clusters dedicated to different workloads while maintaining a single shared data repository. It operates on shared object storage in the cloud, and also runs on object storage compatible hardware on-premises for private cloud implementations.
Version 10.1.1 of Vertica introduced Docker and Kubernetes support.[18]
Many BI, data visualization, and ETL tools work with Vertica Analytics Platform. Vertica supports Kafka for streaming data ingestion.
In 2021, Vertica released a connector for Spark.[19]
Vertica also integrates with Grafana, Helm, Go, and Distributed R.[20]
Company events
In January 2008, Sybase filed a patent-infringement lawsuit against Vertica.[21] In January 2010, Vertica prevailed in a preliminary hearing,[22] and in June, 2010, Sybase and Vertica resolved the suit, with the court dismissing all infringement claims.[23]
Since 2013, Vertica has held an annual user conference, now called Vertica Unify.
References
- Network World staff: "New database company raises funds, nabs ex-Oracle bigwigs", LinuxWorld, February 14, 2007
- Brodkin, J: "10 enterprise software companies to watch", Archived 2007-05-18 at the Wayback Machine Network World, April 11, 2007
- HP News Release: "HP to Acquire Vertica: Customers Can Analyze Massive Amounts of Big Data at Speed and Scale" Feb. 2011
- HP News Release: "HP Completes Acquisition of Vertica Systems, Inc." March 22, 2011.
- ComputerWorld.com: "Update: HP to buy Vertica for analytics." Kanaracus. Feb. 2011.
- SiliconAngle: "Vertica survives software industry turmoil to emerge as key cloud and big data player" Albertson.
- Press Release: "Micro Focus Announces Vertica in Eon Mode for Pure Storage" Sept 17, 2019
- Monash, C: "Are row-oriented RDBMS obsolete?" DBMS2, January 22, 2007
- Monash, C: "Mike Stonebraker on database compression – comments",DBMS2, March 24, 2007
- Gagliordi, Natalie. "HP adds scale to open-source R in latest big data platform". ZDNet. Retrieved 17 February 2015.
- Prasad, Shreya; Fard, Arash; Gupta, Vishrut; Martinez, Jorge; LeFevre, Jeff; Xu, Vincent; Hsu, Meichun; Roy, Indrajit (2015). "Enabling predictive analytics in Vertica: Fast data transfer, distributed model creation and in-database prediction". ACM SIGMOD International Conference on Management of Data.
- One Size Fits All? Part 2: Benchmarking Results (sect. 3.1)
- "Vertica Announces Community Edition Version of Vertica Analytic Database". Archived from the original on July 4, 2015. Retrieved August 17, 2016.
- PR Newswire: "Vertica Announces Early Access of Vertica Accelerator" Micro Focus. June 15, 2021.
- Stonebraker, Mike; Abadi, Daniel J.; Batkin, Adam; Chen, Xuedong; Cherniack, Mitch; Ferreira, Miguel; Lau, Edmond; Lin, Amerson; Madden, Sam; O'Neil, Elizabeth; O'Neil, Pat; Rasin, Alex; Tran, Nga; Zdonik, Stan (2018). "C-store: a column-oriented DBMS". In Brodie, Michael L. (ed.). Making Databases Work: The Pragmatic Wisdom of Michael Stonebraker. Association for Computing Machinery/Morgan & Claypool. pp. 491–518. doi:10.1145/3226595.3226638. ISBN 9781947487192. S2CID 3439184.
- "The Vertica Analytic Database: C-Store 7 Years Later" (PDF). VLDB. August 28, 2012.
- "Vertica-Hadoop integration". DBMS2. October 12, 2010.
- Vertica Blog: "Vertica 10.1.1 Goes Beyond Analytics with Support for Azure Cloud, Kubernetes, and Containers" Healey. April 30,2021
- "Vertica Spark Connector". GitHub. 25 February 2022.
- "Vertica". GitHub.
- Sybase, Inc. v. Vertica Systems, Inc. (Texas Eastern District Court January 30, 2008).Text
- Monash, C: "Vertica slaughters Sybase in patent litigation",DBMS2, January 14, 2010
- Vertica Press Release, "Vertica Resolves Sybase Patent Lawsuits" http://www.vertica.com/news/press/vertica-resolves-sybase-patent-lawsuits/