Actian Vector in Hadoop Turbocharges Spark Performance

PALO ALTO, Calif. – September 19, 2017Actian, the hybrid data management, analytics and integration company, today announced support for Apache® Spark™ in the latest release of Actian Vector in Hadoop (VectorH), providing enterprises with access to an expansive range of data types and faster and richer analytic functionality than ever before. Actian’s pioneering Vector technology exploits vectorized processing and multi-level, in-memory acceleration to deliver industry-leading performance on Hadoop data stores. The Actian Vector family of analytic databases offers innovative, flexible, best-fit solutions for single-server, clustered and hybrid transactional/analytic computing environments, spanning on-premises and cloud deployments.

“Customers facing the challenges of today’s dynamic, hybrid data environments are demanding faster, more nimble solutions to activate that data, to unlock the business value and insights, using the skills and tools they already know,” said Rohit De Souza, CEO of Actian. “Actian now offers the power of VectorH through Spark to support a diverse range of file formats and workloads including machine learning, delivering performance and operational advantages over open source and proprietary alternatives. Imagine how much more productive your data scientists and business analysts can be with results in seconds, not hours.”

Actian VectorH opens up rich analytics functionalities for developers and data scientists seeking to harness the power of Apache® Spark™ powered capabilities including machine learning, streaming and predictive analytics, and creates a modern production platform that offers security, real-time data updates with zero performance penalty, resource management in the Hadoop cluster, query optimization and industry-standard SQL support.

Highlights of the Actian VectorH Analytics Database:

  1. Scale-Out Hadoop Performance – dramatically improved performance with Actian Vector technology – VectorH enables the extensibility of the world’s fastest database from single node processing to accelerate performance in Hadoop clusters. Workloads of standard benchmark queries that typically take over two hours with traditional SQL solutions on Hadoop finish in less than a minute running on VectorH.
  2. Zero-Penalty, Real-Time Data Updates – Unlike traditional Hadoop analytics solutions, VectorH can process real-time data updates without any associated performance penalty, an essential capability to ensure that an organization’s analytic insight is always current, leveraging its freshest data. Organizations no longer have to give up consistency for performance.
  3. Spark-Powered Direct Query Access – Through its innovative native Spark support VectorH delivers optimized access to Hadoop data file formats including Parquet and ORC, circumventing the need to translate and store data separately into the Vector file format. This direct access includes the ability to perform functions like SQL joins across different table types.
  4. Native Spark DataFrame Support – Direct connection to Spark functionality via DataFrames, enabling VectorH to serve as a faster query execution engine for SparkSQL and Spark R applications.

“As an innovative provider of leading network monitoring solutions to manage global transportation and mobile systems, Expandium pushes the edge of Big Data technologies, starting with Actian Vector,” said Rodolphe Guillard, software team leader, Expandium. “With explosive growth in mobile data, we’ve developed our new network intelligence platform to scale up on Actian VectorH to perform near real-time data ingestion in a production environment.  We’re excited about employing Actian’s new native Spark integration to stream data to machine learning solutions to sustain our technical leadership.”

The new version of Actian VectorH will be released at the end of October. To learn more, visit here or stop by booth #350 at the Strata Data Conference taking place September 25-28 at Javits Center in New York, NY.

You can also explore the single-server version of Actian Vector running on Linux, distributed free as a community edition, available for download here.

About Actian – Activate Your Data™

Actian, the hybrid data management, analytics and integration company, delivers data as a competitive advantage to thousands of customers worldwide. Through the deployment of innovative hybrid data technologies and solutions Actian ensures that business critical systems can transact and integrate at their very best – on-premises, in the cloud or both. Thousands of forward-thinking organizations around the globe trust Actian to help them solve the toughest data challenges to transform how they run their businesses, today and in the future. For more, visit

“Actian” and “Activate your Data” are trademarks of Actian Corporation and its subsidiaries. All other trademarks, trade names, service marks, and logos referenced herein belong to their respective companies.


Jeff Veis
SVP & Chief Marketing Officer, Actian Corporation


Elizabeth Somerville
PAN Communications