Data Platform

Optimizing Cloud Compute and Data Storage for Data Analytics

Actian Corporation

October 24, 2019

Cloud online storage technology concept

When architecting your data warehouse solution, separating compute and data storage is extremely important for both operational sustainability and economic efficiency. Different things drive the technical needs for each of these, the capacity demands of the organization are different, and the best solution requires optimizing compute and data storage separately.

Data Storage Capacity is a Function of Time

The amount of data storage that your company needs is directly related to the number of business activities you’re doing. As you conduct business, you generate data – data about your customers, your products, your sales, etc. Over time, the amount of data volume your company has will grow. In busy times, the growth rate may be faster than in slow times, but the volume is always increasing. Taking this growth into account is essential when you are architecting your storage solution because the cost is directly related to data volume.

For on-premises data storage, you will need to acquire capacity in advance, based on projected data storage needs. For cloud-based storage solutions that are billed based on utilization, you will need to project your cost growth over time.

Compute Capacity is Driven by Business Trends

Compute capacity for analytics solutions is only partially influenced by the volume of data you are analyzing. The more significant factor at play is the demand for data consumption – during peak business times, the demand is higher, and during slow times, the demand is lower. Consider the example of black Friday in retail.  Business activity spikes, and the demand for analytics about the business activities spike too. A couple of months later, in early January, retail sales slowed, and there was also less demand for analytics. Whether you are talking about retail sales, the launch of a new product/service, or quarter-end financial close, every business has seasonality trends that cause their demand for compute capacity to vary significantly.

For on-premises compute solutions, capacity must be purchased and reserved to accommodate peak performance loads. That means that during slow periods, there is excess capacity. For cloud-based compute solutions where billing is based on utilization, capacity can be scaled up during peak periods and scaled back down during slow periods.

Developing a Hybrid Demand Forecast is Nearly Impossible

The capacity and performance requirements for both compute and data storage environments vary over time based on the activities of the business. The demand curves for each of these solutions look very different from the other, making cost and capacity modeling based on a combined architecture is both difficult and inefficient. Rather than invest the time and resources on-demand and cost forecasting, most companies find it much easier to separate compute and data storage into separate solutions with independent cost models and demand forecasts.

Technology is Changing

Cloud-based technology capabilities are improving and changing at a tremendous pace. When it comes to data analytics and cloud data warehousing, not only is the technology getting better every day, but certain areas are evolving faster than others. For example, the density of cloud-based storage solutions is causing the per-unit cost of data storage to decline in alignment with the deployment of new hardware by cloud service providers. Compute capabilities in the cloud are improving in both capacity/scale with new distributed compute architectures, and in speed/performance with new hardware. While a company may decide to forego a storage upgrade due to the migration costs, leveraging newer compute capabilities may be advantageous.

Separating compute and data storage solutions give companies greater flexibility in upgrading parts of their architecture while leaving other parts alone. When it comes to data analytics, there are a lot of moving parts. Data volumes are increasing. Analytics and compute demands (both performance and capacity) are going up and down with business trends. Developing and executing on an accurate forecast is nearly impossible. All the while, the technology is continuously evolving and the business is demanding better economic performance from IT investments. Companies that are thriving in this environment know that keeping solutions simple and maintaining the highest level of technical flexibility is the key to success. Separating compute and data storage is an essential part of giving you the most options to optimize the data analytics on which your company depends.

Actian Data Platform on Azure provides the flexibility organizations need to optimize the ratio of compute and data storage to meet the performance objectives of the application. Learn more about the Actian Data Platform at www.actian.com/data-platform

Learn Modernization Best Practices From Industry Experts and Insiders

If you are thinking about modernizing your enterprise data warehouse, watch our on-demand webinars featuring leading industry analysts and former executives from Teradata and Netezza.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale. Actian data intelligence solutions help streamline complex data environments and accelerate the delivery of AI-ready data. Designed to be flexible, Actian solutions integrate seamlessly and perform reliably across on-premises, cloud, and hybrid environments. Learn more about Actian, the data division of HCLSoftware, at actian.com.
Product Launches

Actian Data Platform on Microsoft Azure

Emma McGrattan

October 21, 2019

Find Actian on Microsoft Azure

Today we are excited to announce the release of Actian Data Platform on Microsoft Azure. Azure is growing significantly as a platform in the enterprise space and becoming the de facto choice for retail analytics. Hence, making Actian Data Platform available on Microsoft Azure has been a priority for us.

With this release, Actian Data Platform is now available on Microsoft Azure, AWS, and on-premises, delivering on our hybrid and multi-cloud vision. We have added new features to this release, like independent scaling of compute and storage resources. This is particularly appealing to those customers who have large amounts of data which is growing quickly but may not need compute to scale at the same pace. In addition, it will also benefit customers with known compute peaks like retail experiences during holidays or during promotional periods.

True Platform That Analyzes Data Where it Naturally Resides

With the addition of Microsoft Azure, Actian’s customers now have another choice as to where they deploy their analytics. For those with data lakes that span multiple clouds and on-premises, Actian provides a unifying solution that brings analytics to all of that data from a single pane of glass.  When we engage with prospects, they typically tell us that they wish to simplify their data ecosystem and bring the analytics capabilities to the data, rather than duplicating all of their data assets in a cloud data warehouse environment. Our approach is not only simpler and cheaper, it also offers greater security.

High-Performing Analytics That Thrives Under Demanding Scenarios

Actian has a long history of delivering record-breaking analytics performance, and Actian Data Platform on Azure is no different. It offers significant performance advantages over the competition and that advantage grows with data volumes, user volumes and query complexity. Actian Data Platform is designed to support large numbers of concurrent users and when you couple this with the fact that query execution times can be measured in milliseconds, it’s the ideal platform for organizations that wish to equip every business decision-maker in the organization with access to all of the data required to make the most informed decision possible.

The Actian Data Platform’s ability to handle mixed workloads means that data discovery, ad-hoc querying, batch reporting, and real-time data updates can all be happening simultaneously without the need to reconfigure the environment for each different use case and without any one of these use cases feeling the impact of the others.

Try Actian Data Platform With Your Data

You can try out Actian Data Platform  and experience for yourself the high performing engine of Actian Data Platform. I’d love to hear your feedback, so don’t be shy!

emma mcgrattan blog

About Emma McGrattan

Emma McGrattan is CTO at Actian, leading global R&D in high-performance analytics, data management, and integration. With over two decades at Actian, Emma holds multiple patents in data technologies and has been instrumental in driving innovation for mission-critical applications. She is a recognized authority, frequently speaking at industry conferences like Strata Data, and she's published technical papers on modern analytics. In her Actian blog posts, Emma tackles performance optimization, hybrid cloud architectures, and advanced analytics strategies. Explore her top articles to unlock data-driven success.
Data Platform

Getting Started With Actian Data Platform on Azure

Actian Corporation

October 21, 2019

data integration with Actian

Actian Data Platform (formerly Avalanche) on  Microsoft Azure launched this week.  This post will walk you through getting set up on Actian Data Platform on Azure. You must first have an Actian account as a customer of Actian 

After you log in to the Actian service, you will see the below screen in the center of the Actian console: 

actian console

To create your first warehouse, click the green Create your first Cluster button, which will load the below screen to get you set up: 

actian data platform warehouse cluster

All fields depicted above are required fields. After filling in the name of your cluster and location (e.g. East US for where your cluster will be located), you will also need to provide what size cluster you would like to create and enter an IP address to allow access.  

For the cluster size, Actian uses Avalanche Units (AU) to specify the size of your warehouse. AUs are a representation of the cluster’s size, so 2AUs would be a small cluster, and 4AUs would be a larger cluster. Actian offers up to 128AU clusters. AU size of the cluster is not directly linked to the data storage size; your data storage will scale independently of the compute for the cluster. If you are using a trial, Actian recommends using a 2AU cluster to get started, as larger clusters have higher costs. Since the a is associated with a specified amount of credits, a 2AU cluster will provide the maximum amount of time for you to work with Actian 

For the IP address field, if you will be connecting to the cluster from your current location, you can click the Allow List Application IP(s) field, and a small box will pop up that states Use my IP – if you select that box, your IP will auto-populate into the IP address field. The IP address that needs to be added on the Allow List is the IP address that you will be using to connect to the Actian cluster via ODBC, JDBC, or .Net applications. 

Once you are done with filling in the fields, click on the Create Cluster button. You will then be taken to the main dashboard and see the status of your cluster as Creating. 

custom cluster actian data platform

After the instance is provisioned, which will take about 10-15 minutes, the Creating indicator will change to Running as shown below:

creating cluster actian data platform

The Actian cluster is now available to connect to and perform some of the following activities: loading data, performing ad-hoc querying, using BI tools such as Power BI or Tableau for reportingor loading data into a Spark cluster or into a Data Science workbench. 

To get started on your journey to connect to Actian for Azure, you will need to: 

  1. Set a connection password in the Actian web interface. 
    • On the Actian main console, identify the cluster whose password you want to change. 
    • From the cluster’s Manage menu, select Set Connection Password. 
    • The Set/Update Connection Password dialog opens: 

connection password actian data platform

      • Enter and confirm the password for the dbuser user.
      • Click Set Connection Password.
      • The connection password is now in effect.
  1. Download a JDBC or Actian client runtime package.
    • To download drivers for Actian, log in to the web console and click the Driver & Tools link, which opens Electronic Software Delivery (ESD) in a new browser tab.

drivers and tools actian data platform

    • The following download packages are available from the RELEASE dropdown for Actian.
      • Actian JDBC (for any platform): includes the JDBC driver to connect to Actian from JDBC applications. To download and install this package, see Download the Actian JDBC Package. 
      • Actian Client Runtime: includes the JDBC driver, ODBC driver, and the Actian SQL Command Line Interface (CLI). To download and install this package, see Download the Actian Client Runtime Package. 
    • If you download an RPM package, you must install as root.
  1.  To connect from your favorite BI tools such as Power BI, Tableau, Looker, or your favorite SQL tools such as DBeaver and Squirrel SQL, or an application that uses ODBC/JDBCyou will need information specific to the application to connect. To get the information required, navigate to the cluster that you would like to connect to, and select the Manage menu and then select the Connect button.
    connect button actian data platform
  2.  After pressing the Connect button, a dialog will appear on-screen that provides the connection details.
    connect dialog actian data platform
    The connection string can be copied via the copy button on the right of the connection string field. Once copied, the string can be pasted into the tool you plan to connect to Actian after the Actian driver is installed and connected. 
  3. For more details on how to install the driver for an example application such as DBeaver, please refer to the following how-to link: Install DBeaver and Set Up an Actian Data Platform Driver guide.

The cluster will have sample data available to use for running SQL commands or building visualizations for your favorite query tools. For SQL commands, you can find a set of sample commands here 

If you have additional questions on getting set up with Actian, please feel free to reach out to the support team.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale. Actian data intelligence solutions help streamline complex data environments and accelerate the delivery of AI-ready data. Designed to be flexible, Actian solutions integrate seamlessly and perform reliably across on-premises, cloud, and hybrid environments. Learn more about Actian, the data division of HCLSoftware, at actian.com.
Data Integration

Using DataConnect to Acquire, Prepare and Deliver Data

Actian Corporation

October 18, 2019

dataconnect clouds

Actian recently announced the availability of the Actian Data Platform on the Azure platform, extending the capabilities already available on AWS. This announcement is exciting news for customers because it means greater flexibility and choice in selecting cloud operating environments as well as creating the opportunity for a multi-cloud deployment.

Actian DataConnect enhances the capabilities of the Actian Data Platform with a scalable Integration Platform as a Service (IPaaS) offering to help you manage connections from all of your source systems into your Actian data warehouse. With DataConnect, you will have the tools to acquire, prepare and deliver data to the Actian Data Platform with ease.

Establish Connections to Any Source

UniversalConnect provides the capabilities to establish data connections for any source, including APIs, SaaS solutions, files, databases, mainframe systems, and on-premises applications. This is important because your IT environment is diverse, but the data insights you need to span the breadth of your enterprise. With UniversalConnect, you can connect your legacy systems, existing data stores, and even third-party platforms into the Actian Data Platform to make the full enterprise dataset available for analysis and reporting.

Flexible Integration Patterns

With DataConnect, you can execute integrations using any integration pattern, including scheduled batch processing, REST APIs, event listeners, and streaming data. Having multiple integration patterns available is because not all your data connections are the same. Real-time streaming data needs to be processed quickly to harvest maximum analytical value, while some transactional data (such as that used for planning) only needs to be updated periodically. The key is making sure you have the most current and correct data available for analysis at the time that it needs to be consumed. DataConnect gives you great flexibility to select the integration patterns that are best for your business.

Design Integration Workflows With DataConnect Studio

Managing data in your business is more than establishing a bunch of connections. DataConnect Studio gives you the capabilities to design workflows to orchestrate the flow of data throughout your IT environment. For example, you might want to:

  1. Extract from a SaaS system like Salesforce or NetSuite.
  2. Standardize and enrich the data using simple functions, joins, etc.
  3. Transform into a simple format like CSV and optionally add columns to tag data or engineer additional data elements.
  4. Deliver the file to a data lake (i.e., S3, ADL).
  5. Dynamically generate SQL statements based on the file’s schema.
  6. Execute SQL commands by pushing them down to the database server for optimal processing.
  7. Create an external table reference to the delivered CSV file so that the Actian Data Platform can query against it directly.
  8. Define subsequent runs of the integration workflow to refresh the data set.

The release of the Actian Data Platform on Azure opens up a host of new opportunities for you to modernize your data management capabilities and move your data warehouse to the cloud. DataConnect provides the tools to connect all your data sources into the new cloud data warehouse environment so you can get the most value out of this investment. Actian Data Platform works best when paired with a scalable IPaaS solution like DataConnect that includes robust tools for establishing connections, acquiring data sets, preparing, and then delivering them into the Actian data warehouse. To learn more, visit DataConnect.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale. Actian data intelligence solutions help streamline complex data environments and accelerate the delivery of AI-ready data. Designed to be flexible, Actian solutions integrate seamlessly and perform reliably across on-premises, cloud, and hybrid environments. Learn more about Actian, the data division of HCLSoftware, at actian.com.