Data Platform

Five Benefits of the Hybrid Cloud Approach

Actian Corporation

September 21, 2020

Hybrid Cloud benefits

The ISO 17788 Cloud Computing Overview and Vocabulary standard defines a hybrid cloud as “a cloud deployment model using at least two different cloud deployment models.” When we talk about different deployment models, we are talking about private clouds, public clouds with multiple different cloud providers, and even legacy applications. When the different deployment models are combined, the multi-hybrid cloud is created. The multi-hybrid model allows businesses to optimize their operations.

Competitive Advantage

When a business employs a private and public cloud, it can make careful decisions regarding what applications and services are deployed to each part of the architecture. Often the investment in services that differentiate the business from the competition is kept on the private cloud. Other services that need to be provided but are not part of the core business services can be deployed to the public cloud.

It is even likely that the cloud provider may offer these necessary services so the development team can focus on driving business value. Because it is easy to provision resources, the public cloud remains a good way to manage the resources needed for data storage.

Deployment to a Cloud Provider

There are many cloud service providers, but the top three in several assessments related to market share are Amazon Web Services (AWS), Microsoft Azure, and Google Cloud. A hybrid cloud solution for a business can include one or more providers. While all of the cloud providers offer similar basic services, they all have unique offerings that need to be reviewed for applicability to a business need. For example, GCP offers Cloud Interconnect, a dedicated connection option that bypasses the need for an Internet Service Provider. Cloud Interconnect allows Google to offer Service Level Agreements that include guaranteed uptimes as high as 99.99%.

Data Residency

Business needs for hybrid clouds include the performance of the network when accessing data and applications. There are two factors to consider, latency and throughput. Latency is defined as the time taken for a data transaction request to complete a round trip between the sender and the receiver. One of the many options offered by the various cloud providers is the location of their servers. In the case of AWS, Azure, and Google Cloud, the servers are spread out throughout the world. The location of the data and applications impacts the latency of serving that data. This is especially important for transactional database access. Each of the cloud service providers offers a simple web-based method for determining network latency.

Beyond the questions of performance, there are other issues concerned with data residency. For example, there are multiple categories of sensitive data, including personal data, trade controlled data, regulated data. The location and movement of data needs to be understood, monitored and managed. In a cloud solution, the question becomes: what knowledge of and access to the customer’s data does the custodian (cloud service provider) have, and who is at risk for storing data that may contravene local laws? What access does the provider/processor have to the data? Answers to these questions are an important aspect to determining whether data is best stored in the cloud or on-premises.

Benefits of the Hybrid Approach

There are several benefits to the multi-hybrid cloud approach.

  • Flexibility – Businesses can repartition or deploy elements of a solution based on changing technology services or improved capabilities of one or more of the cloud service providers. Actian Data Platform supports AWS, Azure, Google Cloud and on-prem as deployment platforms. It uses the same database, data model, and ETL integration on-premises and in each of these cloud providers, thereby offering significant deployment flexibility.
  • Performance – Actian Data Platform takes full advantage of modern processors to maximize concurrency, parallelism, and resource utilization. In addition, businesses can quickly leverage new capabilities in their solutions no matter where they are available in the hybrid cloud.
  • Elasticity – The number of Compute Nodes in an Actian Data Platform cloud deployment can start at a low of 4 nodes, automatically ramp up based on usage and demand, with peak end of week or end of month capacity when needed. Thus, resources are allocated and paid for only if and when needed.
  • Consistency – Businesses can support continuous delivery of applications across the hybrid cloud leveraging common tools and processes.
  • Agility – Businesses can design and develop solutions in such a manner that where they are deployed across a hybrid cloud can be adjusted in a seamless manner.

Costs of the Hybrid Approach

Most cloud service providers offer the idea that infrastructure updates are automatically applied so that businesses do not need to manage updates. Automatic updates may cause issues with applications that are sensitive to versions, so they still need to be tracked. Beyond updates, there are potentially concerns about managing multiple cloud providers, but the benefits outweigh the costs.

The multi-hybrid cloud solution using an application that has been architected for performance and the combined on-premises and cloud environment provides the power to query and analyze business data at near real-time speed.

Actian Data Platform is a hybrid cloud data warehouse service designed to deliver high performance and scale at a fraction of the cost of alternative solutions. It is a hybrid platform that can be deployed on-premises as well as on multiple clouds, including AWS, Azure, and Google Cloud.

You can learn more about Actian here.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale. Actian data intelligence solutions help streamline complex data environments and accelerate the delivery of AI-ready data. Designed to be flexible, Actian solutions integrate seamlessly and perform reliably across on-premises, cloud, and hybrid environments. Learn more about Actian, the data division of HCLSoftware, at actian.com.
Data Analytics

Predictive Analytics Can Reduce Customer Churn, Optimize Marketing

Actian Corporation

September 17, 2020

Predictive Analysis

All customers are not equal – there are some your business just can’t afford to lose. Do you know who they are and what is required to retain them? With predictive analytics powered by Actian, you can do just that.

Companies have been using statistical modeling, data correlation and behavioral forecasting for many years to profile customers. Unfortunately, traditional capabilities have been limited by the amount and types of data they have been able to include in the analysis, leading to incomplete and inconclusive results. With Actian, you can increase the accuracy of your churn predictions by combining traditional transactional and account datasets with call center text logs, marketing campaign response data, competitive offers, social media and many other sources to develop a truly holistic understanding of your customer’s buying behavior. Having the ability to leverage all this data will enable your organization to leverage predictive analytics more effectively.

Aggregated customer profiling data can help you discover new classifications and customer segments and assign lifetime value and churn scores that clearly indicate which of your customers are most important to your business – and who you can’t afford to lose. This information can be used to customize your customer experiences, provide enhanced support and create programs to retain your high-value customers. Identify the forces that influence these customers’ buying behavior and use them to customize marketing efforts, provide personalized customer service and even optimize your service supply chain. Predictive analytics is more than just a big data play, it is a critical business requirement.

Just as not all customers are equal, neither are all customer segments. Some groups of customers are high-value, purchasing your products and services repeatedly, ordering large quantities and generating large profit margins. Other customer segments are low-value, with larger customer acquisition costs, low-order volumes, few repeat purchases and low profitability due to price competition and discount demands.

If you want your company to grow and thrive, then you must focus your marketing and product planning efforts on developing high-value customer segments and potentially offload less-profitable segments. To accomplish this effectively, you must understand your customer segments and which customers align to these segments. Actian helps you do this by giving you the tools to aggregate all your customer data in one place, analyze it in real-time and make actionable insights available to your staff. These insights can help you improve customer satisfaction and loyalty, optimize supply chains, accurately price products, develop effective marketing campaigns and reduce the likelihood of your customers taking their business elsewhere.

Some level of customer churn is inevitable. Customers’ needs change and the offers from your competitors are enticing. Predictive analytics help you identify customer behavioral changes and market threats early, so you can make informed decisions about how you want to respond. Stop guessing with your marketing investments and focus your resources on the activities that will make your customers happy, increase their lifetime value to your company and generate sustainable results.

Actian Data Platform can help. Actian is a highly efficient data analytics database service that can process large amounts of data in near real-time by separating it into small chunks that are processed in parallel. What this means is you can perform customer-churn analysis behind the scenes to retain more customers. To learn more, visit www.actian.com/data-platform.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale. Actian data intelligence solutions help streamline complex data environments and accelerate the delivery of AI-ready data. Designed to be flexible, Actian solutions integrate seamlessly and perform reliably across on-premises, cloud, and hybrid environments. Learn more about Actian, the data division of HCLSoftware, at actian.com.
Data Intelligence

A Smart Data Catalog, a Must-Have for Data Leaders

Actian Corporation

August 26, 2020

smart data catalogs

The term “smart data catalog” has become a buzzword over the past few months. However, when referring to something being “smart,” most people automatically think, and rightly so, of a data catalog with only Machine Learning capabilities.

We do not believe that a smart data catalog is reduced to only having ML features.

There are many different ways to be “smart”. This article focuses on the conference that Guillaume Bodet gave at the Data Innovation Summit 2020: “Smart Data Catalogs, A Must-Have for Leaders”.

A Quick Definition of Data Catalog

We define a data catalog as being:

A detailed inventory of all data assets in an organization and their metadata, designed to help data professionals quickly find the most appropriate data for any analytical business purpose.

A data catalog is meant to serve different people or end-users. All of these end-users have different expectations, needs, profiles, and ways to understand data. These end-users consist of data analysts, data stewards, data scientists, business analysts, and so much more. As more and more people are using and working with data, a data catalog must be smart for all end-users.

What Does a “Data Asset” Refer to?

An asset, financially speaking, typically appears in the balance sheet with an estimation of value. When referring to data assets, it is just as important, even more important in some cases, than other enterprise assets. The issue is that the value for data assets aren’t always known.

However, there are many ways to tap the value of your data. There is the possibility for enterprises to directly use their data’s value, like for example selling or trading their data. Many organizations do this; they clean the data, structure it, and then proceed to sell it.

Enterprises can also make value indirectly from their data. Data assets enable organizations to:

  • Innovate for new products/services.
  • Improve overall performance.
  • Improve product positioning.
  • Better understand markets/customers.
  • Increase operational efficiency.

High performing enterprises are those that master their data landscape and exploit their data assets in every aspect of their activity.

The Hard Things About Data Catalogs

When your enterprise deals with thousands of data, that usually means you are possibly dealing with:

  • 100s of systems that store internal data (data warehouses, applications, data lakes, datastores, APIs, etc) as well as external data from partners.
  • 1,000s of datasets, models, and visualizations (data assets) that are composed of thousands of fields.
  • And these fields contain millions of attributes (or metadata)!

Not to mention the hundreds of users using them…

This raises two different questions:

How can I build, maintain, and enforce the quality of my information for my end-users to trust in my catalog?

How can I quickly find data assets for specific use cases?

The answer is in smart data catalogs

We believe that are five core areas of “smartness” for a data catalog. It must be smart in its:

  • Design: The way users explore the catalog and consume information.
  • User Experience: How it adapts to different profiles.
  • Inventories: Provides a smart and automatic way of inventorying.
  • Search Engine: Supports the different expectations and gives smart suggestions.
  • Metadata management: A catalog that tags and links data together through ML features.

Let’s go into detail for each of these areas:

A Smart Design

Knowledge Graph

A data catalog with smart design uses knowledge graphs rather than static ontologies (a way to classify information, most of the time built as a hierarchy).  The problem with ontologies is that they are very hard to build and maintain, and usually only certain types of profiles truly understand the various classifications.

A knowledge graph on the other hand, is what represents different concepts in a data catalog and what links objects together through semantic or static links. The idea of a knowledge graph is to build a network of objects, and more importantly, create semantic or functional relationships between the different assets in your catalog.

Basically, a smart data catalog provides users with a way to find and understand related objects.

Adaptive Metamodels

In a data catalog, users will find hundreds of different properties, to which aren’t relevant to some users. Typically, two types of information are managed:

  • Entities: Plain objects, glossary entries, definitions, models, policies, descriptions, etc.
  • Properties: The attributes that you put on the entities (any additional information such as create date, last updated date, etc.)

The design of the metamodel must serve the data consumer. It needs to be adapted to new business cases and must be simple enough to manage for users to maintain and understand it. Bonus points if it is easy to create new types of objects and sets of attributes!

Semantic Attributes

Most of the time, in a data catalog, the metamodel’s attributes are technical properties. Some of the attributes on an object include generic types such as text, number, date, list of values, and so on. As this information is necessary to have, it is not completely sufficient because they do not have information on the semantics, or meaning. The reason this is important is because with this information, the catalog can adapt the visualization of the attribute and improve suggestions to users.

In conclusion, there is one size fits all to a data catalog’s design, and it must evolve in time to support new data areas and use cases.

A Smart User Experience

As stated above, a data catalog holds a lot of information and end-users often struggle to find the information of interest to them. Expectations differ between profiles. A data scientist will expect statistical information, whereas a compliance officer expects information on various regulatory policies.

With smart and adaptive user experience, a data catalog will present the most relevant information to specific end-users. Information hierarchy and adjusted search results in a smart data catalog is based on:

  • Static Preferences: Already known in the data catalog if the profile is more focused on data science, IT, etc.
  • Dynamic Profiling: To learn what the end-user usually searches, their interests, and how they’ve used the catalog in the past.

A Smart Inventory System

A data catalog’s adoption is built on trust and trust can only come if its content is accurate. As the data landscape moves at a fast pace, it must be connected to operational systems to maintain the first level of information on metadata on your data assets.

The catalog must synchronize its content with the actual content of the operational systems.

A catalog’s typical architecture is to have scanners that scan your operational systems and bring and synchronize information from various sources (Big Data, noSQL, Cloud, Data Warehouse, etc.). The idea is to have universal connectivity so enterprises can scan any type of system automatically and set them in the knowledge graph.

In the Actian Data Intelligence Platform, there is an automation layer to bring back the information from the systems to the catalog. It can:

  • Update assets to reflect physical changes.
  • Detect deleted or moved assets.
  • Resolve links between objects.
  • Apply rules to select the appropriate set of attributes and define attribute values.

 A Smart Search Engine

In a data catalog, the search engine is one of the most important features. We distinguish between two kinds of searches:

  • High Intent Search: The end-user already knows what they are looking for and has precise information on their query. They either already have the name of the dataset or already know where it is found. Low intent searches are commonly used by more data savvy people.
  • Low Intent Search: The end-user isn’t exactly sure what they are looking for, but want to discover what they could use for their context. Searches are made through keywords and users expect the most relevant results to appear.

 A smart data catalog must support both types of searches

It must also provide smart filtering. It is a necessary complement to the user’s search experience (especially low intent research), allowing them to narrow their search results by excluding attributes that aren’t relevant. Just like many big companies like Google, Booking.com, and Amazon, the filtering options must be adapted to the content of the search and the user’s profile in order for the most pertinent results to appear.

Smart Metadata Management

Smart metadata management is usually what we call the “augmented data catalog”, the catalog that has machine learning capabilities that will enable it to detect certain types of data, apply tags, or statistical rules on data.

A way to make metadata management smart is to apply data pattern recognition. Data pattern recognition refers to being able to identify similar assets and rely on statistical algorithms and ML capabilities that are derived from other pattern recognition systems.

This data pattern recognition system helps data stewards set their metadata:

  • Identify duplicates and copy metadata.
  • Detect logical data types (emails, city, addresses, and so on).
  • Suggest attribute values (recognize documentation patterns to apply to a similar object or a new one).
  • Suggest links – semantic or lineage links.
  • Detect potential errors to help improve the catalog’s quality and relevance.

It also helps data consumers find their assets. The idea is to use some techniques that are derived from content-based recommendations found in general-purpose catalogs. When the user has found something, the catalog will suggest alternatives based both on their profile and pattern recognition.

Start Your Data Catalog Journey

Actian Data Intelligence Platform is a 100% cloud-based solution, available anywhere in the world with just a few clicks. By choosing the Actian Data Intelligence Platform Data Catalog, control the costs associated with implementing and maintaining a data catalog while simplifying access for your teams.

The automatic feeding mechanisms, as well as the suggestion and correction algorithms, reduce the overall costs of a catalog, and guarantee your data teams with quality information in record time.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale. Actian data intelligence solutions help streamline complex data environments and accelerate the delivery of AI-ready data. Designed to be flexible, Actian solutions integrate seamlessly and perform reliably across on-premises, cloud, and hybrid environments. Learn more about Actian, the data division of HCLSoftware, at actian.com.
Data Platform

What to Do When Your Data Warehouse Chokes on Big Data

Actian Corporation

August 26, 2020

Data warehouse

This may seem like an academic question, but it is increasingly becoming a reality for modern businesses. What do you do when you have millions of records with infinite width and depth, and your data warehouse chokes? Do you trim your data? Do you add more infrastructure capacity? Or do you need to look at a better data warehouse solution?

This problem is akin to owning an old car that makes a bunch of noises, smells terrible, and has wheels that rattle when you drive down the road. What do you do about it? Drive slower (that’s annoying), open the windows for some fresh air, and turn up the radio to drown out the sounds? Do you get some new tires, an air freshener, and a louder radio to mask the issues? Or do you consider buying a new car? Nostalgia may be a valid reason to keep a classic car, but it isn’t a good reason to keep a data warehouse around that isn’t meeting your business needs. Your business is evolving, and you need a data warehouse platform that will give you agility and the ability to move faster, not slow you down.

Where is the Infinite Data Problem Coming From?

The digital transformation of business processes and the rapid adoption of modern connected technology is what is driving the infinite data challenge. Instead of having a business run on a few core platforms with well-structured data schemas and transactional data growth curves that are relatively flat, modern businesses are embracing a wide variety of specialized systems and things like IoT and mobile devices that produce seemingly endless streams of data. This “measure everything” culture, combined with an uptick in data update volume from transactional systems, leads to a data profile where there can be an infinite number of rows of data and a seemingly infinite set of column attributes that are collected. This problem is a sign of success – it means that your organization understands the value of data and is actively working to collect the most diverse and expansive information footprint it can. You don’t want your data warehouse system to get in the way of that.

Why is Your Data Warehouse Choking on Big Data?

Most data warehouses were designed for on-prem infrastructure hardware with fixed capacity and processing optimized for relational database schemas. This is what companies needed five years ago. Times have changed. Traditional data warehouses are choking because they aren’t architected for big-data analytics in real time. They aren’t deployed on flexible and scalable cloud infrastructures and configured for on-demand resource scaling, and they are trying to apply old-school scalar processing approaches to modern data structures. If you give the system enough time, it will get the job done, just not with the speed that most modern businesses demand.

A Modern Solution to The Big Data Problem

The Actian Data Platform is a modern solution to your big data problem. Designed for high-efficiency processing, deployed on scalable cloud infrastructure, and leveraging high-performance vectorized data processing, Actian can meet the big data challenges of today and give you plenty of growth room for the future. Yes, many other data warehouse solutions can be deployed in the cloud to give you access to the compute and storage capacity, but in a side by side comparison, Actian’s unique approach out-performs the next best option and through highly efficient hardware utilization that can deliver higher performance at a much lower cloud cost. To learn more about how the Actian Data Platform delivers superior performance and can cut your cloud data warehouse bill in half, check out this video.

To learn more about how the Actian Data Platform can help you address your business’s big data problems, visit www.actian.com/data-platform.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale. Actian data intelligence solutions help streamline complex data environments and accelerate the delivery of AI-ready data. Designed to be flexible, Actian solutions integrate seamlessly and perform reliably across on-premises, cloud, and hybrid environments. Learn more about Actian, the data division of HCLSoftware, at actian.com.