Data Platform

Using a Cloud Data Warehouse to Support Localized Systems

Actian Corporation

August 7, 2019

Cloud Data Warehouse

Over the past few years, we’ve seen an increasing trend of regional governments applying unique restrictions and controls on where data is stored and how it is managed for users and businesses in their jurisdiction. The EU and Japan have recently imposed some strict rules about data export.

A cloud data warehouse can be an effective tool in helping your company remain compliant with regional regulations by keeping data within the region it was created.

Distributed Architecture of the Cloud

Cloud infrastructure is different from on-premises data centers in that it is inherently designed to support distributed systems. This could be either the replication of common capabilities to multiple regions or the localized deployment of specialized capabilities to a specific regional audience. Cloud management platforms include the tools to be able to define regions and effectively direct transactions (and data) to technical resources aligned with your unique business needs.

Separation of Applications and Data Stores

In the past, there was typically a 1:1 relationship between applications that users interact with and the databases where transactional data is stored. If you wanted to keep the data from one region localized in that region, you needed to create a new database and a new version of the application to connect to it. Modern cloud architecture changes that. You can now have a single application that is replicated globally and use routing rules to connect the application to different databases, either based on performance criteria or geopolitical rules.

The Impact of Data Localization Rules on Data Warehouses

One of the unique challenges that companies have faced when seeking conformance to data localization regulations is the need to not only store transactional data within the local region but also to control data being copied and exported outside of the source jurisdiction. This means that if companies want to perform deep analytics and data mining, they may need to do these activities in data warehouses that are in a specific region. When data warehouses were predominantly hosted on-premises, this was a costly proposition – acquiring a data center, standing up a data warehouse, operating it, and performing analytics within a specific region.

Data Warehouses in the Cloud

The cloud makes deploying localized data warehouses easier and cheaper than on-premises alternatives. The same cloud providers that are hosting localized applications can provision a data warehouse in-region with both the compute and storage capacity for performing business analytics and data mining. While the raw data is often export-controlled, there are often fewer restrictions (if any) on exporting the analytics and insights derived from data to decision-makers at a corporate headquarters in a different region. Companies can now distribute their data warehousing capabilities across the globe the same way they are distributing applications and transactional data stores, while at the same time retaining the ability to do high-performance processing and achieve centralized information insights.

Actian is the leader in hybrid data warehousing that span on-premises and multiple cloud platforms. Actian Data Platform provides a full-featured cloud data warehouse solution that can be deployed quickly – either globally or to a targeted region, helping your company achieve superior business value, fast time to market, and sustainable operational costs.  To learn more, visit www.actian.com/data-platform.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Integration

Considering Cloud Migration? First, Read This Checklist.

Actian Corporation

August 5, 2019

3d rendering robot learning or machine learning

Most modern companies have fully embraced the cloud as the preferred place to host the applications that their businesses use.  Many new applications are “cloud-native” or are designed specifically for operating in the cloud, while many of the legacy applications that your company uses today may have been designed to run on hardware in your company’s data center, i.e., on-premises.  The good news is that even if you have legacy applications, most software providers are now offering cloud-hosted options for their solutions.

Before you make the move to migrate legacy apps to the cloud, there are a few things you need to consider.  Use the following checklist as a tool to help you understand if cloud migration is right for your app, and if so, how you can increase the success of the migration process.

  1. Does the Cloud App Have All the Functions You Need?
    While most commercially available software packages that were developed for on-premises hosting now have cloud equivalents, the first thing you should look for is functional parity.  To accelerate time to market, some software providers have only implemented a subset of their tool’s functionality in their cloud offerings.  Now is a good time to review not only what features you have in your applications but also how are they being used by your users and business processes.
  1. Can Your Users Access the Cloud Application?
    One of the most significant downsides of cloud apps is that they require connectivity from the end user’s device to the cloud services in order for the app to work.  If your users need to be able to leverage the application when traveling to remote areas where reliable internet connectivity is unavailable, this can be an issue.  Connectivity can also be a challenge in corporate environments with robust network security protocols and firewalls that restrict access to external resources.  Enabling users to access cloud apps may require configuration to your network’s access control configurations.
  1. Where is the App Hosted?
    Many people don’t understand that the cloud is an extensive network of data centers run by large service providers.  In this network, applications may be hosted in a specific data center or region, or they may be replicated to operate in data centers across the globe.  It is important to understand where your users are located and use this information to guide your cloud deployment.  If your users are all in one city or region, it may be sufficient to have your app hosted only in that region.  If you have users in many different regions, more of a global footprint may be needed.  What you want to avoid is having you users in one region and your app hosted in another.  Network latency can cause significant impacts on your end-user’s experience related to app performance.
  1. Where is the Data Stored?
    In on-premises applications, typically the database that the application uses is co-located with the web or app server that brokers user transactions.  With cloud implementations, this is not always the case.  You may have application instances distributed around the globe, but leverage a centralized database to record transactions. Depending on the nature of the application, this configuration could cause performance issues if the network latency between the app and its data store is significant. Some modern architectures afford the ability to store and process data within the application itself (wherever it is deployed).  This is an important issue for your solution architects to investigate.
  1. How Will You Manage Integration?
    Digital business processes often involve the use of multiple applications and many data sources. Integrating your cloud apps with other systems and data stores, both on-premises and in the cloud can be complex and difficult to maintain.  Legacy approaches that leverage point-to-point integrations are often not effective in cloud environments.  Consider the use of a hybrid data integration platform such as Actian DataConnect to help you solve this problem.  By managing your connections in one place, you can enable greater flexibility in your deployment strategies.
  1. Do You Need to Replicate Data to Your Cloud Data Warehouse?
    When looking at your data integration needs for a cloud app, consider what (if any) of the application’s data needs to be replicated into your data warehouse for data mining and detailed analytics. A cloud data warehouse can be a powerful part of your cloud strategy, enabling less data to be retained locally (a key cost driver), improved analytics capabilities, and easier data retention should you decide to replace the cloud app in the future

As you can see from this checklist, there are some basic things that you need to evaluate to determine whether the migration of your application to the cloud will meet your business needs.  There are also some considerations that can help increase business value once your cloud migration is complete.  The choices you make about how your application data is managed are important to both achieving sustainable value for your business as well as enabling agility for the future.

Actian DataConnect is a hybrid data integration platform designed to help companies manage the connections between applications, platforms, and data sources across the enterprise. By managing your connections in one place, you can lower your operational costs, improve security, and enable business agility. To learn more, visit DataConnect.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Actian Life

Austin’s First Salsa and Guacamole Contest

Actian Corporation

August 1, 2019

Actian Culture

I am so thrilled to tell you about all about our first social event which put a little “spice” into everyone’s work week!

We sponsored a best salsa and guacamole contest. We asked everyone to bring in their favorite salsa and guacamole recipe and let their fellow colleagues’ taste and vote for the best one in each category. There were several entries, and everyone got a chance to place their vote anonymously.

We all had multiple “taste testing’s, just to make sure the vote chose was correct. 

We decorated our café area and blew up balloons which came in cactus and avocado shapes; red balloons, mini maracas and a table runner completed the look. 

The winner for best salsa went to Jay Clark and the winner of best guacamole went to Melanie Richards. They each won an Amazon gift card.

Everyone had a good time, sampling and resampling, talking to each other and enjoying the event. Sometimes in life, it’s the little things that bring a smile to someone’s face.

The biggest take away we received from this event; when is our next event? 

Thank you for reading about one of the events that make the Actian Culture phenomenal!

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Management

Should You Have Separate Document, Time-Series, NoSQL SQL Databases?

Actian Corporation

July 31, 2019

Server Image for Data Analysis

Managing and analyzing heterogeneous data is a challenge for most companies, which the oncoming wave of edge computing-generated datasets has only exacerbated. This challenge stems from a rather large “data-type mismatch” as well as how and where data has been incorporated into applications and business processes. How did we arrive here?

At one time, data was largely transactional and Online Transactional Processing (OLTP) and Enterprise resource planning (ERP) systems handled it inline, and it was heavily structured. Primarily, Relational DataBase Management Systems (RDBMS) managed the needs of these systems and eventually evolved into data warehouses, storing and administering Online Analytical Processing (OLAP) for historical data analysis from various companies, such as Teradata, IBM, SAP, and Oracle.

As most manual processes utilizing paper moved to digital records management, content management systems emerged as a means to manage all the unstructured documents from knowledge workers or which the expanded functionality within ERP and personal computing systems autogenerated. These systems contain semi-structured and unstructured document data stored in eXtensible Markup Language (XML) and JavaScript Object Notation (JSON) formats.

In parallel, and more so during the last few years with the Internet of Things (IoT) revolution, the third wave of digitization of data is upon us, operating at the edge in sensors, video, and other IoT devices. They are generating the entire range of structured and unstructured data but with two-thirds of it in a time-series format. Neither of these later datasets lends itself to RDBMS systems that underpin data warehouses due to how the data is processed and analyzed, the data formats used and the mushrooming dataset sizes.

Consequently, separate Document Store Databases, such as MongoDB and Couchbase, as well as several time-series databases, including InfluxDB and a multitude of bespoke Historians, emerged to handle these very distinct datasets. Each has a separate Application Programming Interface (API), lumped together as NoSQL – as in everything that’s not Structured Query Language (SQL).

The aftermath of these three waves of data types and database structures is data architects must now implement separate databases for each type of data and use case or try to merge and aggregate all of the different data types into a single database. Until recently, the only significant or enterprise-wide aggregation point for multiple databases and data types was the traditional data warehouse. The legacy data warehouse, however, is lagging as an aggregation point for two reasons.

First, many of them are based on inflexible architectures in terms of their capability to manage JSON and time-series data and the cost to expand them to administer larger datasets or complexity of modern analytics, such as Artificial Intelligence (AI) and Machine Learning (ML). Second, sending all the data to them in a single, centralized location on-premises can be costly and hinders decision-making at the point of action at the edge of the network.

During the era of edge computing and a wholesale flip of the majority of data being created and emanating from the edge instead of from the data center or a virtualized image in the cloud, specialized applications and platforms have an essential purpose in business process enablement. Just as each business process is unique, the data requirements for that technology to support those processes are also unique. While it may seem best-of-breed database technology for document store versus time-series versus traditional, fully structured transactional data may remove constraints on the use of technology within a business, you should be very careful before you go that route.

In general, the more APIs, underlying database architectures, resulting differences in supporting file formats, management, and monitoring systems and changes in which ones you use based on use case simply increase the complexity of your enterprise data architectures. This is particularly the case if you offer or implement multiple products, technologies and integration methodologies with this medley of databases. This complexity tends to have a domino effect into your support lifecycle for any software leveraging these databases – even the procurement of the databases.

Provided you can find a single database with similar performance and addresses all the data types and SQL as well as direct manipulation of the data through a NoSQL API, it makes far more sense to merge and aggregate heterogeneous data into a common database structure, particularly in Edge Computing use cases. For example, if you are looking at video surveillance data, sensor networks, and logs for security, then combinations of these and other disparate data sets must be aggregated for cross-functional analytics.

If you need to analyze, create reports and dashboards based on data of different types and in different source systems, then you will need some sort of capability for normalizing the data, so it can be queried either onsite or remotely from a single data set.

The requirements have changed during the last 30 years and Actian has built a new modular database that is purpose-built for edge-computing technologies and use cases and is capable of handling all datasets through a single NoSQL API, yet provides full SQL compliance. In both SQL and NoSQL functions, our 3rd party benchmark results show far better performance than any of the major Document Store, Time-Series or traditional SQL databases capable of handling Mobile and IoT.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Architecture

Rethinking Data Warehouse Modernization

Actian Corporation

July 30, 2019

Data Transformation

Data Warehouse Modernization

Sounds like a no-brainer. Move your Netezza, Teradata or Exadata data warehouse that likely runs on obsolete, proprietary hardware to a shiny new system that runs in the cloud, costs a fraction of what you were paying before and can be turned on and off like a light switch.

Is it That Easy? That Simple? Let’s Discuss…

Data warehouse modernization projects come in many flavors. Some are straightforward database upgrades or tweaks to data models. Others are more ambitious, such as redesigning your company’s primary data platform. Modernization may involve incorporating previously untapped features, such as vectorized processing, real-time columnar in-memory analytics, in-database algorithms or SPARK-based AI/ML analytics. Further, today’s next-gen cloud data warehouses deliver unprecedented economics due to their elastic, pay-as-you-go consumption model. Increasingly, modernization may include all of the above in order to address new data sources such as social, mobile and IoT data.

You can read my full article here.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Architecture

Smart IT Managers are Forging Their Data Warehouse Path to the Cloud

Actian Corporation

July 30, 2019

it's the journey not the destination

As Ralph Waldo Emerson once said, It’s not the destination it’s the journey. Unfortunately, this wise saying is not nearly applied as often as it should be to the contemporary topic of data warehouse modernization project design and execution. The result, thousands of data warehouse modernization projects unnecessarily end up in failure. Gartner’s Adam Ronthal, suggests that over 60% of all database migrations fail. That is, a migration is started, then starts slipping, overruns the original budget by multiples, until finally, management pulls the plug. Time is lost, money wasted, and careers wrecked. How do you make sure yours is not added to this dubious and growing list? 

There are Some Key Steps to Designing the Optimal Journey: 

  1. Migrate or Offload: A first step that many organizations take that mitigates risk and accelerates value delivery is the off-load journey. In this situation, several demanding workloads are identified to offload in a phased manner rather than migrate the entire data warehouse. Unless an organization needs to completely get off its existing data warehouse, e.g. your vendor has announced EOL plans as is happening with Netezza today, taking a phased approach with your migration usually makes sense and lets you get started sooner. Great candidates may include workloads with these characteristics: large data sets, ad-hoc query tool diversity or requests for new unsupported tools, hybrid data from multiple diverse data sources and complex queries.
  2. Baby Step or Giant Leap to the Cloud: Some organizations, as part of a cloud-first strategy move all of their data warehouse to the cloud. In other cases, pragmatic organizations often choose to conduct their migrations in stages. For example, moving a data landing and staging area to the cloud provides useful elasticity and agility while reducing the risk of disruption.
  3. Automate, Automate, Automate: Leading solution providers now offer sophisticated migration assessment tools that can identify SQL queries that can be translated to the new target system automatically. Typically, data warehouse systems that fully support the entire current SQL industry standard will do better at supporting the automation of query migration. It is not untypical for industry-standard SQL systems to support 95% and higher levels of automatic conversion from the source to the target system.
  4. Replicate, Augment or Breakthrough: Most organizations take a 3-step approach to the migration: first, replicate the source system report generation, then utilize the upgraded performance of the target system to augment the query base with additional dashboards and other ad-hoc analytics. Finally, in the third step, forward-thinking organizations look to develop new composable analytics applications such as real-time offers and fraud analysis that were prohibited for cost and/or performance reasons from considering before.
  5. Think “Business” with a Big “B”: A successful data migration project is always dependent on understanding and addressing the current and future needs of the business. This requires proactive collaboration with the direct and indirect users of the insight that you hope to deliver with your modernized data warehouse project. Especially important will be to identify and prioritize discovery-oriented analytics including ad-hoc analysis that the business side seeks and values.

How to Get Started? 

Your first step is to seek out and short-list a set of next generation solution vendors that can offer you a truly hybrid data warehouse journey that runs both in the cloud and on-premises with zero changes to your query stream and easily reroutes your ETL connections to your source data warehouse apps and external data sources, supports multiple cloud platforms to eliminate lock-in, and excels at workloads that demand high levels of scale and concurrency.  

 

Solutions such as Actian Data Platform designed to run seamlessly in the cloud and on-premises represent a potential breakthrough worth checking out. As the above diagram conveys an organization can charter its modernization journey through multiple paths with different combinations of cloud and on-premises deployments. A glass of hybrid with your slice of cloud data warehouse computing sir? Drink up – it’s a hybrid world we live in, today and tomorrow. 

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Integration

What You Should Know Before Moving to a Multi-Cloud Environment

Actian Corporation

July 30, 2019

Multi-Cloud Environment

Most IT leaders will agree: the cloud is the right place to host and operate many of their applications and systems. What isn’t quite as clear for these leaders is “which cloud” they should use.

The answer is “There is no right answer.” Each company (and application) has its own unique set of technical and operating needs. They will drive the decision about whether it should run in one of the public cloud environments or a company’s private cloud or on a 3rd party’s cloud environment (e.g., hosted and consumed as Software as a Service (SaaS)). This is a decision that should be made on a case-by-case basis and informed by a clearly documented and enforced enterprise-cloud policy.

Most organizations will likely choose and operate a hybrid environment with capabilities stratified across multiple clouds. Sensitive data apps may be on a private cloud, customer-facing systems that require geographic scalability may be on public clouds and SaaS components will likely be on someone else’s cloud. In this configuration, your focus should be on how you manage and connect the data in these systems. You will likely have business processes and data-driven analytics that require data to be integrated from different systems for your company to operate effectively.

Establishing and maintaining data connections in a multi-cloud environment is often more complex than managing connections in a single-cloud environment. Although it may be difficult, there are 3 reasons why data integration across multiple clouds is essential:

  1. Avoiding latency in business processes.
  2. Enabling data aggregation for analytics.
  3. Replicating data to manage business continuity risk.

Cloud environments provide a tremendous amount of cost leverage and scalability potential but require more robust data-connectivity capabilities than your IT department’s likely experience in legacy environments. In multi-cloud setups, it is often necessary to shift your integration patterns from point-to-point integrations between applications to more of a centralized data integration hub architecture, leveraging a platform, such as Actian DataConnect, to broker the connections to all your systems.

In addition to making manageability easier, DataConnect can help you monitor the flow of data between different cloud environments – you’ll make more informed decisions about how to improve end-to-end system performance and how to reduce infrastructure operating costs. Most companies make their initial cloud-selection decisions based on functionality criteria; however, cost savings is often the driver for making the decision to migrate systems from one cloud to another. Since cloud infrastructure is charged on a consumption model, understanding how your cloud applications are being used can help you identify cost-savings opportunities quicker.

Whether you are designing your cloud strategy in preparation to begin migrating capabilities to the cloud or seeking to optimize your current cloud utilization – selecting the right set of tools to help you manage your data connections is an essential step for harvesting maximum value from your cloud investments. Actian is an expert in hybrid data management, with modern solutions to help you manage data throughout the lifecycle of your IT systems. To learn more, visit DataConnect.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.