Alibaba Cloud DataWorks Highly Recognized by Forrester

  • Flexible deployment
    CDWs are expected to offer several flexible deployment modes. For small enterprises, CDWs should provide an online multi-tenant mode that lets customers quickly mobilize computing resources and deploy a data warehouse in just a few minutes. For medium and large enterprises, CDWs should support an exclusive or local deployment mode that provides robust computing performance and strong security while hiding highly complex technical details.
  • Efficient data migration to cloud
    For customers that have not yet migrated their data warehouses to the cloud, or that adopt hybrid online and offline architectures, CDWs should provide a fast and low-cost way to help users collect their data.
  • Diverse analysis methods
    CDWs should support multiple technical means to help users get desired data processing capabilities in various business scenarios.
  • Excellent security
    CDWs should provide security in various aspects, including data encryption, auditing, data desensitization and access control.

Product Architecture

Before analyzing DataWorks, we will first take a quick look at its role in the Alibaba Cloud CDW service system and its product architecture.

  • Data integration: Integrates heterogeneous data sources and collects large volumes of data from various source systems onto the big data cloud platform
  • Data development: Data warehouse design and ETL development
  • O&M monitoring: O&M monitoring over jobs in the ETL process
  • Real-time analytics: Real-time data exploration and analysis
  • Data asset management: Metadata management, data map, data lineage, data asset graph, etc.
  • Data quality: The system for data quality control, monitoring, verification and assessment
  • Data security: Data permission management, data classification labeling, data desensitization and data audits
  • Data service: Data sharing, data exchange and data API services

Flexible Deployment

The Forrester report explains at length why multiple deployment modes are necessary, and compares the CDWs of several service providers. DataWorks is one of the first-tier products that provide multiple deployment modes.

Efficient Data Migration to the Cloud

Efficient data integration methods significantly ease the migration of enterprise data to the cloud. During the initial migration stage, enterprises need to move their data assets to the cloud quickly and securely; during ongoing business operations, enterprises need to feed various kinds of data into CDWs and then deliver the processed data from CDWs to individual business units.

Diverse Analysis Methods

DataWorks provides powerful data development IDEs and supports visual editing of SQL code, integration tasks and business flow DAGs. Multi-user online collaboration and version management of task scripts meet the practical needs of enterprise-level data development. In addition to offline task processing, DataWorks provides the lightweight “Analytics Workbench” tool, which draws on the computing capacity of MaxCompute to meet users’ needs for instant data analysis.
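
To make the idea of a business flow DAG concrete, the following is a minimal Python sketch of how task dependencies can be resolved into a valid run order. It does not use any DataWorks API; the task names and the `execution_order` helper are hypothetical and exist only for illustration.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical business flow: each task maps to the set of tasks it depends on.
business_flow = {
    "sync_orders": set(),
    "sync_users": set(),
    "clean_orders": {"sync_orders"},
    "join_orders_users": {"clean_orders", "sync_users"},
    "daily_report": {"join_orders_users"},
}


def execution_order(flow):
    """Return one valid run order in which every task follows its dependencies."""
    return list(TopologicalSorter(flow).static_order())


order = execution_order(business_flow)
print(order)
```

Several orders can be valid for the same flow, so the sketch only guarantees that each task appears after the tasks it depends on. If the flow contains a cycle, `TopologicalSorter` raises `graphlib.CycleError`, which is why a business flow has to be acyclic.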

Robust Security

Protecting sensitive data requires strict compliance with industry standards and with data privacy laws and regulations. Security is the top priority of DataWorks. DataWorks provides data security modules and implements all-round data security through the following protection mechanisms:

  • Multi-tenant isolation
    DataWorks has its own multi-tenant permission model. Tenants can apply for resource quotas on demand and manage their own resources; tenants can also manage their own data, permissions, users and roles independently from each other to ensure data security.
  • Data security level setting
    Data security levels allow users to discover and locate sensitive data and to see how sensitive data is distributed across data resource platforms. DataWorks automatically discovers sensitive data based on specified sensitive data types and classifies it. Appropriate security rules are then applied based on security levels such as Top Secret, Confidential and General.
  • Data access audit
    DataWorks strictly examines the access of privileged users, including access time, executed operations and execution order. Recording and auditing this access helps ensure that privileged users perform appropriate operations at the proper time, and makes it possible to check for abnormal operations, which further improves the security of data systems.
  • Data desensitization
    When it cannot be determined whether certain users, access addresses, or even fields can be trusted, DataWorks focuses on the data content itself: it identifies sensitive information and dynamically blocks access to it to ensure data security.
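
The security level and desensitization ideas above can be illustrated with a minimal Python sketch. It is not the DataWorks implementation: the detection rules, the `classify`, `mask` and `read_field` helpers, and the masking policy per level are all assumptions made for illustration. Only the level names (Top Secret, Confidential, General) come from the text.

```python
import re

# Hypothetical detection rules: (pattern, data type, security level).
RULES = [
    (re.compile(r"^\d{11}$"), "phone", "Confidential"),
    (re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"), "email", "Confidential"),
    (re.compile(r"^\d{16}$"), "card_number", "Top Secret"),
]


def classify(value: str):
    """Return (data type, security level) for a value, based on its content."""
    for pattern, data_type, level in RULES:
        if pattern.match(value):
            return data_type, level
    return "unclassified", "General"


def mask(value: str, level: str) -> str:
    """Apply an assumed masking policy for the given security level."""
    if level == "General":
        return value
    if level == "Top Secret" or len(value) <= 4:
        return "*" * len(value)
    # Confidential: keep the first two and last two characters.
    return value[:2] + "*" * (len(value) - 4) + value[-2:]


def read_field(value: str, trusted: bool) -> str:
    """Return the raw value only to explicitly trusted callers; mask otherwise."""
    _, level = classify(value)
    return value if trusted else mask(value, level)


print(read_field("13800138000", trusted=False))
print(read_field("1234567812345678", trusted=False))
```

Because the decision is driven by the content of the value rather than by who is asking or which field it came from, the same rule applies even when the caller's trustworthiness is unknown. A real system would need far more robust detection than these three patterns.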

Conclusion

As “Internet Plus” is applied more widely across industries, enterprises have a growing need to manage, process and use their data assets. Internet companies can quickly apply their big data processing capabilities to meet these needs. That also explains why these four cloud service providers, rather than long-established data warehouse companies like Oracle and IBM, are listed in the Forrester report as first-tier CDW providers.

Alibaba Cloud

Follow me to keep abreast of the latest technology news, industry insights, and developer trends. Alibaba Cloud website: https://www.alibabacloud.com