Using the Table Wizard Feature of Data Lake Analytics

By Zhao Yongchun, Data Lake Analytics Team

The Table Wizard feature in Data Lake Analytics (DLA) simplifies the creation of Object Storage Service (OSS) mapping databases and tables in DLA. You can use the Table Wizard in just three steps:

  1. Check authorization
  2. Data source configuration
       • Configure the location of OSS tables
       • Configure the location of DLA databases
       • Other configurations
  3. Import configuration

This article describes how to quickly create OSS tables through the Table Wizard feature.

Step 1: Check Authorization

This step ensures that the DLA service has access to OSS data. If you already have permission to access OSS data sources, you can skip this step.

  1. Log on to the DLA console.
  2. In the left-side navigation pane, select Table wizard, and then click Agree to authorize.

  3. On the Cloud Resource Access Authorization pane, click Confirm Authorization Policy to complete the OSS authorization.

Step 2: Data Source Configuration

This step configures the storage location for OSS tables and DLA databases.

Note: Configure the location of the OSS tables first, then the location of the DLA databases, and then the other settings.

Configuring the Location of OSS Tables

Click OSS Console in the preceding figure to obtain the URL of an OSS data table, that is, the address of the table's data in OSS. You can obtain the URL in any of the following three ways:

Method 1: Accessing the File URL

Method 2: Using File Preview

Method 3: Using ossbrowser

Obtain the URL using ossbrowser, the official OSS tool.

After you enter the URL of an OSS file, click Parse URL. The Table name and Storage location fields are then filled in automatically.
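To make the idea of parsing a URL concrete, here is a minimal Python sketch that splits an OSS file URL into a storage location and a candidate table name. It is not the wizard's actual logic: the function name split_oss_url, the example bucket, and the rule of using the file's parent folder as the table name are all assumptions made for illustration.

```python
import posixpath
from urllib.parse import unquote, urlparse


def split_oss_url(url):
    """Return (table_name, storage_location) for an OSS file URL.

    Hypothetical helper: the parent-folder-as-table-name rule is an
    assumption, not the documented behavior of the Table Wizard.
    """
    parsed = urlparse(url)
    if parsed.scheme == "oss":
        # oss://bucket/key form: the host part is the bucket name.
        bucket = parsed.netloc
    else:
        # https://bucket.endpoint/key form: the bucket is the first host label.
        bucket = parsed.netloc.split(".", 1)[0]
    key = unquote(parsed.path).lstrip("/")
    folder = posixpath.dirname(key)          # folder that contains the file
    table_name = posixpath.basename(folder)  # last folder name, may be empty
    location = f"oss://{bucket}/{folder}/" if folder else f"oss://{bucket}/"
    return table_name, location


print(split_oss_url(
    "https://example-bucket.oss-cn-hangzhou.aliyuncs.com/logs/orders/part-0.csv"
))
```

Any query string on the URL, such as a signature or expiry parameter, is ignored because only the host and path are used.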

Configuring the Location of DLA Databases

The DLA database location must be the path where the OSS files are located or one of its parent paths. For example, if an OSS path is oss://analyticdb-bucket/table/2019/1.24/, the corresponding DLA database path can be any of the following options:

  • /
  • /table/
  • /table/2019/
  • /table/2019/1.24/
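The rule above can be sketched in a few lines of Python. The function name candidate_database_paths is hypothetical; it simply lists the folder of the OSS table and each of its parent folders, relative to the bucket root.

```python
from typing import List


def candidate_database_paths(oss_path: str) -> List[str]:
    """List the table folder and all of its parents, relative to the bucket."""
    prefix = "oss://"
    if not oss_path.startswith(prefix):
        raise ValueError("expected a path that starts with oss://")
    # Drop the scheme, then separate the bucket name from the object key.
    _bucket, _, key = oss_path[len(prefix):].partition("/")
    parts = [p for p in key.split("/") if p]
    paths = ["/"]  # the bucket root is always a valid parent
    for depth in range(1, len(parts) + 1):
        paths.append("/" + "/".join(parts[:depth]) + "/")
    return paths


for path in candidate_database_paths("oss://analyticdb-bucket/table/2019/1.24/"):
    print(path)
```

For the example path, this prints the same four options listed above, from the bucket root down to the table folder itself.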

Scenario 1: Creating an OSS Table in an Existing DLA Database

If you do not need to create a DLA database when creating an OSS table in DLA, click Schema list, and select a DLA database from the Choose a schema drop-down list.

Scenario 2: Creating an OSS Table in a New DLA Database

If you need to create a DLA database when creating an OSS table in DLA, click New schema, and select a DLA database path from the Storage location drop-down list.

Other Configurations

  • Data format: JSON and Text are supported.
  • Compression: No compression is selected by default.
  • Scan mode: Scan child folder mode is selected by default.

Other settings, such as Skip header rows, Column embedded level, and Select file only, can be configured according to your requirements.

Step 3: Import Configuration

  1. Click Next to import configurations.
  2. Click Execute to go to the Execute window, and then click OK.

After the configurations are executed successfully, a message indicates that the OSS table has been created. You can click Back to return to the Table wizard tab page, or click Log on to DMS Console to analyze the OSS data.
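For readers who prefer to think in SQL, the result of the wizard corresponds roughly to a schema that points at an OSS folder plus an external table under that folder. The Python sketch below builds such statements as strings. It is an illustration only: the statements the wizard actually generates are not shown in this article, the helper name build_ddl and all sample names are hypothetical, and the Hive-style clauses (DBPROPERTIES, ROW FORMAT DELIMITED, skip.header.line.count) are assumed to match your DLA version.

```python
def build_ddl(schema, schema_location, table, table_location, columns,
              delimiter=",", skip_header_rows=0):
    """Build CREATE SCHEMA and CREATE EXTERNAL TABLE statements as text.

    Hypothetical helper for a delimited Text table; not the wizard's output.
    """
    # The database location must be the table folder or one of its parents.
    if not table_location.startswith(schema_location):
        raise ValueError("table location must be under the schema location")
    create_schema = (
        f"CREATE SCHEMA {schema} WITH DBPROPERTIES (\n"
        f"  catalog = 'oss',\n"
        f"  location = '{schema_location}'\n"
        f");"
    )
    column_sql = ",\n".join(f"  {name} {sql_type}" for name, sql_type in columns)
    # The table statement is meant to run while connected to the schema above.
    create_table = (
        f"CREATE EXTERNAL TABLE {table} (\n{column_sql}\n)\n"
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}'\n"
        f"STORED AS TEXTFILE\n"
        f"LOCATION '{table_location}'"
    )
    if skip_header_rows:
        # Assumed to correspond to the Skip header rows option.
        create_table += (
            f"\nTBLPROPERTIES ('skip.header.line.count' = '{skip_header_rows}')"
        )
    return create_schema, create_table + ";"


schema_sql, table_sql = build_ddl(
    "demo_schema", "oss://example-bucket/logs/",
    "orders", "oss://example-bucket/logs/orders/",
    [("id", "INT"), ("amount", "DOUBLE")],
    skip_header_rows=1,
)
print(schema_sql)
print(table_sql)
```

The location check mirrors the rule from Step 2: the database path must be the table's own folder or a parent of it.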

Reference: Using the Table Wizard Feature of Data Lake Analytics, Alibaba Cloud Community
