Skip to main content
Version: 2024.12.12

Datasets Creation

The dataset page helps you view, create, edit, duplicate, extend, and publish the dataset to meet your requirements. It facilitates the integration of datasets from various sources, allowing you to work with diverse data types and formats.

The dataset page has the following options:

  • Select source instance: It enables you to choose the desired source instance to access datasets created within that specific environment. A source instance refers to a particular system or database from which data is extracted and integrated into the data warehouse environment.
  • Grid and list view: You can view the datasets in grid and list view using these options. The list view presents data in a tabular format with rows and columns, making it easier to scan and compare values visually. The grid view presents data in a structured grid format, with each grid representing a dataset. The grid view is useful for comparing and analyzing multiple datasets simultaneously.
  • Search dataset: It helps you to search for a dataset.
  • Create dataset: It helps you to create a dataset.

Searching dataset

You can type a keyword in the Search dataset to search for a dataset.

Creating a dataset

This task enables you to create a dataset. It enables you to create new datasets, defining the structure, schema, and properties of the data. It supports importing data from various sources and integrating it into datasets seamlessly. You can also edit, modify, or transform data within datasets, ensuring data quality and consistency. Here's a flowchart to understand the dataset creation process.

flowchart

Prerequisites

Analytics author Access is required to duplicate out-of-the-box datasets and create or edit custom datasets.

Complete the following steps to create a dataset:

Selecting an entity

Entities are crucial in dataset creation as they provide a structured framework for organizing and representing data. Defining entities enables a clear understanding of the data stored and the relationships among various pieces of information. Complete the following steps to select an entity:

  1. Click Create dataset.
  2. Click Select source instance, and then select the required source instance.
  3. Click Linked instance, and then select the required instance.
  4. Click the + icon against entities to add the required entities from the entity list. If you accidentally closed the Add panel, you can reopen it by clicking on the Add entity button. By clicking on Add entity the panel should reappear, allowing you to resume the entity addition process without losing any progress.

::: note

You can search or use the scroll bar to select entities.

:::

Selecting entity

Choosing out-of-the-box columns

Out-of-the-box columns are vital for dataset entities, providing a foundation for data organization, standardization, and efficiency. They ensure data consistency and enhance the dataset's value for analysis and decision-making. Complete the following steps to select the out-of-the-box columns:

  1. Click the entity.
  2. In the Column tab, select the required columns, and then click Add. Note: You can do a single or multiple selection of columns.
  3. Click the edit button in the Explore dataset pane to modify the column’s name.

Creating custom object or column

The custom columns are typically derived from existing data within the dataset through calculations, transformations, or other manipulations to generate new insights or information. You can create a custom object or column using this option.

note

You cannot change the data object type that is attribute and metrics for the created data object.

Complete the following steps to create the attributes:

  1. In the Explore dataset pane, click Create object.
  2. In the Data Object window, type the column or object name.
note

The column or object name must not contain any blank spaces.

  1. Select Attribute.
  2. Click KeyColumnDatatype, then select the required data type.
  3. Type Key expression.
note

Key column datatype and key expression are mandatory to create a custom column.

  1. Click DisplayColumnDatatype, then select the required data type.
  2. Type Display expression.
  3. Click Save.

Data Object

Complete the following steps to create the metrics:

  1. In the Explore dataset pane, click Create object.
  2. Type of the object or column name.
  3. Select Metric.
  4. Click Aggregation Rule, then select the required aggregation.
  5. Click KeyColumnDatatype, then select the required data type.
  6. Type Key expression.
  7. Click Save.

Data Object

note

You get a Data Object ABC was successfully created message once the data object or column is created.

Key expression

The key expression enables you to create a column or attribute that helps identify, organize, and relate data across tables. You can transform existing attributes to ensure uniqueness and aid in efficient querying and analysis. This expression plays a vital role in schema design and data modeling.

Example: Consider you have an Orders table that contains information about orders placed by customers. Each order has an Order ID and is associated with a Customer ID. you want to count the number of orders placed by each customer.

Here's how you can use a key expression along with the COUNT function to achieve this: "COUNT(OrderID)"

note

You can type # to select an entity.

CustomerIDOrderCount
1013
1021
1031

This result shows the count of orders placed by each customer. Customer 101 has placed 3 orders, customer 102 has placed 1 order, and customer 103 has also placed 1 order.

Editing custom object or column

You can edit the existing object or column using this option. You can rename the column, adjust the property, edit the data, and rearrange the structure. Complete the following steps to edit the object or column:

  1. On the Explore dataset pane, click the edit button against the data object name.
  2. In the Data object dialog box, edit the required details.
  3. Click Save.
note

You get a Data Object ABC was successfully updated message once the object or column is edited.

Deleting custom object or column

This feature enables you to delete the custom object or column. Complete the following steps to delete the object or column:

  1. On the Explore dataset pane, click the delete button against the data object name.
  2. You get a Data Object ABC was successfully deleted message once the object or column is deleted.

Selecting out-of-the-box relationship

You can select a predefined relationship using this option. Out-of-the-box relationship provides a convenient and reliable approach to establishing data connections and promoting efficiency, accuracy, and consistency in data analysis and reporting processes. This option displays the out-of-the-box relationships between the selected entity and the other entities in the selected source. Complete the following steps to select the relationship.

  1. Click the entity.
  2. In the entity, click Relationship.
  3. Click the Link against the required relationship.

Adding custom relationship

You can create custom relationships using this option. Custom relationships offer flexibility, control, and adaptability in establishing data connections, allowing you to address your business needs, accommodate complex data structures, and optimize data analysis processes. Complete the following steps to add a custom relationship.

  1. Click and draw a line between two entities.
  2. Click join type and select the required join type.
  3. Enter an expression in Expression.
  4. Click Add Join.

Add join

Join type

The join type refers to the method used to combine rows from two or more tables in a relational database based on a related column between them. The join types determine how the database engine matches rows from different tables and includes them in the result set. You can use any of the following types to create a relationship based on your requirements:

  • Inner join: This type of join returns only the rows from both tables that have matching values in the specified columns. It combines rows from two tables where the join condition is met.
  • Left outer join: This join returns all the rows from the left table (the table mentioned first in the SQL query) and the matching rows from the right table. If there are no matching rows in the right table, NULL values are returned.
  • Right outer join: It returns all the rows from the right table and the matching rows from the left table. If there are no matching rows in the left table, NULL values are returned.
  • Full outer join: This join returns all rows from both tables, regardless of whether there is a match or not. If there is no match, NULL values are returned for the columns from the table that lack a matching row.
  • Cross join: This type of join returns the Cartesian product of the two tables, meaning it combines each row of the first table with every row of the second table. It does not require a join condition.

Example: Consider the Employees and Departments tables. The Employees table contains information about each employee, such as their ID, name, department ID, and salary. The Department's table contains information about each department, such as its ID and name. Now, let's say you want to join these two tables to get a result set that contains each employee's ID and the department name.

You can achieve this by using the following SQL query with an inner join:

#Employees.Department_ID = #Departments.Name
note

You can type # to select an entity.

The Inner Join specifies that only the rows with matching values in both tables will be included in the result. You get the following table as a result:

IDDepartment
101HR
102Marketing
103Finance

Adding filter

This feature enables you to create a filter. The filter feature enhances the dataset view in data exploration capabilities, empowering users to extract meaningful insights from the dataset more efficiently. Complete the following steps to add a filter.

  1. Click the Filter tab.
  2. On the Filter tab, type the filter condition.
  3. Click Save.

add filter

Example: consider a Students table with columns for StudentID, Name, and Age. Let's say you want to filter out students who are older than 18 years old..

Here's how you can write a filter condition for this scenario: " Students.Age > 18"

note

You can type # to select an entity.

The result of this query would include only the students who are older than 18 years old.

Saving custom Dataset

The Save Custom Dataset feature enables you to save customized datasets with specific configurations, filters, or transformations applied. By saving a custom dataset, you can easily access and reuse your tailored data views for ongoing analysis or reporting tasks. Complete the following steps to save a dataset:

  1. To enable the preview dataset feature, click Save or Save and continue. Selecting Save and continue enables you to preview the dataset without closing the dataset creation page.
  2. In Save dataset, type the dataset name and description.
note

The Dataset name must not contain any blank spaces

  1. Type the mapping name.
  2. Click Save. You get a Dataset XYZ was successfully created message once the dataset is saved.
note

The dataset is created, and it is in a draft state.

Save dataset