Loading data into SQL Database#
This guide explains how to load data into SQL Database for further analysis.
Introduction#
Use the SQL Database destination to load data into an SQL Database. Adverity supports the following engine types:
Amazon Redshift
Microsoft SQL Server
MySQL
Oracle
Postgres
SAP HANA
Snowflake
Prerequisites#
Before you complete the procedure in this guide, perform all of the following actions:
Create a datastream whose data you want to load into SQL Database. For more information on creating a datastream, see Collecting data in Adverity.
Procedure#
To load data from a datastream into SQL Database, follow these steps:
Add SQL Database as a destination to the workspace which contains the datastream or to one of its parent workspaces.
Assign the SQL Database destination to the datastream.
You can assign as many destinations to a datastream as you want.
Some destinations require specific Data Mapping, such as Hubspot and Facebook Offline Conversions. If these Data Mapping requirements conflict, the destinations cannot be assigned to the same datastream.
Adding SQL Database as a destination#
To add SQL Database as a destination to a workspace, follow these steps:
Go to the Destinations page.
Click + Create destination.
Search for and click SQL Database.
In the Configuration page, fill in the following fields:
- Name
(Optional) Rename the destination.
- Engine type
Select the SQL engine type from the drop-down list.
- Hostname
Specify the URL of the SQL Database. If you use a port other than the default setting for this SQL Database, specify the port in the format
hostname:port
.- Database
Specify the name of the database into which you want to load the data.
- Username
Enter the username for SQL Database destination.
- Password
Enter the password for SQL Database destination.
For more information on advanced configuration settings, see Advanced SQL Database tips.
Click Create.
Assigning SQL Database as a destination#
To assign the SQL Database destination to a datastream, follow these steps:
Go to the Datastreams page.
In the Load section, click + Add destination.
Select the SQL Database checkbox in the list.
Click Save.
For the automatically enabled destinations, in the pop-up window, click Yes, load data if you want to automatically load your previously collected data into the new destination. The following data extracts will be loaded:
All data extracts with the status collected if no other destinations are enabled for the datastream
All data extracts with the status loaded if the data extracts have already been sent to Adverity Data Storage or external destinations
Alternatively, click Skip to continue configuring the destination settings or re-load the data extracts manually. For more information, see Re-loading a data extract.
Configuring settings for loading data into SQL Database#
To configure the settings for loading data into SQL Database, follow these steps:
Go to the Datastreams page.
In the Load section, find the SQL Database destination in the list, and click Actions on the right.
Fill in the following fields:
- Table name
Specify the target table in the destination into which to load data from the datastream. The name can contain alphanumeric characters and underscores. For example,
target_table
. To specify a schema, use the syntaxschemaName.tableName
.By default, Adverity saves data from each datastream in a different table named
{datastream_type}_{datastream_id}
(for example,mailgun_83
).You can specify the same target table for several datastreams. If a column is shared between datastreams, Adverity performs a full outer join and concatenates values. If a column is not shared between datastreams, Adverity writes null values in the relevant cells.
To create a new SQL Database spreadsheet containing the data you load into SQL Database, enter a name for the new spreadsheet into this field.
You can use the following placeholders when creating new table names in the destination:
Placeholder
Description
{datastream_id}
The datastream ID.
{datastream_type}
The data source.
{extract_id}
The data extract ID.
{meta[*]}
Replace
*
with a metadata placeholder to use metadata in the table name. For example,{meta[datastream_URI]}
uses the datastream URI as the table name. For more information on metadata and placeholders, see Using placeholders.{name}
The automatically generated filename of the data extract.
{scheduled_day}
The day when the data fetch was scheduled to run.
{scheduled_month}
The month when the data fetch was scheduled to run.
{scheduled_year}
The year when the data fetch was scheduled to run.
- Truncate
Select this checkbox to delete all rows from the relevant table in the destination before loading the latest data extract.
If the Truncate checkbox is selected, the following overwrite conditions are not applied and all data will be deleted.
If no overwrite options are selected, the loaded data is appended to the destination table.
- Datastream
Select this checkbox to overwrite all data previously loaded into the destination from this datastream.
Data loaded from other datastreams is not overwritten.
- Date Range
Select a date column from your data extract to overwrite all data in the destination with a date that matches a date in this column.
If the Datastream checkbox is selected, only the data loaded from the current datastream with matching dates will be overwritten.
- Filename
Select this checkbox to overwrite the relevant data in the destination if a data extract with the same filename already exists in the destination.
- Key Columns
Select this checkbox to overwrite data in the destination based on the key columns defined in the Data Mapping of the datastream. Adverity executes this overwrite option after all the other overwrite options. When you select this checkbox, the configuration to overwrite data based on dates does not have an effect.
Click Save.
Advanced SQL Database tips#
Configuration#
To configure the SQL Database destination, follow these steps:
Go to the Destinations page.
Click the SQL Database destination in the list.
In the secondary menu, click Configuration.
Fill in the following fields:
- Load state table
Select the checkbox to create a load state table. The load state table is a table in the destination which contains information about each data load. The load state table contains the following fields:
Fieldname
Description
datastream_id
datastream ID.
datastream_name
datastream name.
datastream_url
datastream URL.
extract_filename
Filename of the data extract.
range_start
Start date and time of the fetch.
range_end
End date and time of the fetch.
uuid
Unique ID of the data transfer to the target table.
load_state
State of the data transfer. The possible states are failed, finished, and running.
load_start
Date and time when the data transfer started.
load_end
Date and time when the data transfer finished.
target_schema
Data schema used in the target table.
target_table
Name of the target table.
nrows
Number of rows in the target table.
ncols
Number of columns in the target table.
- Name of load state table
Enter a name for the load state table. The default name of a load state table is dt_loadstate.
- Options
Specify the options for the database in JSON format.
- Default schema
Specify the default schema used in the database.
Click Save.
Technical details of loading data into SQL Database#
When loading a data extract into SQL Database, Adverity creates the following additional columns in the target table:
- dt_created
The date the data extract was created.
- dt_updated
The date the data extract was last updated.
- dt_filename
The automatically generated filename of the data extract.
In addition, Adverity creates a temporary table in SQL Database with the name _stage. This ensures that no partial updates are made to the database (guarantee of atomicity). Adverity deletes this temporary table after the data load is complete or has been aborted.
Troubleshooting: Loading data into SQL Database fails after changing the table structure of the data extract#
Data may fail to load into SQL Database because of a change in the table structure of the data extract. To ensure the frictionless operation of production systems, Adverity only modifies existing tables in SQL Database if the changes are considered neutral, such as adding a new field to the table. Changes to the table structure of the data extract are not considered neutral. As a consequence, Adverity does not update the existing table in SQL Database resulting in failure when loading data.
The following actions change the table structure of the data extract:
You change the key columns in the datastream’s Data Mapping. For more information on key columns, see Setting key columns.
You change the data type of a column in the datastream’s Data Mapping. For more information on the data types used in Adverity, see Data types used in data harmonization.
You change the maximum character length of values in a column which contains String values.
To resolve this problem, follow these steps:
In SQL Database, modify or drop the table that corresponds to the datastream.
In Adverity, load data from the datastream into SQL Database.
Troubleshooting: Adverity cannot access SQL Database destination#
If Adverity cannot access the SQL Database destination, the destination may restrict access based on IP address.
To resolve this issue, follow these steps:
Find the IP address of your Adverity instance. To find the IP address of your Adverity instance, contact Adverity Customer Support.
Add Adverity’s IP address to the whitelist of your destination.