Connecting to data sources in Data Virtualization
Data Virtualization supports many relational and nonrelational data sources that you can add to your data source environment. Data Virtualization connects to relational data sources by using the Java™ Database Connectivity (JDBC) protocol.
Learn how you can connect to your data sources.
- Adding a connection to a data source
- Adding a data source from an existing platform connection
- Accessing data sources by using remote connectors in Data Virtualization
- Adding platform connections
Before you begin
- If you want to enforce governance for your published objects, set up a governed catalog to publish your assets to. For more information, see Governing virtual data in Data Virtualization.
- Review the list of data sources that are supported for Data Virtualization. For more information, see Supported data sources in Data Virtualization.
- For certain data sources, such as Amazon S3, Ceph®, IBM Cloud Object Storage, Google BigQuery, and Snowflake, you must complete specific steps.
- After a data source is added, any user with virtualize permissions (Data Virtualization Manager or Engineer roles) can create virtual tables. The user can create virtual tables by using any of the added data sources, no matter which user added the data source. For more information, see Managing users and roles.
- Review limitations and restrictions for data type mapping in Data Virtualization. For more information, see Supported data sources in Data Virtualization.
- Review Data source connection access restrictions in Data Virtualization to plan who you want to be able to access the data source connection and what privileges you want them to have.
Adding a connection to a data source
To add a connection to a data source in your environment, complete the following steps.
On the navigation menu, click Data sources page by default.
. The service menu opens to theClick Data Virtualization.
to see a list of data sources that can be added toSelect the type of data source that you want to connect to and then click Select.
The type of connection that you create determines the information that you must provide.
Typically, a connection requires a URL, a hostname, and port number.
Specify the required information based on the connection that you selected:
- The connection name and description.
- The name of the database
- The hostname or IP address and port number of the database, which is required to access the connection.
- The username and password that allow access to the
data source.Note: The username and password that is specified here refer to an ID with read-only access to the data source. This user is required for accessing data from the data source and does not necessarily correspond to a Cloud Pak for Data username or a Data Virtualization user ID.
For some data sources, you can use the Cloud Pak for Data credentials to access the data source. To do so, select the corresponding checkbox.
- Any additional properties required to create the connection.
If you want to use SSL to connect to the database, copy the content of the SSL certificate and paste it in the corresponding box.
- Add collaborators to the data source connection to determine who can access it. On the
Add collaborators and add the connection to a remote connector page, the
creator is listed as a collaborator. Choose from the following options to add additional
collaborators to the connection:
- Select Skip to create the data source connection without any additional collaborators or remote connectors. This means only the creator of the connection can view and use it.
- Select and select the users and user groups that you want to add as collaborators. Any users that you select, and any users that belong to groups that you select, can access the connection
- Select and select the roles that you want to add as collaborators. You can add the Engineer role, the Admin role, or both. Any users that have the roles that you select can access the connection.
For more information about collaborators, see Collaborators.
-
Click Create to add the connection to the data source environment.
- Manage access for the connection to determine what database tasks the
collaborators can perform on the connection.
- On the Data sources page, click the vertical overflow menu () and select Manage access. On the Manage access page, you can see the collaborators and their currently assigned privileges.
- You can add additional collaborators from the Manage access page.
- Apply your changes.
- Transfer ownership of the data source connection. See Transferring ownership of data sources in Data Virtualization.
Adding a data source from an existing platform connection
To add a data source from an existing platform connection, complete the following steps.
On the navigation menu, click Data sources page by default.
. The service menu opens to the-
Click the Add connection drop-down menu and click Existing platform connection to see a list of data sources that can be added to Data Virtualization.
-
Select the data source that you want to add and click Add.
- Add collaborators to the data source connection to determine who can access
it. On the Add collaborators and add the connection to a remote connector page,
the creator is listed as a collaborator. Choose from the following options to add additional
collaborators to the connection:
- Select Skip to create the data source connection without any additional collaborators or remote connectors. This means only the creator of the connection can view and use it.
- Select and select the users and user groups that you want to add as collaborators. Any users that you select, and any users that belong to groups that you select, can access the connection
- Select and select the roles that you want to add as collaborators. You can add the Engineer role, the Admin role, or both. Any users that have the roles that you select can access the connection.
For more information about collaborators, see Collaborators.
-
Optional: Select a remote connector to associate to the data source and click Add to connector.
For more information, see Accessing data sources by using remote connectors in Data Virtualization.
- Click Add to add the connection. Note: When you add data source connections in Data Virtualization, you might need to refresh twice on the Virtualize page. The first refresh notification is displayed when new data source connections are added. Click Refresh to reload tables, including those from new connections. After tables reload, a second notification appears. Click Refresh again to update your table list with newly loaded tables.
- Manage access for the connection to determine what database tasks the
collaborators can perform on the connection.
- On the Data sources page, click the vertical overflow menu () and select Manage access. On the Manage access page, you can see the collaborators and their currently assigned privileges.
- You can add additional collaborators from the Manage access page.
- Apply your changes.
- Transfer ownership of the data source connection. See Transferring ownership of data sources in Data Virtualization.
Learn more