SFTP Connections
You can create connections to SFTP servers to upload your datasets to the Cloud Portal.
Linux- and Windows-based SFTP servers are supported.
Jobs can be executed from SFTP sources on the following running environments:
Trifacta Photon
Spark on EMR
Read: Supported
Write: Not supported
Limitations
Read-only connection
Files and folders whose names begin with underscore (
_
) aren't visible.Ingest of over 500 files through SFTP at one time isn't supported.
For private SFTP servers, you can't run jobs on Spark. These jobs must be run using Trifacta Photon.
Through SFTP connections, you can't run jobs on Avro or Parquet files or on parameterized datasets, from file types that require conversion such as JSON, PDF, and Excel.
Single Excel files can be imported through SFTP.
Prerequisites
Acquire user credentials to access the SFTP server. You can use username/password credentials or SSH keys. See below.
Verify that the credentials can access the proper locations on the server where your data is stored. Initial directory of the user account must be accessible.
SSH Keys
If preferred, you can use SSH keys to for authentication to the SFTP server.
Nota
SSH keys must be private RSA keys. If you have OpenSSH keys, you can use the ssh-keygen utility to convert them to private RSA keys.
Allowlist SFTP server
If your SFTP server is private, you must add the Trifacta's IP to the allowlist of IPs that are permitted to communicate with the server. For more information, please see the documentation that is provided with your software distribution.
Create Connection
Create through application
You can create a SFTP connection through the Cloud Portal.
Steps:
In the left nav bar, select the Connections icon. See Connections Page.
In the Connections page, click Create Connection. See Create Connection Window.
In the Create Connection window, click the SFTP connection card.
Specify the properties for your SFTP server.
Property
Description
Host
The hostname of the FTP server to which you are connecting. Do not include any protocol identifier (
sftp://
).Port
The port number to use to connect to the server. Default port number is
22
.Credential Type
Select one of the following:
basic
- authenticate via username and passwordSSH Key
- authenticate via username and SSH keyUser Name
The username to use to connect.
Password
(Basic credential type) The password associated with the username.
SSH Key
(SSH Key credential type) The SSH key that applies to the username.
Test Connection
Click this button to test the connection that you have specified.
Default Directory
Absolute path on the SFTP server where users of the connection can begin browsing.
Block Size (Bytes)
Fetch size in bytes for each read from the SFTP server.
Nota
Raising this value may increase speed of read operations. However, if it is raised too high, resources can become overwhelmed, and the read can fail.
Connection Name
The name of the connection as you want it to appear in the application.
Description
This description is displayed in the application.
For more information, see Create Connection Window.
Click Save.
Create through APIs
Type:
jdbc
Vendor:
sftp
For more information, see : API Reference docs