Bulk Migration Parameters
This article describes the input, output, tokenization, and detokenization parameters of the migration.properties
and detokenization.properties
files.
Input configuration
The input configuration section lists and defines the parameters required to define the input data file and the columns from input data file that needs to be tokenized (migration.properties
file) or detokenized (detokenization.properties
file). For example the name and path for the input data file is passed to the bulk utility through the Input File Path and Input Type parameters in the Input Configuration section of migration.properties
/detokenization.properties
file.
The following is a sample of the Input Configuration section of migration.properties
and detokenization.properties
files:
Tokenization configurations
The TokenSpec Configuration and AlgoSpec Configuration are sections of the migration.properties
file. It contains all the necessary information required for the tokenization of input data. For example, token format that will be used to tokenize a column and whether the generated token will pass or fail luhn check.
The following sample shows the configuration of AlgoSpec and TokenSpec:
Decryption configuration
For tokenization of encrypted text the Decryption Configuration section in migration.properties file needs to be configured as mentioned here:
Note
If the above mentioned parameters are not being used, they must be left blank.
Detokenization configuration
The TokenSpec Configuration and AlgoSpec Configuration are sections in detokenization.properties
file. It contains all the necessary information required for the bulk detokenization of tokenized input data. For example, the token format that will be used to detokenize a column. The following is a sample of the AlgoSpec Configuration and TokenSpec Configuration sections of detokenization.propertie
s file:
Output configuration
The output configuration is a section in migration.properties/detokenization.properties files. It contains all the necessary information required for creation of output data file. For example the output data file name and path or the destination database details is passed to the bulk utility as the parameter in output configuration section of properties file.The configuration section must contain the correct information, in the correct order.
The Output.Sequence parameter in the migration.properties/detokenization.properties file specifies the sequence, of the input columns and tokenized columns, in which they are written to the output file. If a column of input file has to appear in the output file then its column index needs to be specified in the output sequence. The sequence number can either be positive or negative. Positive sequence number indicates that the decrypted and/or tokenized value for a column is written to the output file, if the column was decrypted and/or tokenized. While negative sequence number indicates that the original value for a column, from the input file is written to the output file. For any columns that are not decrypted and/or tokenized the specified sequence number has the same effect.
The following is a sample of the Output Configuration section of migration.properties
and detokenization.properties
files: