How to verify if DataStage project has any corrupted files

Question

If DataStage project or temp directory runs out of disk space, some of the buffered write requests might not get correctly completed and the project might ended up having corrupted hash files. There is one approach to check the project for corrupted hash files.

Answer

Run the uvbackup process and redirect the backup to null devices. The uvbackup produces an output file that can be used to identify the corrupted files.
Here are the steps (please note that you must be logged in as DataStage Administrator):

  1. Source your dsenv file in $DSHOME (. ./dsenv)
  2. Go to your project directory (../InformationServer/Project/<project name>)
  3. List all files and direct them to a file (ls > myfiles.txt) - this is used to list of files for the uvbackup
  4. Run the uvbackup and redirect output to null with this command: "$DSHOME/bin/uvbackup -V -f -cmdfil myfiles.txt -s uvbackupout.txt -t /dev/null  2>&1 > testing123.txt"
  5. grep "WARNING:" uvbackupout.txt


The output file: uvbackupout.txt will help to identify if there are any corrupted files in the project..

Here is an example what you might see in the uvbackupout.txt file (output of the grep command in step 5)

WARNING: Unable to open file 'RT_STATUS3' for reading. File not saved!^G

The uvbackup verifies the integrity of the files and will not backup any files that are corrupted.





Popular posts from this blog

Shrink you container size up to 95%.

alma linux: dnf Module yaml error: Unexpected key in data