How to reduce the amount of failed data created in Information Steward?
I am developing rules to evaluate quality of data in a system. And the tables have millions of rows (6 - 15 million records). So every time i create a new rule on the table and want to evaluate the score, I end up creating huge amounts of failed data (as i have to evaluate the score for all the rules bound to that table). Also, at this point of time am not planning to create any reports from the failed data. (as am still in development phase)
I have implemented the following to reduce the amount of failed data being created;
a. Not to select the option to store failed data in 'Failed Data Repository' in the rule task.
b. Create a task to evaluate the scores after binding decent number of rules to it. (to an extent practically possible)
It would be very helpful if you can share ideas to reduce the amount of failed data being created ?