on 04-07-2014 12:03 PM
Dear Team,
I have UTF-8 (Native language) data stored in my Oracle database, I'm able to see the UTF-8 data in data services but not in Information steward where it showing junk values.
Any advice how to fix this? My oracle database is UTF-8 and while creating connection in CMC for Information steward I used the option UTF-8 code page. Also could someone throw some light on how to compare two UTF-8 based data for duplicate.
Any help would be sincerely appreciated.
Regards,
Tushar.
Hi Tushar,
I have checked with some Arabic data as source and did profiling and cleansing package. Both cases it is working fine. I have imported cleansing package in the Data Services and check the results , It is fine I don't have UTF-8 database that's why I used UTF-8 text file with Arabic data.
Please find the below screenshots.
Please check your settings . If possible could you please provide some sample data to test from my side.
Thanks & Regards,
Ramana.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Hi Tushar ,
Please change the setting in the Information steward file format option as shown below .
Please find the below profiling results .
For come to cleansing package builder I am getting the problem with data .
If I use break on white space only I cannot parse the data properly because of commas.
If I use the option break on white space is no then data is showing like this .
Thanks & Regards,
Ramana
User | Count |
---|---|
86 | |
10 | |
10 | |
9 | |
7 | |
7 | |
6 | |
5 | |
4 | |
4 |
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.