
DS XI 3.0: Output is the format mask of the to_char function rather than the data

Former Member

Data Services XI 3.0 (12.0.0.0)

Just started using DS and thought I was doing something wrong, but it now appears to me to be a bug.

Background: The intention is to standardise the data coming from different sources, so we want to generate a single Unique Reference Number (URN) from each source. Some sources already have a single-column unique reference and others have composite columns.

I create a data flow (let's call it dataflow_a) which essentially has a Flat File source, a query to read that flat file and the output goes to a template table (stored on Oracle 10g database) (let's call it output_a) which has "Make Port" checked as I require the data flow to be an embedded source in another data flow. The query does a little manipulation of the incoming flat data so that it can be read from the file (i.e. it needs rtrimming and there are some dates without times etc.) and the output columns are a mixture of "int", "varchar" and "datetime". The data in this has a composite key to identify unique records and that is made up of 4 columns... ref_no (int), ref_year (int), region (varchar), seq_no (int)

If this dataflow is executed (batch job) then the data is read into the template table ok. So far so good.

dataflow_a:


flatfile - - - - query - - - - output_a (port)

Now, to generate the single URN, dataflow_a is used as a source in another dataflow (let's call it dataflow_b). A query is attached to the output of dataflow_a and the columns mapped directly from the left side to the right side.

In addition, within the query, we add an Output column called "urn" for which we include a mapping of:

(I've split this onto separate lines at the concatenation points to make it easier to read on the forum)


ltrim_blanks(to_char(dataflow_a.ref_no,'099999'))||
ltrim_blanks(to_char(dataflow_a.ref_year,'09'))||
dataflow_a.region||
ltrim_blanks(to_char(dataflow_a.seq_no,'09'))
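For clarity, the intended result of that mapping can be sketched in Python (illustrative only; the actual implementation here is DS mapping syntax, and the padding widths are read off the format masks above):

```python
# Illustrative sketch of the intended URN, assuming '099999' pads to
# 6 digits and '09' pads to 2 digits (as the DS format masks suggest).
def build_urn(ref_no: int, ref_year: int, region: str, seq_no: int) -> str:
    return f"{ref_no:06d}{ref_year:02d}{region}{seq_no:02d}"

print(build_urn(1234, 8, "SA", 3))  # -> '00123408SA03'
```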

The output is captured in another template table (output_b) on the same Oracle database.

dataflow_b:


dataflow_a - - - - query - - - - output_b

However, when I execute this job now, all the data is populated correctly apart from the URN column which shows...

09999909SA09

for each row, with only the region changing with the data. The ref_no, ref_year and seq_no only show the format mask rather than the actual data.

OK, so at first I thought I wasn't using to_char correctly, so I checked the manual: my usage was identical to the examples given, and it's documented to work with "int" types. I also tested with some hardcoded numbers rather than the data columns, just to check.

Now here's where it gets odd.

Firstly...

Within dataflow_b, if I put a second query pulling data from dataflow_a ...

dataflow_b:


dataflow_a - - - - query - - - - output_b
         \
          \- - - - query - - - - output_c

... and I execute this job, then the output_b data suddenly works correctly and I get ref_no, ref_year, region and seq_no concatenated correctly within it. If I then remove that second query and output_c and execute the job again, output_b reverts to showing the format masks rather than the data.

Secondly...

Within the query for dataflow_b, if I change the mapping for the URN to read...


ltrim_blanks(to_char(dataflow_a.ref_no*1,'099999'))||
ltrim_blanks(to_char(dataflow_a.ref_year*1,'09'))||
dataflow_a.region||
ltrim_blanks(to_char(dataflow_a.seq_no*1,'09'))

or


ltrim_blanks(to_char(to_decimal(dataflow_a.ref_no,'.',',',1),'099999'))||
ltrim_blanks(to_char(to_decimal(dataflow_a.ref_year,'.',',',1),'09'))||
dataflow_a.region||
ltrim_blanks(to_char(to_decimal(dataflow_a.seq_no,'.',',',1),'09'))

then this also works ok.

So it would seem there is an issue with the to_char function recognising the datatype of the source columns (which are int), even though the documentation says that to_char accepts int. However, that still doesn't explain why adding a second query makes it work.
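The effect of the "*1" workaround can be illustrated with a toy Python analogue (an assumption about the mechanism, not actual DS internals): the arithmetic forces the value back to a numeric type before any string-typed overload can match, so the formatter takes the numeric branch.

```python
# Toy analogue (assumption, not DS internals): a formatter that, like
# to_char, picks its behaviour from the argument's runtime type.
def to_char_toy(value, mask):
    if isinstance(value, int):
        return str(value).zfill(len(mask))  # numeric branch: zero-pad
    return mask                             # non-numeric branch: the mask leaks through

trimmed = "1234"                      # what rtrim_blanks() would hand over
print(to_char_toy(trimmed, "099999"))           # '099999' - the bug symptom
print(to_char_toy(int(trimmed) * 1, "099999"))  # '001234' - the workaround
```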

Is it something I'm doing? Is it something anyone else can reproduce? Is it a known bug?

Your input is appreciated.

Giles

Edited by: Giles Cartmel on Sep 8, 2008 3:01 PM

Changed title to something a little more appropriate

Accepted Solutions (0)

Answers (2)


Former Member

Greetings Thread Originator,

This post is older than 60 days and there are no entries in the past 30 days. Based on the content discussed, it appears that your question has been answered. This message is being marked as answered and points are being assigned if available where possible.

Thank you for being an active participant in the SAP Forums,

Rob Siegele

Forum Moderator

SAP Americas

Former Member

OK, after a little more testing (with simple reproducible test data) and investigation on my part I've found further evidence...

My source data (flat file) is like...

1¬¬¬¬¬<tab>2¬<tab>XX<tab>1¬<tab>Text Data¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬<tab>Text Data¬¬¬¬¬¬¬¬¬¬¬¬¬<newline>
2¬¬¬¬¬<tab>2¬<tab>XX<tab>1¬<tab>Text Data¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬<tab>Text Data¬¬¬¬¬¬¬¬¬¬¬¬¬<newline>
2¬¬¬¬¬<tab>2¬<tab>XX<tab>2¬<tab>Text Data¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬<tab>Text Data¬¬¬¬¬¬¬¬¬¬¬¬¬<newline>
1¬¬¬¬¬<tab>3¬<tab>XX<tab>1¬<tab>Text Data¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬¬<tab>Text Data¬¬¬¬¬¬¬¬¬¬¬¬¬<newline>

Where ¬ is a space, i.e. I have fixed-width columns with tab field delimiters.

When I read the flat file using the query, I do an RTRIM_BLANKS on all the data, whether numeric or text before it gets put into output_a where the columns are the corresponding int or varchar datatypes.

What appears to be happening is that because I've used RTRIM_BLANKS on the numeric data which is going into "int" columns in the output template table, it is somehow remembering that the data is varchar (the output data type of the rtrim_blanks function).

If I wrap the rtrim_blanks with a to_decimal conversion or just remove the rtrim_blanks (the source flat file has the column defined as "int" too), then the problem is solved.

However, this still doesn't explain why, in my first post, adding a second query taking data from the dataflow_a as a source, made it work too.

It seems to me that there is still a bug around implicit conversions of datatypes which are not passing through the flows correctly. Certainly, if my output_a has a column defined as "int" I would expect it to implicitly convert the data to int and treat it as such from that point onwards, but it's not.

Can anyone else reproduce this, or tell me that I'm doing something blatantly stupid that I haven't seen, or is it really a bug as I think it is?

Thanks

Former Member

I guess from the number of views of this thread and the lack of responses, that nobody has got a clue.

?

werner_daehn
Active Contributor

Well, first I can tell you I had to read your post a couple of times to understand all tests and possible implications. And I am still not sure.

What you seem to overlook is the optimizer. If you have a varchar assigned to an int in one query, that query might not exist physically as it could get merged with others or split by the optimizer. Second, if the optimizer identifies you read a decimal, convert it to varchar and write it as decimal again, it might not even do the conversion - what for.

But logically, both cannot be the root cause to your finding. Converting a varchar to an int should happen somewhere inside the dataflow and certainly when you use a to_char(number) function. A pushdown of that into the source database cannot be the cause either, you are reading from a flat file.

The case where you split the data and suddenly even the unchanged data-stream works fine is an indicator that something along those lines happens. But what???

The to_char() was a complete surprise at first but then I made a quick test:

print(to_char(0, '09090909')) returned> '00000000'

print(to_char(sysdate(), '09090909')) returned> '09090909'

So it depends on what to_char() believes it gets as an input datatype, number or date. As you pass a varchar into to_char(), it is not obvious. Actually, could that be the root cause???

Former Member

Whilst I know what optimizers are (from developing on Databases) I haven't a clue how the optimizer works on DS.

That wasn't something that was taught on our Business Objects DI and DQ courses.

The most in depth they went were the pushing down to the database stuff.

Sorry if the above was a little confusing, but I was still investigating to try and find the causes myself.

Also, being a regular contributor on another forum, I know how annoying it can be when someone posts a question and doesn't provide sufficient details.

Anyway I can reproduce the issue with this following simple example if this helps...

Flat file: (¬ = space, <tab> = tab)

1¬¬¬¬¬<tab>2¬<tab>XX<tab>Test1
2¬¬¬¬¬<tab>2¬<tab>XX<tab>Test2
3¬¬¬¬¬<tab>2¬<tab>XX<tab>Test3
1¬¬¬¬¬<tab>3¬<tab>YY<tab>Test4
2¬¬¬¬¬<tab>3¬<tab>YY<tab>Test5
1¬¬¬¬¬<tab>4¬<tab>ZZ<tab>Test6

Create a flat file template ff_src with columns

logno int

year int

region varchar (2)

text varchar (5)

Properties:

Location: Job Server

Root Directory and File name as appropriate

Type: Delimited

Delimiter Column: Tab

Delimiter Row: {new line}

Delimiter Text:

Create a Project

Create a Batch Job

Add a new Data Flow

Put ff_src as the source

Add a query and link ff_src into the query

Add a template table called test_src and link the output of the query to this

In the query...

Drag across logno, year, region and text from the left hand side to the right hand side

Add a new output column called "urn" of type varchar (20)

For the mappings for logno, year, region and text include an rtrim_blanks around each one...

rtrim_blanks(ff_src.logno)
rtrim_blanks(ff_src.year)
rtrim_blanks(ff_src.region)
rtrim_blanks(ff_src.text)

For the mapping for urn set this to...

ltrim_blanks(to_char(rtrim_blanks(ff_src.logno),'099999'))||
ltrim_blanks(to_char(rtrim_blanks(ff_src.year),'09'))||
ff_src.region

Run this and the output shows the format mask rather than the values.

Now I'm assuming that, based on what you say, even though logno and year are "int", the DS optimizer is looking at the destination column, seeing that it's varchar, perhaps even noticing that region (which is varchar also) is included in the mapping, and therefore treating everything as varchar rather than doing the implicit conversions that we would expect. As the to_char function expects date or numeric data types as its inputs, I would assume it should implicitly ensure that the input values are numbers even if they were varchar (which they're not in this case).

Does that give any more of a clue?

werner_daehn
Active Contributor
to_char(rtrim_blanks(ff_src.logno),'099999')

This is exactly the problem.

rtrim_blanks() returns a varchar.

to_char(number, string) and to_char(date, string) are defined, but not a version where the first parameter is a string. So DI assumes you would like to use the to_char(date, string) version of the to_char() function and hence converts your number into a date. The date is then converted into a char using the format string '099999', and for a date the format string replaces YYYY, MM, DD and such characters; all others remain as is - it returns '099999'.
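That date-overload behaviour can be mimicked with a small Python model (a sketch under the assumption that only date codes such as YYYY, MM and DD are substituted and every other character passes through untouched):

```python
import datetime

def to_char_date(value, fmt):
    # Toy model of the to_char(date, string) overload: substitute the
    # date codes and leave every other character as-is.
    out = fmt.replace("YYYY", f"{value.year:04d}")
    out = out.replace("MM", f"{value.month:02d}")
    out = out.replace("DD", f"{value.day:02d}")
    return out

d = datetime.datetime(2008, 9, 8)
print(to_char_date(d, "YYYYMMDD"))  # '20080908'
print(to_char_date(d, "099999"))    # '099999' - no date codes to replace
```

With no date codes present in '099999', the mask comes back verbatim, which is exactly the symptom in the original post.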

Former Member

> to_char(rtrim_blanks(ff_src.logno),'099999')
>
> This is exactly the problem.
>
> rtrim_blanks() returns a varchar.
>
> to_char(number, string) and to_char(date, string) are defined, but not a version where the first parameter is a string. So DI assumes you would like to use the to_char(date, string) version of the to_char() function and hence converts your number into a date. The date is then converted into a char using the format string '099999', and for a date the format string replaces YYYY, MM, DD and such characters; all others remain as is - it returns '099999'.

It's unusual functionality as it's not what I would expect and certainly not what you would get from any normal database type language, for example...

SQL> select to_char(rtrim('1234 '),'099999') from dual;

TO_CHAR
-------
001234

Here, rtrim returns a varchar just like DI does, but the to_char function, which is also defined to accept numbers and dates, assumes it's a number and does the implicit conversion to number before converting to varchar using the format mask, raising an "invalid number" error if the string isn't a number; i.e. for dates, the input must be of DATE type. Having written low-level parsers before myself, I know that the function overloading wouldn't look at the format mask to try and determine the intention behind what is being passed. However, it seems odd that DI assumes the date version of the to_char function is required over the number version, which would often be more common and is the way it is implemented in all other languages I've used (and I've used many in my 25 years of software development).
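Oracle's resolution strategy, as observed in the SQL above, can be sketched like this (a hypothetical analogue for illustration, not Oracle's actual implementation):

```python
def oracle_to_char(value, mask):
    # Hypothetical analogue of Oracle's TO_CHAR resolution for a string
    # argument: try the implicit string-to-number conversion first, and
    # fail loudly if it isn't a number rather than silently treating
    # the value as a date.
    try:
        n = int(str(value).strip())
    except ValueError:
        raise ValueError("invalid number")  # analogue of ORA-01722
    return str(n).zfill(len(mask))

print(oracle_to_char("1234 ", "099999"))  # -> '001234'
```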

Definitely one I'm going to have to watch out for.