Solved: Java Mapping eats superscripts!

Former Member · ‎05-05-2009

Hi All,

This is reference to my previous thread. I'm still trying to locate the missing superscripts and closing tags.

[XML not well formed|]

I suspect the tiny Java mapping used is causing problems. Would greatly appreciate the expert comments/ suugestions on the mapping ( I'm not a Java guy )

public class SeparateFile implements StreamTransformation {

String strXML = new String();

AbstractTrace trace;

private Map param = null;

public void setParameter(Map param) {

this.param = param;

}

public void execute(InputStream in, OutputStream out) {

int strBegin, strEnd;

String ns1String = new String();

String ns2String = new String();

String ns3String = new String();

String headerString = new String();

String outString = new String();

String xmldecl = "<?xml version=\"1.0\" encoding=\"utf-8\"?>";

//String StrXML = new String();

String outString1 = " ";

String BillStr[] = null;

String[] BillStrMod = new String[2000];

trace =

(AbstractTrace) param.get(

StreamTransformationConstants.MAPPING_TRACE);

trace.addInfo("Process Started");

String line = new String();

try {

StringBuffer strbuffer = new StringBuffer();

byte[] b = new byte[4096];

for (int n;(n = in.read(b)) != -1;) {

strbuffer.append(new String(b, 0, n));

}

strXML = strbuffer.toString();

} catch (Exception e) {

System.out.println("Exception Occurred");

}

/************ NameSpace and the Root Element is Trimmed here

<ns1:MT_BillPrint xmlns:ns1=\"http://londonhydro.com/Matrix/BILL/BillPrint\">

and each invoice taken into array

************/

strXML = strXML.substring(114, strXML.length());

BillStr = strXML.split("</invextract>", -1);

BillStr[0] = xmldecl.concat(BillStr[0]);

/************ Append The Array Values with

and write it to string buffer.

************/

for (int cnt = 0; cnt < BillStr.length - 1; cnt++) {

BillStrMod[cnt] =

BillStr[cnt].concat("</invextract>");

outString = outString.concat(BillStrMod[cnt]);

}

in = new ByteArrayInputStream(outString.getBytes());

try {

out.write(outString.getBytes());

} catch (Exception e) {

System.out.println("Exception in Writing to output Stream");

}

Thanks in advance!

Anish

stefan_grube · ‎05-19-2009

byte[] b = new byte[4096];
for (int n;(n = in.read(b)) != -1;) {
strbuffer.append(new String(b, 0, n));
}
strXML = strbuffer.toString();

I think, this could be the issue.

UTF-8 is a variable length codepage, some characters like ³ are represented with two, three or four bytes. The in.read() assigns exactly 4096 bytes (not characters), so the byte representation for ³ might be split.

When it is split, the first byte alone is no UFT-8 character, but the second byte starts a sequence and together with the " cannot interpreted as well, so both character are not visible.

When you have 19969 occurrecies of ³ and 5 of them are wrong, that is exactly what is expected based on probability, as each ³-character has a probability of 1/4096 to be split.

You have to change that piece of code.

Regards

Stefan

Java Mapping eats superscripts!

Accepted Solutions (1)

Accepted Solutions (1)

Answers (0)

Re: SAC Measure input control doesn't work

fuzzy search with multiple name fields

Re: Using OData in a Ui-Integration Card (Componen...

Re: Building SAP Asset Manager Client (MDK-23.8.7 ...

Re: Building SAP Asset Manager Client (MDK-23.8.7 ...