Solved: Slow Indexing

Former Member · ‎10-16-2008

Hi all,

I have to index a repository of 20000 documents. It took 5 hours to index 1000 documents. if it continues like this it will take 100 hrs to complete the indexing.

What could be the reason for this low performance? We are having the portal and Trex on different servers.

In the portal server's trace file I can see the following message:

#1.5 #001321CCE41F00840000004F000022F80004595C881A43B2#1224153862312#com.sap.portal.prt.runtime#sap.com/irj#com.sap.portal.prt.runtime#index_service#189##n/a##7eaeefb09b6c11ddbc3d001321cce41f#SAPEngine_Application_Thread[impl:3]_37##0#0#Error##Java###04:14_16/10/08_0315_21412850

[EXCEPTION]

#1#com.sap.engine.services.servlets_jsp.server.exceptions.WebIllegalStateException: The stream has already been taken by method [getOutputStream()].

In the TREX admin's trace I can see several messages like:

[5752] 2008-10-16 19:23:47.312 e preprocessor Preprocessor.cpp(00941) : HTTP-GET failed for URL http:// <file name>

with Errorcode -5 , but HTTP-HEAD worked, trying again

[5752] 2008-10-16 19:23:47.421 e HTTPData Preprocessor.cpp(04944) : HTTPGET: Stop retries after 5 rounds, skipping

[5752] 2008-10-16 19:23:47.421 e preprocessor Preprocessor.cpp(00951) : HTTPHEAD failed for URL http:// : <file name>

Errorcode -5 , Message Reader::readHeaderSkip100 failed, url=http://<file name>

The TREX server has 16GB RAM.

What can be done to improve the performance?

Thanks and Regards,

Shyam.

Former Member · ‎10-17-2008

Hi Shyam,

Not sure if it could help you but the below guide makes recommendations for configuring search and classification (using TREX 6.1) for efficient indexing. It covers the following topics: fast initial indexing of large data sets; fast updating of indexes; and fast index replication in distributed TREX systems.

1) How to Configure TREX 6.1 for Efficient Indexing

https://www.sdn.sap.com/irj/sdn/howtoguides?rid=/library/uuid/1545e1bf-0d01-0010-a5ab-f80e574423bf

Hope that helps.

Ray

Slow Indexing

Accepted Solutions (1)

Accepted Solutions (1)

Answers (0)

Error getting value from 'ReadTimeout'

Re: Integrate an external task system to Cloud ALM...

[BIG PROBLEM] SAP Host Agent cannot connect to SYS...

What is the right SAP MARA field to map with CPI L...

SKU for Additional Storage