HiveServer2 connection JDBC

Former Member

Hi,

I cannot connect to Hive via HiveServer2 when using some extra options (transportMode=http, httpPath=cliservice).

Here is the error I get when using PA on my desktop ("Requested array size exceeds VM limit ... ", see below).

The Hive DB itself is working fine: I can access it via beeline without any problem.

What does this error message mean? Any clue how to solve it?
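For reference, the two options quoted above are session settings that HiveServer2 clients such as beeline accept as part of the JDBC URL. The snippet below is only a minimal sketch of how such a URL is assembled; the helper name `hive_jdbc_url` and the host `hive-host.example.com` are placeholders invented for illustration, and the ports are the ones discussed later in this thread (10001 for HTTP, 10000 for binary).

```python
def hive_jdbc_url(host, port, database="default", transport_mode=None, http_path=None):
    """Assemble a HiveServer2 JDBC URL.

    Session settings such as transportMode and httpPath are appended
    after the database name, each introduced by a semicolon.
    """
    url = f"jdbc:hive2://{host}:{port}/{database}"
    if transport_mode:
        url += f";transportMode={transport_mode}"
    if http_path:
        url += f";httpPath={http_path}"
    return url


# Placeholder host; substitute the real HiveServer2 host name.
http_url = hive_jdbc_url(
    "hive-host.example.com", 10001, transport_mode="http", http_path="cliservice"
)
binary_url = hive_jdbc_url("hive-host.example.com", 10000)

print(http_url)
print(binary_url)
```

A URL of the first form is what beeline would be given after `!connect` or `-u`; whether a given PA driver accepts the same settings is a separate question, which the replies below address.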

Accepted Solutions (1)

PritiSAC
Advisor

Hello JB,

Antoine is absolutely correct about the supported Hive versions: Hive 1.2 is supported only in Automated mode, and we have not upgraded Expert mode beyond version 0.13. However, purely on the basis of technical compatibility, you could try this.

For ODBC/JDBC connectivity in PA, we have tried and tested the "Hive Server 2 port". This is usually port 10000 and is sometimes also called the Thrift server port. What you are trying is port 10001, which is the HTTP transport port for connecting to Hive. That works fine from beeline. For PA, you should be able to find the "Hive Server 2 port" in the Hive configuration on Hadoop and use this port for the connection without specifying any additional parameters.

Will this be okay for you?

Thanks,
Priti
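Before changing the PA configuration, it can help to confirm which of the two ports is actually reachable from the desktop. The following is a rough sketch using only the Python standard library; `port_is_open` is a hypothetical helper, the port numbers are the ones mentioned in this post, and the actual values should be taken from the Hive configuration on the cluster.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


# Ports as discussed in this thread; confirm them in the cluster's Hive settings.
HIVE_PORTS = {"binary (Thrift)": 10000, "http": 10001}


def check_hive_ports(host, ports=HIVE_PORTS):
    """Map each transport label to whether its port accepts TCP connections."""
    return {label: port_is_open(host, port) for label, port in ports.items()}
```

Calling `check_hive_ports("hive-host.example.com")` (placeholder host) returns a dictionary such as `{"binary (Thrift)": ..., "http": ...}`. An open port only shows that something is listening there; it does not prove which transport mode HiveServer2 is configured for.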

Former Member

I am using port 10001 on purpose. We do use that port (HTTP transport mode), not the default binary transport mode (port 10000).

Can SAP PA work with port 10001 (HTTP transport mode), or does it have to use port 10000?

Thanks for your feedback.

PritiSAC
Advisor

Hi Jean-Baptiste, sorry for the delayed reply. I ran every test I could, configuring the HTTP port instead of the binary one in the PA Expert tool, and I run into the same issue as you. I am afraid this does not work. So the only option is to use 'Binary' transport mode and the associated Thrift server port. This restriction also applies in Automated mode, by the way, because the driver that is shipped supports only Binary mode. Thanks.

Answers (3)

Former Member

Hi. I'd like to access TSV files on the Hadoop cluster, since HTTP transport mode is not supported and this is a blocking factor internally. Can I do that through Spark? It seems that Spark inside SAP PA can only be used on Hive tables. Is this correct? Thanks in advance for the information.

alanmcshane
Advisor

Hi Jean-Baptiste,

Automated works fine with Hive tables created from TSV files or from files that use other delimiters.

You can use Spark via Native Spark Modeling in Automated, which uses the Spark Scala APIs to build the model on the Hadoop cluster. It currently supports Hive tables.

thanks,

Alan

Message was edited by: Alan McShane to be clearer

PritiSAC
Advisor

Hi Jean-Baptiste, we finally seem to have worked out a solution for Hive HTTP mode.

If you use the Simba or Hortonworks (based on Simba) drivers, the ODBC driver can work with HTTP mode on Hive. Using the settings in the driver window, you can specify the transport mode, http_path and other details as needed. PA can then be used for subsequent actions.

This means that the embedded DD driver cannot be used for HTTP mode on Hive.

Let us know if you face any other issues and whether you can work with Simba or other compliant drivers.
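The same driver-window settings can usually also be expressed as a DSN-less ODBC connection string. What follows is a sketch under assumptions, not a verified configuration: the helper name, the host and the driver name "Hortonworks Hive ODBC Driver" are placeholders, and the key names (`HiveServerType`, `ThriftTransport`, `HTTPPath`) and the value `2` for HTTP transport are quoted from memory of the Simba-based driver documentation, so they should be checked against the install guide of the driver version in use.

```python
def hive_odbc_connection_string(driver, host, port, http_path, schema="default", extra=None):
    """Build a DSN-less ODBC connection string for a Simba-based Hive driver.

    Key names and values are assumptions to verify in the driver's own
    documentation; nothing here has been tested against SAP PA.
    """
    settings = {
        "Driver": "{" + driver + "}",  # driver name as registered in the ODBC manager
        "Host": host,
        "Port": str(port),
        "HiveServerType": "2",         # assumed: 2 selects HiveServer2
        "ThriftTransport": "2",        # assumed: 2 selects HTTP transport
        "HTTPPath": http_path,
        "Schema": schema,
    }
    if extra:
        # Authentication and other site-specific settings go here.
        settings.update(extra)
    return ";".join(f"{key}={value}" for key, value in settings.items())


conn_str = hive_odbc_connection_string(
    driver="Hortonworks Hive ODBC Driver",  # placeholder driver name
    host="hive-host.example.com",           # placeholder host
    port=10001,
    http_path="cliservice",
)
print(conn_str)
```

Authentication settings are deliberately left to the `extra` argument because they depend on how the cluster is secured.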

Thanks.

Thank you alan.mcshane@sap.com

Former Member

Hi,

I got it to work with the Hortonworks ODBC driver. However, when creating an entity, I get an error message (see below). Any clue what I should do here? (Using SAP PA 2.5.0.)

Thx in advance,

JB

PritiSAC
Advisor

Yes. Please change the metadata repository to a local folder on the desktop or on the PA server. We don't support saving Data Manager or model objects on the Hadoop cluster.

abdel_dadouche
Active Contributor

Hi Jean-Baptiste,

Did Priti's comment help solve your problem?

Regards

@bdel

achab
Product and Topic Expert

Hi Jean-Baptiste, please see if this helps. Requested array size exceeds VM limit when crea... | SCN

Also looping

Former Member

This line works in beeline (see screenshot below). What should I change in the configuration I set up (see above)?

Thx in advance.

achab
Product and Topic Expert

I see from your screenshot that the Hive DB you are using is version 1.2.1. Is this correct?

If yes, we do not support this particular version in Expert Analytics, only in Automated Analytics. See our PAM here: https://support.sap.com/content/dam/library/ssp/infopages/pam-essentials/Pred_Ana_20.pdf

That could be a reason...

I am not a big data expert, I will loop in colleagues to provide more input.

Thanks & regards


Antoine

Former Member

It is indeed the version 1.2.1. Thx in advance for asking your colleagues.

JB

achab
Product and Topic Expert

Done 😉 They will reach out, hopefully soon. Give it a try with Automated Analytics, in the meantime?