Support Questions
Find answers, ask questions, and share your expertise

Hive Long running query failed at times - Broken Pipe

Hive Long running query failed at times - Broken Pipe

Explorer

Hi,

I am running an ETL which populates the data from HIVE to MySQL. The SELECT Query have multiple UNION and the time taken for executing the job is about 1 hour. Most of the time the transformation(ETL) succeed and few times it is failed with the following exception. Moreover the Hive JDBC connection is on non-kerberized HDP cluster and there is non error logged into Hiveserver2/Hive metastore:

org.apache.thrift.transport.TTransportException 2018/01/18 15:11:21 - Table input 3.0 - 2018/01/18 15:11:21 - Table input 3.0 - at org.pentaho.di.core.database.Database.openQuery(Database.java:1768) 2018/01/18 15:11:21 - Table input 3.0 - at org.pentaho.di.trans.steps.tableinput.TableInput.doQuery(TableInput.java:236) 2018/01/18 15:11:21 - Table input 3.0 - at org.pentaho.di.trans.steps.tableinput.TableInput.processRow(TableInput.java:140) 2018/01/18 15:11:21 - Table input 3.0 - at org.pentaho.di.trans.step.RunThread.run(RunThread.java:62) 2018/01/18 15:11:21 - Table input 3.0 - at java.lang.Thread.run(Thread.java:748) 2018/01/18 15:11:21 - Table input 3.0 - Caused by: java.sql.SQLException: org.apache.thrift.transport.TTransportException 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.hive.jdbc.HiveStatement.waitForOperationToComplete(HiveStatement.java:365) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.hive.jdbc.HiveStatement.execute(HiveStatement.java:242) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.hive.jdbc.HiveStatement.executeQuery(HiveStatement.java:437) 2018/01/18 15:11:21 - Table input 3.0 - at org.pentaho.di.core.database.Database.openQuery(Database.java:1757) 2018/01/18 15:11:21 - Table input 3.0 - ... 4 more 2018/01/18 15:11:21 - Table input 3.0 - Caused by: org.apache.thrift.transport.TTransportException 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.transport.TIOStreamTransport.read(TIOStreamTransport.java:132) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.transport.TTransport.readAll(TTransport.java:86) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.transport.TSaslTransport.readLength(TSaslTransport.java:376) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.transport.TSaslTransport.readFrame(TSaslTransport.java:453) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.transport.TSaslTransport.read(TSaslTransport.java:435) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.transport.TSaslClientTransport.read(TSaslClientTransport.java:37) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.transport.TTransport.readAll(TTransport.java:86) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.protocol.TBinaryProtocol.readAll(TBinaryProtocol.java:429) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.protocol.TBinaryProtocol.readI32(TBinaryProtocol.java:318) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:219) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:77) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.hive.service.cli.thrift.TCLIService$Client.recv_GetOperationStatus(TCLIService.java:413) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.hive.service.cli.thrift.TCLIService$Client.GetOperationStatus(TCLIService.java:400) 2018/01/18 15:11:21 - Table input 3.0 - at sun.reflect.GeneratedMethodAccessor31.invoke(Unknown Source) 2018/01/18 15:11:21 - Table input 3.0 - at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) 2018/01/18 15:11:21 - Table input 3.0 - at java.lang.reflect.Method.invoke(Method.java:498) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.hive.jdbc.HiveConnection$SynchronizedHandler.invoke(HiveConnection.java:1374) 2018/01/18 15:11:21 - Table input 3.0 - at com.sun.proxy.$Proxy59.GetOperationStatus(Unknown Source) 2018/01/18 15:11:21 - Table input 3.0 - at org.apache.hive.jdbc.HiveStatement.waitForOperationToComplete(HiveStatement.java:332) 2018/01/18 15:11:21 - Table input 3.0 - ... 7 more

Is there any workaround/ configuration interms of Hive JDBC connnection to get rid of this error. ? Appreciate your feedback/suggestion. Thanks!!!

1 REPLY 1
Highlighted

Re: Hive Long running query failed at times - Broken Pipe

Explorer

Any suggestion?