
Online Batch indexer for Solr cannot Parse.


I am running the online batch indexer and I am getting the error below. How can I find more information about this error, or solve it?

Thanks in advance.

 

Error: org.kitesdk.morphline.api.MorphlineRuntimeException: org.kitesdk.morphline.api.MorphlineRuntimeException: Cannot parse
  at org.kitesdk.morphline.base.FaultTolerance.handleException(FaultTolerance.java:73)
  at org.apache.solr.hadoop.morphline.MorphlineMapRunner.map(MorphlineMapRunner.java:213)
  at org.apache.solr.hadoop.morphline.MorphlineMapper.map(MorphlineMapper.java:86)
  at org.apache.solr.hadoop.morphline.MorphlineMapper.map(MorphlineMapper.java:54)
  at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:145)
  at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
  at org.apache.hadoop.mapred.MapTask.run(MapTask.java:340)
  at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
  at java.security.AccessController.doPrivileged(Native Method)
  at javax.security.auth.Subject.doAs(Subject.java:415)
  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
  at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
  Caused by: org.kitesdk.morphline.api.MorphlineRuntimeException: Cannot parse
  at org.kitesdk.morphline.solrcell.SolrCellBuilder$SolrCell.doProcess(SolrCellBuilder.java:245)
  at org.kitesdk.morphline.stdio.AbstractParser.doProcess(AbstractParser.java:93)
  at org.kitesdk.morphline.base.AbstractCommand.process(AbstractCommand.java:159)
  at org.kitesdk.morphline.base.Connector.process(Connector.java:64)
  at org.kitesdk.morphline.stdlib.LogDebugBuilder$LogDebug.doProcess(LogDebugBuilder.java:58)
  at org.kitesdk.morphline.base.AbstractCommand.process(AbstractCommand.java:159)
  at org.kitesdk.morphline.stdlib.TryRulesBuilder$TryRules.doProcess(TryRulesBuilder.java:104)
  at org.kitesdk.morphline.base.AbstractCommand.process(AbstractCommand.java:159)
  at org.kitesdk.morphline.base.Connector.process(Connector.java:64)
  at org.kitesdk.morphline.base.AbstractCommand.doProcess(AbstractCommand.java:181)
  at org.kitesdk.morphline.tika.DetectMimeTypeBuilder$DetectMimeType.doProcess(DetectMimeTypeBuilder.java:166)
  at org.kitesdk.morphline.base.AbstractCommand.process(AbstractCommand.java:159)
  at org.kitesdk.morphline.base.Connector.process(Connector.java:64)
  at org.kitesdk.morphline.base.AbstractCommand.doProcess(AbstractCommand.java:181)
  at org.kitesdk.morphline.stdlib.SeparateAttachmentsBuilder$SeparateAttachments.doProcess(SeparateAttachmentsBuilder.java:79)
  at org.kitesdk.morphline.base.AbstractCommand.process(AbstractCommand.java:159)
  at org.kitesdk.morphline.base.AbstractCommand.doProcess(AbstractCommand.java:181)
  at org.kitesdk.morphline.base.AbstractCommand.process(AbstractCommand.java:159)
  at org.apache.solr.hadoop.morphline.MorphlineMapRunner.map(MorphlineMapRunner.java:201)
  ... 10 more
  Caused by: java.io.IOException: Invalid header signature; read 0x87B01A1F1446D478, expected 0xE11AB1A1E011CFD0
  at org.apache.poi.poifs.storage.HeaderBlock.<init>(HeaderBlock.java:140)
  at org.apache.poi.poifs.storage.HeaderBlock.<init>(HeaderBlock.java:115)
  at org.apache.poi.poifs.filesystem.NPOIFSFileSystem.<init>(NPOIFSFileSystem.java:265)
  at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:158)
  at org.kitesdk.morphline.solrcell.SolrCellBuilder$SolrCell.doProcess(SolrCellBuilder.java:243)
  ... 28 more

 


Re: Online Batch indexer for Solr cannot Parse.

Any ideas on how to debug this problem?
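
One way to narrow this down, going only by the innermost "Caused by" in the trace: Tika's OfficeParser handed the file to Apache POI, and POI rejected it because its first 8 bytes are not the OLE2 signature that legacy Office files (.doc, .xls, .ppt) start with. In other words, the file was routed to the Office parser but does not look like an OLE2 document. The sketch below is a standalone check, not part of the indexer or of POI. The class and method names are made up for illustration, and it assumes the suspect input files have first been copied to the local filesystem (for example with `hadoop fs -get`). It reads the first 8 bytes as a little-endian long, which is how the two values in the exception message are written, and compares them with the expected signature.

```java
import java.io.DataInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper, not part of POI, Tika, or the indexer.
public class Ole2HeaderCheck {

    // The "expected" value from the exception message.
    // On disk this is the byte sequence D0 CF 11 E0 A1 B1 1A E1.
    public static final long OLE2_SIGNATURE = 0xE11AB1A1E011CFD0L;

    // Reads the first 8 bytes of the stream as a little-endian long.
    // Throws EOFException if the stream holds fewer than 8 bytes.
    public static long readSignature(InputStream in) throws IOException {
        byte[] header = new byte[8];
        new DataInputStream(in).readFully(header);
        return ByteBuffer.wrap(header).order(ByteOrder.LITTLE_ENDIAN).getLong();
    }

    public static boolean isOle2(long signature) {
        return signature == OLE2_SIGNATURE;
    }

    // Usage: java Ole2HeaderCheck file1 [file2 ...]
    public static void main(String[] args) throws IOException {
        for (String path : args) {
            try (InputStream in = Files.newInputStream(Paths.get(path))) {
                long signature = readSignature(in);
                System.out.printf("%s: 0x%016X %s%n", path, signature,
                        isOle2(signature) ? "(OLE2)" : "(not OLE2)");
            }
        }
    }
}
```

Any file reported as "not OLE2" that still carries an Office extension or MIME type is a candidate for the record that fails in the mapper. Whether to fix the file, exclude it from the input, or adjust the MIME type handling in the morphline depends on what the file really is.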