Support Questions

Find answers, ask questions, and share your expertise

How does the XML Ingest Mapper for Hadoop-Solr parse a xml file?

avatar
New Contributor

Can anyone point out how the output document of a XML file ingested by the Hadoop-Solr XML Ingest Mapper looks like?

1 ACCEPTED SOLUTION

avatar
Super Guru

Start here: http://lucene.apache.org/solr/quickstart.html

Search for "Indexing Solr XML" and perform the steps indicated.

In the end, you could browse the documents indexed at http://localhost:8983/solr/gettingstarted/browse. That is how the output you are interested looks like. Of courser, replace "localhost" with your case host in the URL. The /browse UI view defaults to assuming the gettingstarted schema and data are a catch-all mix of structured XML, JSON, CSV example data, and unstructured rich documents. Your own data may not look ideal at first, though the /browse templates are customizable.

View solution in original post

1 REPLY 1

avatar
Super Guru

Start here: http://lucene.apache.org/solr/quickstart.html

Search for "Indexing Solr XML" and perform the steps indicated.

In the end, you could browse the documents indexed at http://localhost:8983/solr/gettingstarted/browse. That is how the output you are interested looks like. Of courser, replace "localhost" with your case host in the URL. The /browse UI view defaults to assuming the gettingstarted schema and data are a catch-all mix of structured XML, JSON, CSV example data, and unstructured rich documents. Your own data may not look ideal at first, though the /browse templates are customizable.