Created on 04-24-2014 01:58 PM - edited 09-16-2022 01:57 AM
Hi,
I am trying to get more information on extracHbaseCells command.How ever I am unable to find it in the morphline reference guide.Can some one please let me know where I can find the documentation on this.The following is the refernce guide Iam looking at.
http://cloudera.github.io/cdk/docs/0.9.1/cdk-morphlines/morphlinesReferenceGuide.html
Created 04-24-2014 02:33 PM
Created 04-24-2014 02:02 PM
Created 04-24-2014 02:29 PM
Hi ,
I understood that part.But let us say i extract an xml from the hbase cell with following elements(name,city,country) and I want to index the solr .My solr schema also has fields (name,city and country).Now I need to parse the xml ,ge these fileds and index it to solr.
extractHBaseCells {
mappings : [
{
inputColumn : "messages:name"
outputField : "name"
type : String
source : value
}
{
inputColumn : "messages:city"
outputField : "city"
type : String
source : value
}
{
inputColumn : "messages:country"
outputField : "country"
type : String
source : value
}
]
}
This would have been possible If was able to retrieve data from hbase in this format.But what extractHbaseCell would give me is an xml file.I am loking for a way to parse this using xquery and then assign then assign values to solr field.
Created 04-24-2014 02:33 PM
Created 04-25-2014 01:05 PM
Thank you.If I am not wrong can I can also put my java code to parse the the soap message here.I am a java guy,so it seems to be easy for me.
Created 04-25-2014 01:43 PM
Created 04-29-2014 11:46 AM
Hi,
I was able to parse the xml from stored in hbase and then put all the values into the record object.How do i set this to the required solr fields now ?.
My conf file is right npow like this
{extract habse}
{java # have all values extracted and set to the record object)
Now how can I set these extracted values to solr fields?..
Thanks,
Nishanth
Created 04-29-2014 12:04 PM
Created on 04-30-2014 10:53 AM - edited 04-30-2014 10:55 AM
Thanks a lot.I have created the solr cloud and was able to index a sample data(extract the mesagae and put it into one solr field) just to check that my configuration is correct and it works.
How ever when I try to extract data and assign it to solr schema elements it does not work.Extract hbase cell looks like this.Do I need to have _attachment_body" field or an "_attachment_mimetype" field defined in my schema?
extractHBaseCells {
mappings : [
{
inputColumn : "messages:*"
outputField : "_attachment_body"
type : byte[]
source : value
}
]
}
}