Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Is there any tool in Hadoop which can do the language translation on my data?

Solved Go to solution
Highlighted

Is there any tool in Hadoop which can do the language translation on my data?

Contributor

I have data in a SQL Server RDBMS. The data is in French and I need to save that data on hdfs. I also need the data translated into English.

1 ACCEPTED SOLUTION

Accepted Solutions

Re: Is there any tool in Hadoop which can do the language translation on my data?

Guru

There are a number of online translation services which can be used to do this. Most of them work as REST APIs, which you can integrate into your ingestion process, whether that is through realtime ingest via something like Storm, or post processing through a custom UDF, or Oozie process.

Something to look at would be the YandexTranslate processor in Hortonworks Data Flow. So you could for example use the ExecuteSQL process to get data out of your SQL Server and then translate the content with the YandexTranslate processor, before using PutHDFS to store the data in HDP.

2 REPLIES 2

Re: Is there any tool in Hadoop which can do the language translation on my data?

Guru

There are a number of online translation services which can be used to do this. Most of them work as REST APIs, which you can integrate into your ingestion process, whether that is through realtime ingest via something like Storm, or post processing through a custom UDF, or Oozie process.

Something to look at would be the YandexTranslate processor in Hortonworks Data Flow. So you could for example use the ExecuteSQL process to get data out of your SQL Server and then translate the content with the YandexTranslate processor, before using PutHDFS to store the data in HDP.

Re: Is there any tool in Hadoop which can do the language translation on my data?

Mentor

@bandhu gupta has this been resolved? Can you accept best answer or provide your own solution?

Don't have an account?
Coming from Hortonworks? Activate your account here