<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314833#M226213</link>
    <description>&lt;P&gt;Thank you.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We will work on this solution and get back. Do we have the same to read data from Amazon-s3 and put into the topic?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 19 Apr 2021 03:16:41 GMT</pubDate>
    <dc:creator>sriven</dc:creator>
    <dc:date>2021-04-19T03:16:41Z</dc:date>
    <item>
      <title>How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314795#M226193</link>
      <description>&lt;P&gt;We are looking for some kind of utility or tool to read the data from HDFS and place it in the kafka topic.&amp;nbsp;Appreciate your inputs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;From the community section, we came across this "&lt;SPAN&gt;You could use Apache NiFi with a ListHDFS + FetchHDFS processor followed by PublishKafka"...Can you provide more insight how this can be acheived&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you&lt;BR /&gt;Srinu&lt;/P&gt;</description>
      <pubDate>Fri, 16 Apr 2021 23:14:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314795#M226193</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-04-16T23:14:48Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314830#M226211</link>
      <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You can try Kafka Connect&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Reference:&amp;nbsp;&lt;A href="https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/kafka-connect/kafka-connect.pdf" target="_blank"&gt;https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/kafka-connect/kafka-connect.pdf&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Apr 2021 02:42:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314830#M226211</guid>
      <dc:creator>Daming Xue</dc:creator>
      <dc:date>2021-04-19T02:42:34Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314833#M226213</link>
      <description>&lt;P&gt;Thank you.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We will work on this solution and get back. Do we have the same to read data from Amazon-s3 and put into the topic?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Apr 2021 03:16:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314833#M226213</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-04-19T03:16:41Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314836#M226215</link>
      <description>&lt;P&gt;Hello&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You can try below&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/kafka-connect/topics/kafka-connect-connector-s3-example.html" target="_blank"&gt;https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/kafka-connect/topics/kafka-connect-connector-s3-example.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Apr 2021 03:27:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314836#M226215</guid>
      <dc:creator>Daming Xue</dc:creator>
      <dc:date>2021-04-19T03:27:40Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314905#M226246</link>
      <description>&lt;P&gt;We already have the data in HDFS and we want to pull the data from HDFS and put in in kafka topic.&lt;/P&gt;&lt;P&gt;So, we are looking for source connector here in pulling the data from HDFS and placing in kafka.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Apr 2021 18:34:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314905#M226246</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-04-19T18:34:54Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314906#M226247</link>
      <description>&lt;P&gt;Same as previous request. We are looking for source connector here as well to pull the data from Amazon s3 and put in kafka.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Apr 2021 18:38:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314906#M226247</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-04-19T18:38:50Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314950#M226256</link>
      <description>&lt;P&gt;Hello &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/25363"&gt;@sriven&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- As&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/11821"&gt;@Daming Xue&lt;/a&gt;&amp;nbsp;mentioned Kafka Connect is one of the good options, the doc&amp;nbsp;&lt;A href="https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/kafka-connect/kafka-connect.pdf" target="_blank"&gt;https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/kafka-connect/kafka-connect.pdf&lt;/A&gt;&amp;nbsp;shares an example of HDFS as a sink connector.&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.cloudera.com/cdp-private-cloud-base/7.1.5/kafka-connect/topics/kafka-connect-connector-hdfs-example.html" target="_blank"&gt;https://docs.cloudera.com/cdp-private-cloud-base/7.1.5/kafka-connect/topics/kafka-connect-connector-hdfs-example.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Flume (CDH)&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.cloudera.com/documentation/kafka/latest/topics/kafka_flume.html#concept_rsb_tyb_kv__section_iwb_tyb_kv" target="_blank"&gt;https://docs.cloudera.com/documentation/kafka/latest/topics/kafka_flume.html#concept_rsb_tyb_kv__section_iwb_tyb_kv&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Nifi (&lt;A href="https://blog.cloudera.com/adding-nifi-and-kafka-to-cloudera-data-platform/" target="_blank"&gt;https://blog.cloudera.com/adding-nifi-and-kafka-to-cloudera-data-platform/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.cloudera.com/t5/Community-Articles/Integrating-Apache-NiFi-and-Apache-Kafka/ta-p/247433" target="_blank"&gt;https://community.cloudera.com/t5/Community-Articles/Integrating-Apache-NiFi-and-Apache-Kafka/ta-p/247433&lt;/A&gt;&amp;nbsp;)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Kafka- Hive Integeration (&lt;A href="https://docs.cloudera.com/cdp-private-cloud-base/7.1.5/integrating-hive-and-bi/topics/hive-kafka-integration.html" target="_blank"&gt;https://docs.cloudera.com/cdp-private-cloud-base/7.1.5/integrating-hive-and-bi/topics/hive-kafka-integration.html&lt;/A&gt;)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Custom Java app (&lt;A href="https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/kafka-developing-applications/topics/kafka-develop-example-producer.html" target="_blank"&gt;https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/kafka-developing-applications/topics/kafka-develop-example-producer.html&lt;/A&gt;)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- To try out quickly (testing purpose), you can use the console producer&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;hadoop fs -cat file.txt |&amp;nbsp;&lt;/SPAN&gt;kafka-console-producer --broker-list &amp;lt;host:port&amp;gt;&lt;SPAN class="Apple-converted-space"&gt;&amp;nbsp; &lt;/SPAN&gt;--topic &amp;lt;topic&amp;gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.cloudera.com/cdp-private-cloud-base/7.1.5/kafka-managing/topics/kafka-manage-cli-producer.html" target="_blank"&gt;https://docs.cloudera.com/cdp-private-cloud-base/7.1.5/kafka-managing/topics/kafka-manage-cli-producer.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- Spark (which you do not want)&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/developing-spark-applications/topics/spark-using-spark-streaming.html" target="_blank"&gt;https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/developing-spark-applications/topics/spark-using-spark-streaming.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;These are some I could quickly think of, there must be many more options.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Nandini&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;P.S. If you found this answer useful please upvote/accept.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 20 Apr 2021 10:23:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314950#M226256</guid>
      <dc:creator>Nandinin</dc:creator>
      <dc:date>2021-04-20T10:23:37Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314971#M226265</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/78191"&gt;@Nandinin&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;We have requirement like spark program are writing the files into HDFS.&lt;/P&gt;&lt;P&gt;So we want to read those files and send to kaka.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We know HDFS sink connector is useful for writing to HDFS as well and HDFS source connector is useful when the files written by hdfs sink connector.&lt;/P&gt;&lt;P&gt;HDFS source connector is also not the solution if files written by spark programming.&lt;/P&gt;&lt;P&gt;Please let us know if there is any solutions for this requirement?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 20 Apr 2021 13:55:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/314971#M226265</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-04-20T13:55:35Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315046#M226299</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What is the file format?&lt;/P&gt;&lt;P&gt;Why is it that you say&amp;nbsp;&lt;SPAN&gt;HDFS source connector is also not the solution if files written by spark programming.?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;BR /&gt;Spark - HDFS - Kafka is your entire flow correct?&lt;BR /&gt;Spark to HDFS they have done now you are looking for HDFS - Kafka.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;If you can help me understand the file format that Spark saves it while I can find if HDFS Source connector should not be able to help your usecase.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 21 Apr 2021 12:01:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315046#M226299</guid>
      <dc:creator>Nandinin</dc:creator>
      <dc:date>2021-04-21T12:01:40Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315052#M226301</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/78191"&gt;@Nandinin&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Yes the flow is correct&lt;/P&gt;&lt;P&gt;The files are in&amp;nbsp;parquet format.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 21 Apr 2021 19:49:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315052#M226301</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-04-21T19:49:53Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315358#M226437</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/25363"&gt;@sriven&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Found this -&amp;nbsp;&lt;A href="https://community.cloudera.com/t5/Support-Questions/How-to-insert-parquet-file-to-Kafka-and-pass-them-to-HDFS/td-p/178340" target="_blank"&gt;https://community.cloudera.com/t5/Support-Questions/How-to-insert-parquet-file-to-Kafka-and-pass-them-to-HDFS/td-p/178340&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please let me know if it helps.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;&lt;P&gt;Nandini&lt;/P&gt;</description>
      <pubDate>Tue, 27 Apr 2021 04:58:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315358#M226437</guid>
      <dc:creator>Nandinin</dc:creator>
      <dc:date>2021-04-27T04:58:49Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315539#M226470</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/78191"&gt;@Nandinin&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;We have gonbe through this already.&lt;/P&gt;&lt;P&gt;Anything without Scala/Spark ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 18:46:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315539#M226470</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-04-28T18:46:52Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315544#M226472</link>
      <description>&lt;P&gt;Please try Kafka connect then, that seems to be the best option suited.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Apr 2021 21:34:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315544#M226472</guid>
      <dc:creator>Nandinin</dc:creator>
      <dc:date>2021-04-28T21:34:16Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315688#M226537</link>
      <description>&lt;P&gt;How to read parquet files using Kconnect.?&lt;/P&gt;&lt;P&gt;In simple,We just want to read the parquet files on HDFS using kconnect and without spark jobs?&lt;/P&gt;&lt;P&gt;Please let us know if there is a solution or not?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 30 Apr 2021 22:39:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315688#M226537</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-04-30T22:39:41Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315827#M226584</link>
      <description>&lt;P&gt;As you know,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We have limitation with source kafka connector that it works for HDFS&amp;nbsp; objects/files created only by the&amp;nbsp;&lt;A href="https://docs.confluent.io/kafka-connect-hdfs/current/index.html" target="_blank"&gt;HDFS 2 Sink Connector for Confluent Platform &lt;/A&gt;&lt;/P&gt;&lt;P&gt;and how we can pull the files if created by other spark,mapreduce or any other jobs on HDFS?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The use case of HDFS source connector is only to mirror the same data on kafka.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 03 May 2021 14:44:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315827#M226584</guid>
      <dc:creator>sriven</dc:creator>
      <dc:date>2021-05-03T14:44:00Z</dc:date>
    </item>
    <item>
      <title>Re: How to read data from HDFS and place into Kafka (don’t want to use Scala/Spark)?  Any utilities or methods?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315969#M226656</link>
      <description>&lt;P&gt;Please try Nifi - Kakfa&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.cloudera.com/t5/Community-Articles/Apache-NiFi-1-10-Support-for-Parquet-RecordReader/ta-p/282390#:~:text=With%20the%20release%20of%20Apache,data%20as%20a%20single%20unit" target="_blank"&gt;https://community.cloudera.com/t5/Community-Articles/Apache-NiFi-1-10-Support-for-Parquet-RecordReader/ta-p/282390#:~:text=With%20the%20release%20of%20Apache,data%20as%20a%20single%20unit&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 07 May 2021 05:34:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-read-data-from-HDFS-and-place-into-Kafka-don-t-want/m-p/315969#M226656</guid>
      <dc:creator>Nandinin</dc:creator>
      <dc:date>2021-05-07T05:34:50Z</dc:date>
    </item>
  </channel>
</rss>

