<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: WebHDFS Performance in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138785#M35332</link>
    <description>&lt;P&gt;I agree with your answer; you are spot on about the HTTP server performance implications. WebHDFS is really tempting, given that we can expose HDFS in a browser with minimal coding and integrate with non-Java clients as well. Do share if you get your hands on any benchmark performance numbers. Thanks!&lt;/P&gt;</description>
    <pubDate>Thu, 21 Jul 2016 10:38:07 GMT</pubDate>
    <dc:creator>srinivasan_h1</dc:creator>
    <dc:date>2016-07-21T10:38:07Z</dc:date>
    <item>
      <title>WebHDFS Performance</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138782#M35329</link>
      <description>&lt;P&gt;Could anyone share the performance differences between WebHDFS and native Java clients? We are creating a web service endpoint to ingest attachments into HDFS. The files are typically in the &amp;lt;10 MB range.&lt;/P&gt;&lt;P&gt;I found these two posts:&lt;/P&gt;&lt;P&gt;&lt;A href="http://randomlydistributed.blogspot.com/2012/01/webhdfs-performance.html" target="_blank"&gt;http://randomlydistributed.blogspot.com/2012/01/webhdfs-performance.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://wittykeegan.blogspot.com/2013/10/webhdfs-vs-native-performance.html" target="_blank"&gt;http://wittykeegan.blogspot.com/2013/10/webhdfs-vs-native-performance.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;My test results nearly match those in the second link. However, I wanted to see whether any benchmark studies exist.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jul 2016 00:18:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138782#M35329</guid>
      <dc:creator>srinivasan_h1</dc:creator>
      <dc:date>2016-07-21T00:18:32Z</dc:date>
    </item>
    <item>
      <title>Re: WebHDFS Performance</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138783#M35330</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/11993/srinivasanh1.html"&gt;Srinivasan Hariharan&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I am sure you are already aware that WebHDFS is based on HTTP operations such as GET, PUT, POST and DELETE. That brings performance implications, because requests go through an HTTP server, Jetty. The FileSystem Shell, by contrast, is a Java application that uses the Java FileSystem class to provide file system operations, and it creates an RPC connection for those operations.&lt;/P&gt;&lt;P&gt;Here are some numbers, though this is not a serious benchmarking study. The results for &amp;lt;10 MB files do not surprise me; they are what I would expect to see. You can run the test yourself. If your files are in the &amp;lt;10 MB range, you should be fine. In my past experience, performance became a concern for large files, noticeably from 1 GB upward.&lt;/P&gt;&lt;P&gt;&lt;A href="http://wittykeegan.blogspot.com/2013/10/webhdfs-vs-native-performance.html" target="_blank"&gt;http://wittykeegan.blogspot.com/2013/10/webhdfs-vs-native-performance.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I'm checking the Hortonworks docs for anything newer and will post the link if I find it.&lt;/P&gt;&lt;P&gt;If this is a reasonable response, please vote for it or accept it as the best answer.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jul 2016 02:01:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138783#M35330</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-07-21T02:01:55Z</dc:date>
    </item>
    <item>
      <title>Re: WebHDFS Performance</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138784#M35331</link>
      <description>&lt;P&gt;I will give you a qualitative answer: Ambari (UI) uses WebHDFS, and it is designed for scale and performance (vs. HttpFS). In the future, we will also look into enabling WebHDFS to seamlessly handle NameNode failover scenarios, so that apps that depend on WebHDFS do not have to keep track.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jul 2016 04:25:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138784#M35331</guid>
      <dc:creator>sburagohain</dc:creator>
      <dc:date>2016-07-21T04:25:53Z</dc:date>
    </item>
    <item>
      <title>Re: WebHDFS Performance</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138785#M35332</link>
      <description>&lt;P&gt;I agree with your answer; you are spot on about the HTTP server performance implications. WebHDFS is really tempting, given that we can expose HDFS in a browser with minimal coding and integrate with non-Java clients as well. Do share if you get your hands on any benchmark performance numbers. Thanks!&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jul 2016 10:38:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138785#M35332</guid>
      <dc:creator>srinivasan_h1</dc:creator>
      <dc:date>2016-07-21T10:38:07Z</dc:date>
    </item>
    <item>
      <title>Re: WebHDFS Performance</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138786#M35333</link>
      <description>&lt;P&gt;Thanks for sharing your thoughts and the direction of evolution. Agreed, WebHDFS performs much better than HttpFS.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jul 2016 10:41:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/WebHDFS-Performance/m-p/138786#M35333</guid>
      <dc:creator>srinivasan_h1</dc:creator>
      <dc:date>2016-07-21T10:41:48Z</dc:date>
    </item>
  </channel>
</rss>

