<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Slow ingest on 64 core machine in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414156#M254737</link>
    <description>&lt;P&gt;A few things stand out from the numbers you've shared.&lt;/P&gt;&lt;P&gt;On a 64-core machine, ingesting ~11 million rows in 17 minutes (around 10–11K rows/sec) is significantly below what I'd expect if the workload were effectively parallelized. Before focusing on CPU count, I'd investigate where the bottleneck actually is.&lt;/P&gt;&lt;P&gt;Some areas worth checking:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Storage throughput&lt;/STRONG&gt;: Is the data being written to local SSDs, network-attached storage, or slower disks? Ingest workloads are often I/O-bound rather than CPU-bound.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;File size and partitioning strategy&lt;/STRONG&gt;: Large numbers of small files can severely impact write performance.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Compression settings&lt;/STRONG&gt;: Certain codecs provide better compression but consume more CPU during ingest.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Thread parallelism&lt;/STRONG&gt;: Verify that the ingestion framework is actually utilizing all available cores rather than being limited by a small worker pool.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Memory pressure and GC activity&lt;/STRONG&gt;: If the JVM is spending significant time in garbage collection, additional CPU cores won't help much.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Network throughput&lt;/STRONG&gt;: If data is being pulled from a remote source, the bottleneck may be upstream rather than on the ingest node itself.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;I'd also recommend collecting:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;CPU utilization during ingest&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Disk IOPS and throughput metrics&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Memory usage and GC logs&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Number of concurrent ingest tasks&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Average file size being generated&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;One quick diagnostic is to look at overall CPU utilization. If the machine is only using 10–20% of available CPU during ingest, then the workload is likely blocked on I/O, synchronization, network transfers, or application-level limits rather than raw compute capacity.&lt;/P&gt;&lt;P&gt;Can you share:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;Which ingestion tool/framework you're using?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;The storage type (SSD, NVMe&lt;A href="https://iyacinetvapk.com/" target="_self"&gt;,&lt;/A&gt; HDD, cloud volume, etc.)?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Average CPU utilization during the 17-minute ingest?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Whether the target table is partitioned and, if so, by what key?&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Those details would make it much easier to determine whether the bottleneck is CPU, disk, network, or configuration-related.&lt;/P&gt;</description>
    <pubDate>Tue, 02 Jun 2026 04:48:52 GMT</pubDate>
    <dc:creator>robert12231</dc:creator>
    <dc:date>2026-06-02T04:48:52Z</dc:date>
    <item>
      <title>Slow ingest on 64 core machine</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414124#M254650</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;We are having problems with a single instance of nifi 2.5.0 on a physical windows 11 machine with 64 CPU cores, 64 GB RAM and NVME 12 TB ( 4 x 3 TB drives ) striped D : drive set ingesting slowly.&amp;nbsp; C: drive is for windows.&lt;/P&gt;&lt;P&gt;I was hoping someone could maybe give a sequential list of things to check so we at least are approaching this the right way.&lt;/P&gt;&lt;P&gt;The people who set up the machine have put the ingest directory ( ingest files via Getfiles ), and the nifi instance ( content repo and everything else ) on the same single D ; drive striped disk set.&amp;nbsp;&amp;nbsp;&lt;SPAN&gt;Being a striped set I wouldn't have thought speed would be an issue even with both ingest and content repo on the same striped set.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;So one thing I wondered was can we force nifi to use all cores, or do we need keep some cores&amp;nbsp; ( maybe 4 ? ) for windows to use? And if we did use say 60 cores, what is the best way to do this -&amp;nbsp; and is it set at a nifi processor level,&amp;nbsp; or do you need to force Nifi to use all cores at Nifi boot up? The Nifi JVM has 8 GB of memory allocated and I haven't seen it go much past 50% utilization, which hopefully keeps garbage collection in check.&lt;/P&gt;&lt;P&gt;We do ingest individual 200 GB size files ( but also smaller files ) , so would it be worth re-formatting the stripe set for much larger block sizes to stop additional seeks? I'm guessing they used standard 4 KB blocks, I was thinking of maybe using 1 MB blocks?&lt;/P&gt;&lt;P&gt;Standard windows Antivirius is running on this machine and also our Dev machine, but in the Dev&amp;nbsp;environment 8 core windows 11 machine, ingest is fast as we would expect.&lt;/P&gt;&lt;P&gt;Would welcome thoughts, thanks in advance....&lt;/P&gt;</description>
      <pubDate>Thu, 28 May 2026 05:34:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414124#M254650</guid>
      <dc:creator>zzzz77</dc:creator>
      <dc:date>2026-05-28T05:34:51Z</dc:date>
    </item>
    <item>
      <title>Re: Slow ingest on 64 core machine</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414127#M254653</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/136792"&gt;@zzzz77&lt;/a&gt;&amp;nbsp; In your bigger machine env are you adjusting the flow to tune peformance?&amp;nbsp; &amp;nbsp;E.G.&amp;nbsp; Do you increase concurrency, adjust active threads pool, etc to make sure that you are getting the most possible use of the cores?&amp;nbsp; &amp;nbsp; This is where you should start.&amp;nbsp; &amp;nbsp; You should be able to get a lot more active threads going in the larger env before needing to worry about disk contention.&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;You may want to bump up the ram min/max, but i would do this methodically.&amp;nbsp; If its 8, go 16 and see the results, then 32 and compare all 3.&amp;nbsp; &amp;nbsp;32 should be as high as you need to go, but I have seen higher.&amp;nbsp; NiFi does a good job of memory management above the min/max.&amp;nbsp; &amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Ideally you would want nifi disks mounted separately (see docs) but since you already have a baseline in dev likely without dedicated disks,&amp;nbsp; I suspect you will see improvements using all 32+ cores vs 8 even with "slow" disks...&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;This is in k8s/nifi, but you will see how to crank up the CPU:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://stevenmatison.com/blog/Max-CPU-with-NiFi-on-Minikube/" target="_blank" rel="noopener"&gt;https://stevenmatison.com/blog/Max-CPU-with-NiFi-on-Minikube/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 28 May 2026 13:00:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414127#M254653</guid>
      <dc:creator>steven-matison</dc:creator>
      <dc:date>2026-05-28T13:00:05Z</dc:date>
    </item>
    <item>
      <title>Re: Slow ingest on 64 core machine</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414135#M254656</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Thank you very much for this info, we will try it and let you know how it goes.&lt;/P&gt;&lt;P&gt;One of the other engineers was thinking of trying to start up nifi with a switch using a string that had&amp;nbsp; "FFFFFFFFFF" in it ( I only saw it briefly )&amp;nbsp; I think with the idea of forcing nifi to start with all CPUs initially.&lt;/P&gt;&lt;P&gt;I think he was looking at also changing CPU affinity on the workstation via windows task manager but don't know where he got to with it.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hypothetically, if you had say a setting of&amp;nbsp;&amp;nbsp;&lt;SPAN&gt;&lt;STRONG&gt;nifi.flowcontroller.max.timer.driven.thread.count=36&lt;/STRONG&gt;&amp;nbsp; &amp;nbsp;and you had 10 processors on your canvas,&amp;nbsp; and each processor was set to run 4 concurrent tasks. In my mind this means there is either complete starvation of 1 processor downstream with zero threads and cause a bottleneck. or would Nifi reduce thread allocation to all processors to ensure it had max output like say allocating 3 threads per processor and leave 6 in reserve?&amp;nbsp; In this case would it be better to set &amp;nbsp;&lt;STRONG&gt;nifi.flowcontroller.max.timer.driven.thread.cou&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;STRONG&gt;nt=48&lt;/STRONG&gt; to ensure there is no thread starvation and have a safety window of 48 - (10x4) = 8 ?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;If you have a safety window of 8 threads, would these be allocated to say garbage collection and other activities or is that controlled by the threads in use by JVM ? Does Nifi tell the JVM it needs to run with minimum 48 threads since &lt;STRONG&gt;nifi.flowcontroller.max.timer.driven.thread.count=48&lt;/STRONG&gt; has been set&amp;nbsp; ? How does that JVM/nifi interplay work please?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I'm trying to understand the right way to get tuning right to avoid resource starvation.&amp;nbsp; &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Cheers&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 29 May 2026 01:11:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414135#M254656</guid>
      <dc:creator>zzzz77</dc:creator>
      <dc:date>2026-05-29T01:11:10Z</dc:date>
    </item>
    <item>
      <title>Re: Slow ingest on 64 core machine</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414142#M254661</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/136792"&gt;@zzzz77&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;I would say you start with Step 1 Defender exclusions it is the single most impactful change, requires no downtime, and takes under 5 minutes. Monitor throughput immediately after applying it before doing anything else. In most cases this alone resolves 80% of the performance gap.&lt;BR /&gt;&lt;STRONG&gt;SYMPTOMS:&lt;/STRONG&gt;&lt;BR /&gt;✗ Slow ingest on production (64 core, 64GB RAM)&lt;BR /&gt;✓ Fast ingest on dev (8 core machine)&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;LIKELY CAUSES IN PRIORITY ORDER:&lt;/STRONG&gt;&lt;BR /&gt;&lt;EM&gt;&lt;STRONG&gt;STEP 2 — WINDOWS POWER PLAN&lt;BR /&gt;1. Windows Defender scanning 200GB files in real-time ← MOST LIKELY&lt;/STRONG&gt;&lt;/EM&gt;&lt;BR /&gt;2. NiFi thread pool under-configured for 64 cores&lt;BR /&gt;3. JVM heap too small (8GB for 200GB file processing)&lt;BR /&gt;4. Ingest dir and content repo on same I/O path&lt;BR /&gt;5. 4KB block size misaligned for large file processing&lt;BR /&gt;6. NiFi content repository not spread across drives&lt;BR /&gt;7. Java GC pressure under large file load&lt;BR /&gt;8. Windows power plan not set to High Performance&lt;BR /&gt;9. GetFile processor running single-threaded&lt;BR /&gt;10. NiFi provenance repository competing for I/O&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class="font-claude-response-body break-words whitespace-normal leading-[1.7]"&gt;&lt;STRONG&gt;STEP 1 WINDOWS DEFENDER (DO THIS FIRST)&lt;/STRONG&gt;&lt;/P&gt;&lt;P class="font-claude-response-body break-words whitespace-normal leading-[1.7]"&gt;This is almost certainly your primary bottleneck. Defender scans every byte of every 200GB file as NiFi reads it.&lt;/P&gt;&lt;LI-SPOILER&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;# Run PowerShell as Administrator&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;# Check current exclusions&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Get-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;|&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;Select-Object&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExpandProperty ExclusionPath&lt;BR /&gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;# Add NiFi directories to Defender exclusions&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionPath &lt;SPAN class="token token"&gt;"D:\nifi"&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionPath &lt;SPAN class="token token"&gt;"D:\nifi\content_repository"&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionPath &lt;SPAN class="token token"&gt;"D:\nifi\flowfile_repository"&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionPath &lt;SPAN class="token token"&gt;"D:\nifi\database_repository"&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionPath &lt;SPAN class="token token"&gt;"D:\nifi\provenance_repository"&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionPath &lt;SPAN class="token token"&gt;"D:\ingest"&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;# Also exclude by process&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionProcess &lt;SPAN class="token token"&gt;"java.exe"&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionProcess &lt;SPAN class="token token"&gt;"nifi.bat"&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Add-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExclusionProcess &lt;SPAN class="token token"&gt;"run-nifi.bat"&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;# Verify exclusions applied&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Get-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;|&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;Select-Object&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;ExpandProperty ExclusionPath&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;# Check if Defender is actively scanning (run during ingest)&lt;/SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="token token"&gt;Get-MpComputerStatus&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;|&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;Select-Object&lt;/SPAN&gt; AMRunningMode&lt;SPAN class="token token"&gt;,&lt;/SPAN&gt; RealTimeProtectionEnabled&lt;/SPAN&gt;&lt;/LI-SPOILER&gt;&lt;P class="font-claude-response-body break-words whitespace-normal leading-[1.7]"&gt;&lt;STRONG&gt;How to verify Defender is the issue before making changes:&lt;/STRONG&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;LI-SPOILER&gt;&lt;SPAN class="token token"&gt;# Monitor Defender activity during ingest&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;# Open Resource Monitor → CPU tab&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;# Look for MsMpEng.exe (Antimalware Service Executable)&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;# If it's consuming 10-40% CPU during ingest → Defender is the culprit&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN class="token token"&gt;&lt;BR /&gt;#Quick test: temporarily disable real-time protection&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;# (coordinate with your security team first)&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;Set-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;DisableRealtimeMonitoring &lt;SPAN class="token token"&gt;$true&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;&lt;BR /&gt;# Run your ingest test&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;# Then re-enable&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;Set-MpPreference&lt;/SPAN&gt; &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;DisableRealtimeMonitoring &lt;SPAN class="token token"&gt;$false&lt;/SPAN&gt;&lt;/LI-SPOILER&gt;&lt;P class="font-claude-response-body break-words whitespace-normal leading-[1.7]"&gt;&lt;STRONG&gt;STEP 2 WINDOWS POWER PLAN&lt;BR /&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;LI-SPOILER&gt;&lt;SPAN class="token token"&gt;# Check current power plan&lt;/SPAN&gt;&lt;BR /&gt;powercfg &lt;SPAN class="token token"&gt;/&lt;/SPAN&gt;getactivescheme&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN class="token token"&gt;# Set to High Performance&lt;BR /&gt;&lt;/SPAN&gt;powercfg &lt;SPAN class="token token"&gt;/&lt;/SPAN&gt;setactive 8c5e7fda-e8bf-4a96-9a85-a6e23a8c635c&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN class="token token"&gt;# Or create Ultimate Performance plan (Windows 10/11 Pro)&lt;BR /&gt;&lt;/SPAN&gt;powercfg &lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;duplicatescheme e9a42b02-d5df-448d-aa00-03f14749eb61&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN class="token token"&gt;# Then activate it from output GUID&lt;BR /&gt;&lt;/SPAN&gt;powercfg &lt;SPAN class="token token"&gt;/&lt;/SPAN&gt;setactive &amp;lt;GUID-&lt;SPAN class="token token"&gt;from&lt;/SPAN&gt;&lt;SPAN class="token token"&gt;-&lt;/SPAN&gt;above&amp;gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN class="token token"&gt;# Verify&lt;BR /&gt;&lt;/SPAN&gt;powercfg &lt;SPAN class="token token"&gt;/&lt;/SPAN&gt;getactivescheme&lt;BR /&gt;&lt;SPAN class="token token"&gt;# Expected: High Performance or Ultimate Performance&lt;/SPAN&gt;&lt;/LI-SPOILER&gt;&lt;P class="font-claude-response-body break-words whitespace-normal leading-[1.7]"&gt;&lt;STRONG&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/STRONG&gt;Happy hadooping&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Let me know how it works out&lt;/P&gt;</description>
      <pubDate>Sat, 30 May 2026 10:08:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414142#M254661</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2026-05-30T10:08:23Z</dc:date>
    </item>
    <item>
      <title>Re: Slow ingest on 64 core machine</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414153#M254735</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;Thank you very much for this info, we will apply and let you know how it goes.&lt;/P&gt;&lt;P&gt;Cheers&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 01 Jun 2026 22:21:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414153#M254735</guid>
      <dc:creator>zzzz77</dc:creator>
      <dc:date>2026-06-01T22:21:03Z</dc:date>
    </item>
    <item>
      <title>Re: Slow ingest on 64 core machine</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414156#M254737</link>
      <description>&lt;P&gt;A few things stand out from the numbers you've shared.&lt;/P&gt;&lt;P&gt;On a 64-core machine, ingesting ~11 million rows in 17 minutes (around 10–11K rows/sec) is significantly below what I'd expect if the workload were effectively parallelized. Before focusing on CPU count, I'd investigate where the bottleneck actually is.&lt;/P&gt;&lt;P&gt;Some areas worth checking:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Storage throughput&lt;/STRONG&gt;: Is the data being written to local SSDs, network-attached storage, or slower disks? Ingest workloads are often I/O-bound rather than CPU-bound.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;File size and partitioning strategy&lt;/STRONG&gt;: Large numbers of small files can severely impact write performance.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Compression settings&lt;/STRONG&gt;: Certain codecs provide better compression but consume more CPU during ingest.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Thread parallelism&lt;/STRONG&gt;: Verify that the ingestion framework is actually utilizing all available cores rather than being limited by a small worker pool.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Memory pressure and GC activity&lt;/STRONG&gt;: If the JVM is spending significant time in garbage collection, additional CPU cores won't help much.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Network throughput&lt;/STRONG&gt;: If data is being pulled from a remote source, the bottleneck may be upstream rather than on the ingest node itself.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;I'd also recommend collecting:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;CPU utilization during ingest&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Disk IOPS and throughput metrics&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Memory usage and GC logs&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Number of concurrent ingest tasks&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Average file size being generated&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;One quick diagnostic is to look at overall CPU utilization. If the machine is only using 10–20% of available CPU during ingest, then the workload is likely blocked on I/O, synchronization, network transfers, or application-level limits rather than raw compute capacity.&lt;/P&gt;&lt;P&gt;Can you share:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;Which ingestion tool/framework you're using?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;The storage type (SSD, NVMe&lt;A href="https://iyacinetvapk.com/" target="_self"&gt;,&lt;/A&gt; HDD, cloud volume, etc.)?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Average CPU utilization during the 17-minute ingest?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Whether the target table is partitioned and, if so, by what key?&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Those details would make it much easier to determine whether the bottleneck is CPU, disk, network, or configuration-related.&lt;/P&gt;</description>
      <pubDate>Tue, 02 Jun 2026 04:48:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-ingest-on-64-core-machine/m-p/414156#M254737</guid>
      <dc:creator>robert12231</dc:creator>
      <dc:date>2026-06-02T04:48:52Z</dc:date>
    </item>
  </channel>
</rss>

