Hadoop FSCK Optimization for Large System

I am working on a 1.6 petabyte system, and running FSCK on the entire system will be resource intensive.

How frequently should I run the FSCK command?

Should I limit it to specific paths? If so, what criteria should I use to choose them?