Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Too high cardinality in kylin in HDP 2.3.4

avatar
Contributor

java.lang.IllegalArgumentException: Too high cardinality is not suitable for dictionary -- kylin_db.kylin_test.cust_id cardinality: 10305648 at org.apache.kylin.dict.DictionaryGenerator.buildDictionaryFromValueEnumerator(DictionaryGenerator.java:81) at org.apache.kylin.dict.DictionaryGenerator.buildDictionary(DictionaryGenerator.java:100) at org.apache.kylin.dict.DictionaryManager.buildDictionary(DictionaryManager.java:212) at org.apache.kylin.cube.CubeManager.buildDictionary(CubeManager.java:166) at org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:52) at org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:41) at org.apache.kylin.job.hadoop.dict.CreateDictionaryJob.run(CreateDictionaryJob.java:52) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:90) at org.apache.kylin.job.common.HadoopShellExecutable.doWork(HadoopShellExecutable.java:62) at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107) at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:51) at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107) at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:130) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) result code:2

1 ACCEPTED SOLUTION

avatar
Contributor

I have resolved this issue by changing the dictionary(Row key) value from N to Y in Cube Advanced settings option .

View solution in original post

1 REPLY 1

avatar
Contributor

I have resolved this issue by changing the dictionary(Row key) value from N to Y in Cube Advanced settings option .