Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Process csv files where delimited is unknown?

Process csv files where delimited is unknown?

New Contributor

Hi,

To convert csv to dataframe we must be aware about the delimiter character at coding time, however in my case we are not aware about the same. The source file will be delimited by some character, our code should be able to infer the delimiter into the file and convert the file into dataframe.

As of now i've written a java snippet to check the delimiter character first and tried to read the file.

Do we have any predefined function to satisfy my need?

Thanks,

R

1 REPLY 1

Re: Process csv files where delimited is unknown?

@RAUI,

I'm not aware of any such function to derive the delimiters. I found this link which may help you

https://www.computer.org/csdl/proceedings/hpcc/2016/4297/00/07828554.pdf

.

-Aditya

Don't have an account?
Coming from Hortonworks? Activate your account here