Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

PIG UDFs Python - Gurantee that String have the format 'yyyy-MM-dd hh:ss:mm'

Solved Go to solution

PIG UDFs Python - Gurantee that String have the format 'yyyy-MM-dd hh:ss:mm'

Explorer

Hi experts,

I've the following part of script in Apache Pig:

....

A = foreach Source_Data generate (int) ID,

ToString( ToDate((long) Time), 'yyyy-MM-dd hh:ss:mm') as date,

(int) Code;

Store A into '.../newfile'; ...

Now I want to create a new Script using Python UDF to guarantee that in my newfile on column Date (#1) I only have String in the format 'yyyy-MM-dd hh:ss:mm'.

Is possible to do that?

Many thanks!

1 ACCEPTED SOLUTION

Accepted Solutions
Highlighted

Re: PIG UDFs Python - Gurantee that String have the format 'yyyy-MM-dd hh:ss:mm'

Mentor

you can write a new script using regex to test this column and throw away bad fields or do it all in one step where you pass the date field to UDF and check for formatting

View solution in original post

1 REPLY 1
Highlighted

Re: PIG UDFs Python - Gurantee that String have the format 'yyyy-MM-dd hh:ss:mm'

Mentor

you can write a new script using regex to test this column and throw away bad fields or do it all in one step where you pass the date field to UDF and check for formatting

View solution in original post

Don't have an account?
Coming from Hortonworks? Activate your account here