It appears that Impala is counting 3 characters (byte values 226, 128, 147) for what is displayed as 1 character (150) in the source system.
The query to get the results is:
SELECT LENGTH(investigateNode) AS datavalue
FROM vw_closure WHERE investigateNode LIKE 'Malfunction%'
Hive returns 31 for the length. Impala returns 33 for the length.
The HDFS files are stored in Parquet format.
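The numbers above look consistent with a bytes-versus-characters difference. The values 226, 128, 147 are the UTF-8 encoding of an en dash (U+2013), and 150 is the same character in Windows-1252. Below is a minimal Python sketch of that arithmetic. The `sample` string is made up for illustration (the real `investigateNode` value is not shown here), and the idea that Impala counts bytes while Hive counts characters is an assumption this sketch only illustrates, not something it proves.

```python
# Sketch: how one en dash can make a byte count exceed a character count by 2.

EN_DASH = "\u2013"  # the character that displays as a single dash

# UTF-8 stores the en dash as three bytes; Windows-1252 stores it as one.
utf8_bytes = list(EN_DASH.encode("utf-8"))
cp1252_bytes = list(EN_DASH.encode("cp1252"))

# Hypothetical sample value: 30 ASCII characters plus one en dash.
# This is NOT the real column value, only a stand-in of the same length.
sample = "Malfunction " + EN_DASH + " " + "x" * 17

char_count = len(sample)                  # counts Unicode characters
byte_count = len(sample.encode("utf-8"))  # counts UTF-8 bytes

print("UTF-8 bytes of en dash:  ", utf8_bytes)
print("cp1252 bytes of en dash: ", cp1252_bytes)
print("characters:", char_count, "bytes:", byte_count)
```

If that is what is happening, the 2 extra units Impala reports would be the second and third bytes of the en dash.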