Member since: 02-16-2016
Posts: 176
Kudos Received: 197
Solutions: 17

My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 3073 | 11-18-2016 08:48 PM |
| | 5472 | 08-23-2016 04:13 PM |
| | 1501 | 03-26-2016 12:01 PM |
| | 1432 | 03-15-2016 12:12 AM |
| | 15353 | 03-14-2016 10:54 PM |
11-18-2016
08:48 PM
I finally figured it out. The NiFi node was unable to talk to the cluster because of an entry in the hosts file: it was resolving to the partial hostname instead of the fully qualified domain name.
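For anyone hitting the same issue, the fix is to make sure the node's address resolves to the FQDN. A sketch of a working hosts entry (IP and hostnames are hypothetical examples):

```
# /etc/hosts -- list the FQDN before the short alias so lookups
# return the fully qualified name (values below are examples)
10.0.0.11   nifi-node1.example.com   nifi-node1
```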
11-17-2016
08:44 PM
I have an HDF 2.0 node that was running in standalone mode. When I convert it to cluster mode by changing the nifi.properties file and restarting HDF 2.0, I get the following error message: "Cluster is still in the process of voting on the appropriate Data Flow." I have removed the existing flow.xml.gz from the conf directory and am working with an empty flow file.
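For context, the standalone-to-cluster conversion touches the clustering section of nifi.properties. A minimal sketch of the relevant settings on an HDF 2.0 (NiFi 1.x) node, with hypothetical hosts and ports:

```
# nifi.properties -- clustering settings (host/port values are examples)
nifi.cluster.is.node=true
nifi.cluster.node.address=nifi-node1.example.com
nifi.cluster.node.protocol.port=9088
nifi.zookeeper.connect.string=zk1.example.com:2181,zk2.example.com:2181,zk3.example.com:2181
# Flow election: how long / how many nodes to wait for before
# electing a flow -- relevant to the "voting" message above
nifi.cluster.flow.election.max.wait.time=5 mins
nifi.cluster.flow.election.max.candidates=1
```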
Labels:
- Apache NiFi
- Cloudera DataFlow (CDF)
08-23-2016
07:00 PM
1 Kudo
@Adi Jabkowsky You are trying to convert Avro data directly to text. You need to first convert the Avro to a text format and extract the values before using the ReplaceText processor. You can use a processor pipeline like this:
1. ConvertAvroToJSON to convert the Avro output to JSON.
2. SplitJson to handle multiple records in the output.
3. EvaluateJsonPath to pull individual columns from the output into flowfile attributes.
4. ReplaceText to generate an INSERT statement, followed by the PutHiveQL processor.
The other option is to generate the output in CSV format and then use a regular expression to read the column values.
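As a concrete sketch of steps 3 and 4, EvaluateJsonPath writes attributes that ReplaceText can reference via NiFi Expression Language. Table and column names here are hypothetical:

```
# EvaluateJsonPath (Destination = flowfile-attribute), example dynamic properties:
#   col1 = $.col1
#   col2 = $.col2
#
# ReplaceText (Replacement Strategy = Always Replace), Replacement Value:
#   INSERT INTO my_table VALUES ('${col1}', '${col2}')
```

The resulting flowfile content is a complete HiveQL statement that PutHiveQL can execute as-is.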
08-23-2016
04:13 PM
1 Kudo
@Jay See I could get it working without the SSLContext Service. Please see attached. Can you post a screenshot of your InvokeHTTP processor configuration?
07-19-2016
02:32 PM
1 Kudo
@BigDataRocks @mclark You will need to plan your NiFi production cluster based on your volume requirements.
- If you are just looking to transfer a large volume of data from a source to a sink, ensure you have enough space available for the content repository. Also ensure that the content repository is set up on a separate disk from the flowfile and provenance repositories.
- From a productionizing perspective, it is important to build error handling into your flows, so your teams are notified of errors and the errors are written to the logs.
- It is probably better to run multiple instances of NiFi for data from multiple sources, because NiFi does not currently offer per-flow security. In the current security model, a flow administrator has access to all flows running on an instance; running multiple instances lets you control security for each flow separately.
- If you decide to use one instance of NiFi, you can use Process Groups to organize your dataflows.
- You should also set up monitoring reporting tasks for disk usage and memory that warn you at appropriate thresholds.
- For dataflows with significant processing requirements, you will need a cluster to distribute load across nodes. You can also increase the number of concurrent tasks for any processor that requires more processing power.
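The separate-disk recommendation above maps directly to the repository path properties in nifi.properties. A sketch with hypothetical mount points:

```
# nifi.properties -- put each repository on its own disk (paths are examples)
nifi.flowfile.repository.directory=/disk1/nifi/flowfile_repository
nifi.content.repository.directory.default=/disk2/nifi/content_repository
nifi.provenance.repository.directory.default=/disk3/nifi/provenance_repository
```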
07-19-2016
01:51 PM
@Sreekanth Munigati , As @Bryan Bende mentioned, there is no direct way of manipulating the Avro data, but in your case you can try modifying the SQL executed by the ExecuteSQL processor to add the additional column in the SQL itself.
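For example, the query configured in ExecuteSQL's SQL select query property could project the extra column directly, so it shows up as a field in the generated Avro records (table and column names below are hypothetical):

```
-- Add a constant or derived column in the query itself
SELECT t.*, 'web_orders' AS source_system
FROM orders t
```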
06-22-2016
02:34 AM
Thank you @Bryan Bende for creating the JIRA for this issue. Are there any workarounds until it gets resolved in a new release? The only thing I can think of is creating a Hive table with the appropriate column types, then writing a SELECT query to cast to the correct data types and insert into the new table. But that requires a post-process step and breaks real-time ingestion.
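A sketch of that workaround in HiveQL, with hypothetical table and column names:

```
-- Staging table holds the string-typed Avro output; the typed table
-- is populated by casting in a separate post-process step
CREATE TABLE orders_typed (id BIGINT, amount DECIMAL(10,2));

INSERT INTO TABLE orders_typed
SELECT CAST(id AS BIGINT), CAST(amount AS DECIMAL(10,2))
FROM orders_staging;
```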
06-17-2016
04:03 PM
@Bryan Bende Thanks Bryan. I double-checked all the details. My Oracle table has NUMBER columns. Here are the settings I am using to establish the DBCPConnectionPool, but in my Avro data all the NUMBER columns come through as strings. Is there any way to see the details of how NiFi generates the Avro field formats? oracleconnection.jpg
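For reference (the attached screenshot is not reproduced here), the DBCPConnectionPool settings in question were of roughly this shape; the host, SID, and jar path below are hypothetical:

```
Database Connection URL:    jdbc:oracle:thin:@db1.example.com:1521:ORCL
Database Driver Class Name: oracle.jdbc.OracleDriver
# plus the path to the Oracle JDBC driver jar, e.g. /opt/oracle/ojdbc7.jar
```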
06-16-2016
07:09 PM
The ExecuteSQL processor currently generates data in Avro format. When fetching data from Oracle, the generated Avro converts every data type to a string. Is there a way to ensure that it retains the data type of the original column, or an equivalent type?
Labels:
- Apache NiFi
04-25-2016
08:33 PM
15 Kudos
Easily convert any XML document to JSON format using the TransformXML processor. Save the following stylesheet in a file and configure a TransformXML processor to use it; it will convert any XML document to JSON.
<?xml version="1.0"?>
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:output method="text"/>
<xsl:template match="/">{
<xsl:apply-templates select="*"/>}
</xsl:template>
<!-- Object or Element Property-->
<xsl:template match="*">
"<xsl:value-of select="name()"/>" : <xsl:call-template name="Properties"/>
</xsl:template>
<!-- Array Element -->
<xsl:template match="*" mode="ArrayElement">
<xsl:call-template name="Properties"/>
</xsl:template>
<!-- Object Properties -->
<xsl:template name="Properties">
<xsl:variable name="childName" select="name(*[1])"/>
<xsl:choose>
<xsl:when test="not(*|@*)">"<xsl:value-of select="."/>"</xsl:when>
<xsl:when test="count(*[name()=$childName]) > 1">{ "<xsl:value-of select="$childName"/>" :[<xsl:apply-templates select="*"
mode="ArrayElement"/>] }</xsl:when>
<xsl:otherwise>{
<xsl:apply-templates select="@*"/>
<xsl:apply-templates select="*"/>
}</xsl:otherwise>
</xsl:choose>
<xsl:if test="following-sibling::*">,</xsl:if>
</xsl:template>
<!-- Attribute Property -->
<xsl:template match="@*">"<xsl:value-of select="name()"/>" : "<xsl:value-of select="."/>",
</xsl:template>
</xsl:stylesheet>

That's it!
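As a usage sketch, feeding a small hypothetical document through the stylesheet:

```
<!-- Example input flowfile content -->
<person>
  <name>John</name>
  <age>30</age>
</person>
```

produces, whitespace aside, `{ "person" : { "name" : "John", "age" : "30" } }`. Note that the templates quote every leaf value, so numbers come through as JSON strings.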