Member since: 07-17-2019
Posts: 738
Kudos Received: 433
Solutions: 111

My Accepted Solutions

| Title | Views | Posted |
|---|---|---|
| | 3473 | 08-06-2019 07:09 PM |
| | 3671 | 07-19-2019 01:57 PM |
| | 5195 | 02-25-2019 04:47 PM |
| | 4666 | 10-11-2018 02:47 PM |
| | 1768 | 09-26-2018 02:49 PM |
10-14-2016
04:00 PM
With sufficient backpressure, I would imagine that NiFi will eventually drop data. Based on your screenshot, you are writing no data via this processor (Out is 0 bytes). I would recommend you verify that Phoenix and HBase are healthy before bringing NiFi into the picture.
10-14-2016
03:54 PM
Oh, that's different from what you said earlier: both values come through as "false" (rather than the values being inverted). Maybe it is a bug with 2.4.0.0. I'll have to see if I can use that exact version and reproduce the issue.
10-13-2016
05:30 PM
"In the csv file that value is represented as 1/0 and when it goes into
phoenix it should go as true/false. but it is going as false/true" > create table booltest(pk varchar not null primary key, truth boolean);
> upsert into booltest values('true', 1);
Error: ERROR 203 (22005): Type mismatch. INTEGER cannot be coerced to BOOLEAN (state=22005,code=203) It seems like UPSERTS will not coerce integers into a boolean. I'm curious how the CSV tool is doing this. psql.py seems to do this fine: $ echo "true,1" > ~/booleans.csv
$ echo "false,0" >> ~/booleans.csv
$ /usr/local/lib/phoenix/bin/psql.py -t BOOLTEST localhost:2181:/hbase-1.2 ~/booleans.csv
$ sqlline.py ...
> 0: jdbc:phoenix:localhost:2181:/hbase-1.2> select * from booltest;
+--------+--------+
| PK | TRUTH |
+--------+--------+
| false | false |
| true | true |
+--------+--------+
Similarly, using the CsvBulkLoadTool shows the same for me: yarn jar /usr/local/lib/phoenix/phoenix-4.9.0-HBase-1.2-SNAPSHOT-client.jar org.apache.phoenix.mapreduce.CsvBulkLoadTool -Dmapred.map.child.java.opts="-Xmx1G" --table BOOLTEST --input booleans.csv -z localhost:2181:/hbase-1.2 Maybe do a sanity check on your processing?
10-13-2016
05:08 PM
1 Kudo
"phoenix putsql is unabel toinsert it seems" Are you referring to the INSERT SQL command? Phoenix does not expose INSERT, it exposes UPSERT which matches the actual semantics of how data is written. "Nifi - putsql for phoenix upsert very slow , getting records 1000/sec" The PutSQL processor seems to be written in a way that would allow for optimal performance with Phoenix. Care to share your PutSQL processor configuration? Have you tried to write a simple JDBC application to verify the "theoretical" performance of your system? One thing I can see is that if PutSQL is getting triggered very frequently, you will be making a large number of RPCs and not batching updates into HBase. How many FlowFiles does PutSQL process per invocation?
10-12-2016
04:22 PM
Check for:
1. JVM GC pauses. If the JVM is doing a stop-the-world garbage collection, it will cause the server to become disconnected from ZK, and likely lose its session. Read the lines in the HBase service log prior to this error.
2. Errors in the ZooKeeper log about maxClientCnxns (https://community.hortonworks.com/articles/51191/understanding-apache-zookeeper-connection-rate-lim.html).
3. Ensure operating system swappiness is reduced from the default (often 30 or 60) to a value of 0. You can inspect this via `cat /proc/sys/vm/swappiness`.
10-05-2016
02:57 PM
1 Kudo
Please refer to the documentation on ports for HBase: http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/bk_reference/content/hbase-ports.html
10-04-2016
05:34 PM
1 Kudo
Some extra thoughts on top of Rajeshbabu's reply:
1. Increase the heap size of the Phoenix Query Server via the PHOENIX_QUERYSERVER_OPTS variable in hbase-env.sh.
2. For writing data, make sure the addBatch() and executeBatch() API calls are used for the best performance; see the sketch below.
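A minimal sketch of point 2 through the Query Server's thin JDBC client (the host, port, table name, and batch size are assumptions; adjust them for your cluster):

// Sketch of batched writes through the Phoenix Query Server (thin) JDBC driver.
// Host, port, table name, and batch size are illustrative assumptions.
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;

public class ThinClientBatchWrite {
  public static void main(String[] args) throws Exception {
    String url = "jdbc:phoenix:thin:url=http://localhost:8765;serialization=PROTOBUF";
    try (Connection conn = DriverManager.getConnection(url)) {
      conn.setAutoCommit(false);
      try (PreparedStatement ps = conn.prepareStatement("UPSERT INTO MY_TABLE VALUES (?, ?)")) {
        for (int i = 0; i < 10_000; i++) {
          ps.setString(1, "key-" + i);
          ps.setString(2, "value-" + i);
          ps.addBatch();                // queue the UPSERT
          if ((i + 1) % 500 == 0) {
            ps.executeBatch();          // one round trip to the Query Server per batch
            conn.commit();              // flush buffered mutations into HBase
          }
        }
        ps.executeBatch();
        conn.commit();
      }
    }
  }
}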
09-28-2016
11:03 PM
I would not recommend anything other than Phoenix 🙂
09-28-2016
10:10 PM
1 Kudo
First off: you should not use the HBase APIs to write data into Phoenix tables. Use the Phoenix API. The reason you should not do this is exactly why you are not seeing the data you've written: using the Phoenix API (the UPSERT command) is what triggers the update to the secondary index table. When you add data via the HBase APIs, Phoenix has no idea that you did this and thus cannot ensure referential integrity with your secondary indexes. If you want to use Phoenix, use the Phoenix APIs to read and write data.
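A small sketch of the difference (the table, index, and column names are placeholders): a row written with UPSERT through the Phoenix JDBC driver is visible in both the data table and its secondary index, whereas a row written with a raw HBase Put against the underlying table never reaches the index.

// Sketch: writes through Phoenix keep secondary indexes in sync; raw HBase writes bypass them.
// Table, index, and JDBC URL below are illustrative placeholders.
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class PhoenixIndexedWrite {
  public static void main(String[] args) throws Exception {
    try (Connection conn = DriverManager.getConnection("jdbc:phoenix:localhost:2181:/hbase-1.2");
         Statement stmt = conn.createStatement()) {
      stmt.execute("CREATE TABLE IF NOT EXISTS USERS (ID VARCHAR NOT NULL PRIMARY KEY, EMAIL VARCHAR)");
      stmt.execute("CREATE INDEX IF NOT EXISTS USERS_EMAIL_IDX ON USERS (EMAIL)");

      // Written through Phoenix: this UPSERT updates USERS and USERS_EMAIL_IDX together.
      stmt.execute("UPSERT INTO USERS VALUES ('u1', 'u1@example.com')");
      conn.commit();

      // A row added with the HBase client API (e.g. Table.put against the USERS HBase table)
      // would bypass this path entirely, so USERS_EMAIL_IDX would never see it and queries
      // answered from the index would not return the row.
    }
  }
}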