Member since: 08-05-2016
Posts: 52
Kudos Received: 1
Solutions: 1

My Accepted Solutions

Title | Views | Posted |
---|---|---|
| 1062 | 07-21-2017 12:22 PM |
01-17-2021
12:41 PM
Hi @vjain, to configure the BucketCache, the description mentions two JVM properties. Which one should be used, please: HBASE_OPTS or HBASE_REGIONSERVER_OPTS? The documentation says: "In the hbase-env.sh file for each RegionServer, or in the hbase-env.sh file supplied to Ambari, set the -XX:MaxDirectMemorySize argument for HBASE_REGIONSERVER_OPTS to the amount of direct memory you wish to allocate to HBase. In the configuration for the example discussed above, the value would be 241664m. (-XX:MaxDirectMemorySize accepts a number followed by a unit indicator; m indicates megabytes.)" Yet the example line it shows is:
HBASE_OPTS="$HBASE_OPTS -XX:MaxDirectMemorySize=241664m"
Thanks,
Helmi KHALIFA
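PS: my current reading, as a sketch of what I would put in hbase-env.sh if HBASE_REGIONSERVER_OPTS is the right property (241664m is just the example figure from the documentation, not a recommendation):

# hbase-env.sh on each RegionServer (or the copy managed by Ambari)
# Hypothetical sizing: reuses the 241664m example value from the BucketCache docs
export HBASE_REGIONSERVER_OPTS="$HBASE_REGIONSERVER_OPTS -XX:MaxDirectMemorySize=241664m"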
01-11-2021
01:34 AM
Same issue here with the default configuration (once in 7 days). Any suggestions?
2021-01-11 03:27:19,028 INFO org.apache.hadoop.hbase.regionserver.HStore: Completed major compaction of 4 (all) file(s) in Info of CallLogs,\xE9\x7F\x9EJ\x10\x06L\xF7\x9A\xBF+\xCD\xA8\xB7\x9D\xBB,1608101963434.f0d1a5f4e816118ac167fe9730258102. into a54298ecc9594f9aa0cf6657a795bb54(size=6.0 G), total size for store is 6.0 G. This selection was in queue for 0sec, and took 56sec to execute.
2021-01-11 03:55:46,943 INFO org.apache.hadoop.hbase.regionserver.HStore: Completed major compaction of 5 (all) file(s) in Info of CallLogs,M}\xC4;,1609043919090.8c704f7385c3c3b07bc3aa4be1adc577. into 1d3f086c88bc4972a2e550dd093e9824(size=5.7 G), total size for store is 5.7 G. This selection was in queue for 0sec, and took 1mins, 29sec to execute.
2021-01-11 04:24:56,056 INFO org.apache.hadoop.hbase.regionserver.HStore: Completed major compaction of 4 (all) file(s) in Info of CallLogs,.\x81\xC6\x99e1K\x00\xAE\xB3@\x14g \x0Av,1608158031158.4998d8db979dfea2751136bf1767fb1b. into 4b1c6db0ed5d440d9adb58bf00109b57(size=5.6 G), total size for store is 5.6 G. This selection was in queue for 0sec, and took 1mins, 33sec to execute.
2021-01-11 05:36:34,562 INFO org.apache.hadoop.hbase.regionserver.HStore: Completed major compaction of 5 (all) file(s) in Info of CallLogs,\x19~A\x8F\xD3^G\xFB\xB5!.\x8C6\xCB\xC7t,1607673667302.6c08c1a2f5648c5f190bc378f628a838. into 71d8c8268fcc46d4ad1be29a6c6ce880(size=5.9 G), total size for store is 5.9 G. This selection was in queue for 0sec, and took 1mins, 42sec to execute.
2021-01-11 05:38:13,268 INFO org.apache.hadoop.hbase.regionserver.HStore: Completed major compaction of 4 (all) file(s) in Info of CallLogs,\xB9~\x8EX\xA5cH\xBE\x94g\xFF\xB76\xD6\x80/,1608131740376.47868b4d2475ef1fef1f23fea51b2e0f. into 21e524b8483047b7a6529ff20ea56602(size=5.9 G), total size for store is 5.9 G. This selection was in queue for 0sec, and took 1mins, 17sec to execute.
2021-01-11 07:11:21,277 INFO org.apache.hadoop.hbase.regionserver.HStore: Completed major compaction of 3 (all) file(s) in Info of CallLogs,\x9E\x7F\xD9\xADe\x81H\x8C\x8E\x80\x87)\xE0G\xD7\xFE,1608533709336.c152e8f046c8f75ca265c1fc9c742909. into f4dc8f12dcee4f118acc779ae000ff6b(size=5.8 G), total size for store is 5.8 G. This selection was in queue for 0sec, and took 1mins, 19sec to execute.
2021-01-11 08:51:46,548 INFO org.apache.hadoop.hbase.regionserver.HStore: Completed major compaction of 5 (all) file(s) in Info of CallLogs,\x7F\x81\x01M8{O\x92\x9C\xC2J\x01\xB7r8\xF4,1608529936723.34e0ffe299134cd8ad22145ae2314d3e. into 246537faa92741ea865437eebf2e1e9a(size=6.0 G), total size for store is 6.0 G. This selection was in queue for 0sec, and took 1mins, 29sec to execute.
12-08-2020
09:23 AM
Hello @lihao, this is an old post, yet we can use the "-skip" flag of the HBCK2 Tool to ensure the tool doesn't check the Master version; a rough invocation sketch follows below. The "-skip" flag is documented via Link [1], which is the Git page of the HBCK2 Tool. - Smarak [1] https://github.com/apache/hbase-operator-tools/tree/master/hbase-hbck2
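A rough invocation sketch (the jar path, command and arguments are placeholders; check the README above for the exact syntax of your version):

# Hypothetical jar path; --skip (short form -s) bypasses the Master version check
hbase hbck -j /path/to/hbase-hbck2.jar --skip <command> [<args>]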
04-13-2020
11:55 PM
Hi @jsensharma, my shell setup is correct and the outputs of all commands look exactly the same as the ones you mentioned.
02-24-2020
06:14 PM
I tried spark.streaming.backpressure.pid.minRate and it works as expected. My configuration:
spark.shuffle.service.enabled: "true"
spark.streaming.kafka.maxRatePerPartition: "600"
spark.streaming.backpressure.enabled: "true"
spark.streaming.concurrentJobs: "1"
spark.executor.extraJavaOptions: "-XX:+UseConcMarkSweepGC"
batch.duration: 5
spark.streaming.backpressure.pid.minRate: 2000

With this configuration the first batch starts with 15 (total number of partitions) x 600 (maxRatePerPartition) x 5 (batch duration) = 45,000 records, but it is not able to process that many records in 5 seconds, so the rate drops to ~10,000 = 2000 (pid.minRate) x 5 (batch duration). So spark.streaming.backpressure.pid.minRate is in total records per second. Just set spark.streaming.backpressure.pid.minRate and leave the following configs at their defaults:
spark.streaming.backpressure.pid.integral
spark.streaming.backpressure.pid.proportional
spark.streaming.backpressure.pid.derived
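For reference, a minimal spark-submit sketch carrying the same settings (the class, jar and master are placeholders; the 5-second batch duration itself is set in the application code, not via a conf key):

spark-submit \
  --master yarn \
  --class com.example.StreamingApp \
  --conf spark.shuffle.service.enabled=true \
  --conf spark.streaming.kafka.maxRatePerPartition=600 \
  --conf spark.streaming.backpressure.enabled=true \
  --conf spark.streaming.concurrentJobs=1 \
  --conf spark.streaming.backpressure.pid.minRate=2000 \
  --conf spark.executor.extraJavaOptions=-XX:+UseConcMarkSweepGC \
  streaming-app.jar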
11-26-2019
10:46 AM
Hi,
I have an HBase table with one million rows, and when we query it using a non-existent rowkey value the query takes more than 50 seconds. Example:
table : test
rowkey 1 : AB1234
query 1 : get 'test', 'AB12345'
rowkey 2 : DF1234
query 2 : get 'test', 'DF12345'
rowkey 3 : BC1234
query 3 : get 'test', 'BC12345'
Queries 1, 2 and 3 all take more than 50 seconds.
Any ideas, please?
best,
Helmi KHALIFA
11-14-2019
05:07 AM
Hi!
I have some problems managing the HBase major compaction.
I configured major compaction to run between 1 and 4 am, but we still see major compactions executed at any hour.
Here are the two configurations I tried:
First configuration:
hbase.hregion.majorcompaction=7 Days 0 Hours
hbase.offpeak.start.hour=1
hbase.offpeak.end.hour=4
Second configuration:
hbase.hregion.majorcompaction=0 Days 0 Hours
hbase.offpeak.start.hour=1
hbase.offpeak.end.hour=4
Did I miss something, please?
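In the meantime, the workaround I am considering is to keep the time-based trigger disabled (the second configuration above) and trigger major compactions myself inside the window, roughly like this (the table name, script path and schedule are only an example):

# Hypothetical cron entry, runs weekly at 01:30, inside the 1-4 am window:
# 30 1 * * 0 /opt/scripts/weekly_major_compact.sh
echo "major_compact 'mytable'" | hbase shell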
Thank you for your answer.
Best,
Helmi KHALIFA
11-14-2019
02:54 AM
Hey @avengers, just thought this could add some more value to the question here. Spark SQL uses a Hive Metastore to manage the metadata of persistent relational entities (e.g. databases, tables, columns, partitions) in a relational database (for fast access) [1]. Also, I don't think there would be a Metastore crash if we use it along with Hive on Spark. [1] https://jaceklaskowski.gitbooks.io/mastering-spark-sql/spark-sql-hive-metastore.html
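A small command-line sketch of Spark SQL going through the Hive Metastore (it assumes a hive-site.xml with hive.metastore.uris is on Spark's classpath, which is how HDP normally wires it up; the statements are placeholders):

# Uses the Hive-backed catalog and runs two placeholder statements against the metastore
spark-sql \
  --conf spark.sql.catalogImplementation=hive \
  -e "SHOW DATABASES; SHOW TABLES IN default;"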
11-12-2019
06:22 AM
Hi, we need to see the cluster resource usage during the time frame when the jobs are in the ACCEPTED state. If the entire memory and all allocated vcores have been used, the job does not have sufficient resources to run the application. Please look at the RM web UI and share a Scheduler screenshot of all the queues to view the usage of the cluster. Thanks, AKR
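PS: if it helps, the same information can be pulled from the ResourceManager REST API (the hostname is a placeholder; 8088 is the usual default port):

# Cluster-wide memory and vcore usage
curl -s "http://<rm-host>:8088/ws/v1/cluster/metrics"
# Per-queue capacity, usage and pending applications
curl -s "http://<rm-host>:8088/ws/v1/cluster/scheduler"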
11-05-2019
01:21 AM
Hi @Rak, here is the script:

CREATE EXTERNAL TABLE IF NOT EXISTS sample_date (
  sc_code string,
  ddate timestamp,
  co_code DECIMAL,
  high DECIMAL,
  low DECIMAL,
  open DECIMAL,
  close DECIMAL,
  volume DECIMAL,
  no_trades DECIMAL,
  net_turnov DECIMAL,
  dmcap DECIMAL,
  return DECIMAL,
  factor DECIMAL,
  ttmpe DECIMAL,
  yepe DECIMAL,
  flag string
)
ROW FORMAT DELIMITED
  FIELDS TERMINATED BY ' '
  LINES TERMINATED BY '\n'
STORED AS TEXTFILE
LOCATION '/lab/itim/ccbd/helmi/sampleDate'
TBLPROPERTIES ('skip.header.line.count'='1');

ALTER TABLE sample_date SET SERDEPROPERTIES ("timestamp.formats"="MM/dd/yyyy");

Could you accept the answer, please?

Best,
Helmi KHALIFA
- Tags:
- Hive
10-25-2019
01:18 AM
1 Kudo
Hi @RNN, the best solution is to convert the months to integers, like:
-Oct- => -10-
-Dec- => -12-
That is what I tested, as you can see in my file below:

$ hdfs dfs -cat /lab/helmi/test_timestamp_MM.txt
1,2019-10-14 20:00:01.027898
2,2019-12-10 21:00:01.023
3,2019-11-25 20:00:01.03
4,2019-01-06 20:00:01.123

Create a Hive table:

hive> CREATE EXTERNAL TABLE ttime(id int, t string) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' STORED AS TEXTFILE LOCATION '/lab/helmi/';
hive> select * from ttime;
OK
1 2019-10-14 20:00:01.027898
2 2019-12-10 21:00:01.023
3 2019-11-25 20:00:01.03
4 2019-01-06 20:00:01.123
Time taken: 0.566 seconds, Fetched: 4 row(s)

Finally I created another table with the right format:

hive> create table mytime as select id, from_utc_timestamp(date_format(t,'yyyy-MM-dd HH:mm:ss.SSSSSS'),'UTC') as datetime from ttime;

Best,
Helmi KHALIFA
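PS: if the raw data still contains month abbreviations, a small sketch of the conversion step itself (the input file name is hypothetical; the output is the file used above):

# Rewrite -Oct- style month abbreviations into -10- style numbers before loading into Hive
sed -e 's/-Jan-/-01-/g' -e 's/-Feb-/-02-/g' -e 's/-Mar-/-03-/g' \
    -e 's/-Apr-/-04-/g' -e 's/-May-/-05-/g' -e 's/-Jun-/-06-/g' \
    -e 's/-Jul-/-07-/g' -e 's/-Aug-/-08-/g' -e 's/-Sep-/-09-/g' \
    -e 's/-Oct-/-10-/g' -e 's/-Nov-/-11-/g' -e 's/-Dec-/-12-/g' \
    raw_input.txt > test_timestamp_MM.txt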
10-24-2019
09:37 PM
Hive works in UTF-8 by default. CREATE TABLE AS SELECT with this condition works well, but a view cannot be created correctly with the same condition.
10-24-2019
07:55 PM
Unfortunately it doesn't. My result CSV looks like this:
id,age,name
1,29,
2,17,
09-24-2019
02:18 AM
Hi @hadoopguy, yes, there is an impact: you will have longer processing times and the operations will be queued. You have to handle the timeouts in your jobs carefully. Best, @helmi_khalifa
09-24-2019
02:11 AM
Hi Suresh, There is no command but you can easily find the information on the HBase Web UI. http://host:16010/master-status#baseStats Best, Helmi KHALIFA
05-27-2019
09:56 AM
Hi, because of too-frequent HBase major compactions, I am trying to run major compaction manually on all tables using a script; what I have so far is sketched below. Is there an easier way of doing this? Best, Helmi KHALIFA
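The sketch (it only assumes the hbase shell is on the PATH and that the user may compact every table; in the shell's JRuby API, 'list' returns the array of table names):

hbase shell <<'EOF'
# Loop over every table name returned by 'list' and major-compact it
list.each { |t| major_compact t }
EOF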
05-19-2019
04:02 PM
The above was originally posted in the Community Help track. On Sun May 19 16:00 UTC 2019, the HCC moderation staff moved it to the Hadoop Core track. The Community Help Track is intended for questions about using the HCC site itself.
03-07-2019
02:37 PM
Hi, I installed:
zeppelin 0.8.0
HDP-3.1.0.0 (3.1.0.0-78)
Then I configured zeppelin.server.port=8080. The problem now is that it works randomly. When the status shows green, everything is OK and I see my notebooks; but when I log in and it is still showing red with the message "WebSocket Disconnected", my notebooks disappear and I can't work on or create anything! Any help please? Thanks. Best, Helmi KHALIFA
03-18-2019
09:08 PM
@Josh Elser Can you please guide me on how and where to set the Java heap space on the client? I have Windows machines where my app runs, and the Phoenix queries are triggered from these Windows systems. I see no logs on the server side, so I believe the query is failing on the client side itself.
06-07-2018
10:26 AM
Here are some hints: http://hbase.apache.org/0.94/book/secondary.indexes.html In most cases you'll have to create a second index table, roughly as sketched below.
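A rough sketch of the second-table pattern in the hbase shell (the table and column names are made up for illustration):

hbase shell <<'EOF'
# Data table keyed by user id, plus an index table keyed by the secondary attribute (email)
create 'users', 'cf'
create 'users_by_email', 'cf'
put 'users', 'u123', 'cf:email', 'a@b.com'
# The index row points back at the data rowkey; the client must keep both puts in sync
put 'users_by_email', 'a@b.com', 'cf:user', 'u123'
# Look up by email first, then fetch the user row
get 'users_by_email', 'a@b.com'
EOF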
01-30-2018
08:27 PM
thank you @Josh Elser 🙂
01-17-2018
12:11 AM
Are there any best practices for tuning the optimal number of Spark Streaming executors vs. the number of Kafka partitions? NB: we have 20 Kafka partitions (1 TB of logs per day) and 21 Spark Streaming executors. Unfortunately, this configuration blocks 400 GB of RAM even when there are no events. Thanks.
10-19-2017
08:56 AM
Yes, it works! Thank you @Aditya Sirna 🙂
10-11-2017
06:42 PM
Thank you @Matt Burgess 🙂
07-22-2017
09:35 AM
I am using the same syntax as yours but it doesn't work. There were some missing properties in the hive-site.xml file. I added these properties, as in my comment below, and it works now:
mapred.input.dir.recursive
hive.mapred.supports.subdirectories
Thanks
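PS: the session-level equivalent of those two properties can be tested before touching hive-site.xml, roughly like this (the query and table name are placeholders):

hive -e "
SET mapred.input.dir.recursive=true;
SET hive.mapred.supports.subdirectories=true;
SELECT COUNT(*) FROM my_table_with_subdirectories;
"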
12-20-2018
01:36 PM
Hi Muji, great job 🙂 Just missing a ',' after B_df("_c1").cast(StringType).as("S_STORE_ID"):

// Assign column names to the Region dataframe
val storeDF = B_df.select(
  B_df("_c0").cast(IntegerType).as("S_STORE_SK"),
  B_df("_c1").cast(StringType).as("S_STORE_ID"),
  B_df("_c5").cast(StringType).as("S_STORE_NAME")
)
12-16-2016
10:09 AM
Thank you! 🙂