I am new to Cloudera & even Spark. I tested my code with the CoreNLP server. Suppose I deploy this code on Spark & expect it to run in parallel on multiple documents, won't the CoreNLP server become a bottleneck & even a single point of failure.
When I searched for "CoreNLP Server on Spark", the results take me to Databricks - https://github.com/databricks/spark-corenlp
How do I make the NLP tasks running on Spark independent of a singleton service? Or am I missing something.