Finance NLP 1.2.0 for Spark NLP has been released!

Sai Shailesh

Published in

spark-nlp

4 min readNov 11, 2022

“Positive” sentiment analysis on Financial Texts, taken from Prosus FINBERT

We are happy to welcome the new 1.2.0 version of Finance NLP, including the following new capabilities.

Spark Ecosystem

Finance NLP has been built on top of Spark NLP, which uses Spark MLLib pipelines. This means, You can have a common pipeline with any component of Spark NLP of Spark MLLib. Also, you combine it with the rest of our licensed libraries, such as Visual NLP, Healthcare NLP or Legal NLP. The library works on the top of Transformers and other Deep Learning architectures, providing state-of-the-art models which can be run on Spark Clusters. Remember, Spark NLP is the only library natively scalable to do parallel computing, so it is Legal NLP.

New Models

Relation extraction

finre_work_experience_md → This is a md (medium) version of the finre_work_experience model, trained with more data and with unidirectional relation extractions, meaning now the direction of the arrow matters: it goes from the source (chunk1) to the target (chunk2).

Acquisitions / Subsidiaries Relation Extraction (md, Unidirectional)

finre_acquisitions_subsidiaries_md → This model is a md model of finre_acquisitions_subsidiaries, meaning that the directions in the relations are meaningful: chunk1 is the source of the relation, chunk2 is the target.

Financial Relation Extraction on 10K filings (Small)

finre_financial_small → This model extracts relations between amounts, counts, percentages, dates and the financial entities extracted with one of these models: finner_financial_small finner_financial_medium finner_financial_large.

Text Classification

finclf_indian_news_sentiment → This is a small version of the Indian News Sentiment Analysis Text Classifier, which will retrieve if a text is either expression of a Positive Emotion or a Negative one.
finclf_indian_news_sentiment_medium → This is a medium version of the Indian News Sentiment Analysis Text Classifier, meaning this model is a more accurate and will retrieve if a text is either expression of a Positive Emotion or a Negative one.
finclf_acquisitions_item → This model is a Binary Classifier (True, False) for the acquisitions item type of 10K Annual Reports.
finclf_work_experience_item → This model is a Binary Classifier (True, False) for the work_experience item type of 10K Annual Reports.
finclf_auditor_sentiment_analysis → This is a Sentiment Analysis model which retrieves 3 sentiments (positive, negative or neutral) from the Auditors' comments.

Named Entity Recognition

finner_sec_dates → This NER model is trained on SEC documents and empowered with OntoNotes 2022, to extract DATES. This model is light but very accurate.

New Demos

You can find the existing demos on our Demo site, where you will find demos, showcasing some of the models available in Models Hub.

Financial Relation Extraction on 10K filings → This model extracts relations between amounts, counts, percentages, dates and the financial entities extracted with finner_financial models.

Solution Accelerator — Updated!

A Solution Accelerator is a ready-to-use series of notebooks, available and ready to spin up inside Databricks. The final aim of this accelerator is to help you analyze companies information.

All of that information is processed and stores in form of a graph…

The Solution Accelerator is now updated with new and better models. To top it off, a new End to End example using a Securities Exchange Commission 10K form from one company has been included.

Want to see more?

Check our Models Hub
Check our Notebooks
Check our Demos

How to install

!pip install johnsnowlabsfrom johnsnowlabs import *from johnsnowlabs import *
jsl.install(json_license_path=[your_finance_license_path])
jsl.start(json_license_path=[your_finance_license_path])

Do you want to request a free trial?

Go to our self-service installation page here and request a trial. Write to support@johnsnowlabs.com if you have enquiries, or find us at our Slack Channel (#finance)

spark-nlp

Finance NLP 1.2.0 for Spark NLP has been released!

Spark Ecosystem

New Models

Relation extraction

Text Classification

Named Entity Recognition

New Demos

Solution Accelerator — Updated!

Want to see more?

How to install

Do you want to request a free trial?

Sign up to discover human stories that deepen your understanding of the world.

Free

Membership

Published in spark-nlp

Written by Sai Shailesh

No responses yet