Fast analysis of natural language with performance of under 1millisecond*1 for < 10 word queries. With this sub millisecond latency NLP performance, ThatNeedle is order of magnitude faster*2 than some of the big names providing such services. This is ideal for real time NLU applications involving text processing. ThatNeedle performs not just recognition but also performs entity processing/slot filling as required.
Our libraries work on CPUs and this makes for cost efficient operations.
This is unlike many other frameworks where GPU or hardware accelerator is mandatory for getting the system ready for natural language processing tasks.
You can work with NLP offline without the overhead of network latency. It will work offline without internet and can be hosted on your own servers residing on your premises. This not only removes network latency for NLP but also helps with security and various data privacy compliances such as GDPR.
An NLP framework that is suited for general purpose applications like Siri and Alexa assistants will not work well for domain specific applications. You need something that will be aware of your domain. ThatNeedle NER has the ability to be configured for your domain quickly.
For domain specific applications, geting IOB format annotated datsets (and other similarly annotated datasets) can be a problem. Even if you manage to get such annotations done, it is slow and expensive. With ThatNeedle NER, you are not dependent on IOB annotations. We are designed to work with minimum processing from raw data.
ThatNeedle will automatically handle natural language numbers in words and convert them to digits.
Your developers can setup with just a few lines of code in your favourite programming language such as Python, C++, etc. With ThatNeedle libraries, you can extract custom named entities from unstructured text query with just 1 line of code.
Let us know about your custom entity recognition needs.
Some topic extraction solutions restrict the entities to nouns, proper nouns etc. But depending on the business needs, you might want to have some particular types identified and extracted as entities.
You should be able to define what to extract as custom entity and what not to label as an entity. If done naively, this is a tricky exercise and people often end up burning their hands.
We will create the best solution for your text analysis and named entity recognition needs. We can custom create and test custom models for your niche and give you the pre-trained software solution that is ready to use for your niche and specific needs.
While the software allows the user to define custom entities and annotation, any other customization cost would be over and above the default price mentioned.
The default language is English, but the technology is capable of effective handling other languages, includes Asian languages like Chinese, Japanese, Arabic etc.
These are traditionally a challenge, but our algorithms are designed to solve these natural language understanding issues.
Please take a look at our other NLP libraries also.
NER or Named Entity Recognition / Entity extraction identifies, extracts and labels the information in text into pre-defined categories, or classes such as location, names of people, brand, product etc. It is a loosely used term to also include entity-extraction of information such as dates, numbers, phone, url etc. Entities could be any useful data or information for example, date time, names, location, dimensions etc that could be stored or used for text processing. Some extractors, identify proper nouns or nouns as entities but thats too rigid and is not a good rigid rule. A good entity extractor should be able to take a string of unstructured text and identify and produce annotated output or a structured output that helps in intelligent and better analysis of the text. Such intelligent understanding (NLU) of the intent of the user query will help in producing better responses from the system. If it is a search query, it would mean better understanding of the query and more relevant search results because of better intent inference. There is no universal entity extractor and the needs of the business must be taken into account before selecting a software tool to perform such tasks. Many such general purpose tools might be able to parse general entities like date, time, location etc but give poor accuracy for the context of the business in question, and are therefore not fit to be used in specific niches. A good tool will recognize the context of the niche and give annotations and analysis accordingly.
Most language processing software cannot parse the query and analyse the query fast enough to be used effectively in user interfacing applications. As a result response from the backend query processing system appears to be slow and tests the patience of the end user. A fast response to the query is essential not only to delight the customer, but to keep him engaged. A slow application will give the user a good reason to direct his valuable attention elsewhere. ThatNeedle has always recognised the need for speed in NLP and is making the core engine faster everyday. We are also proud to say that we are 10x faster than some leading natural language entity extraction from text service providers This would make ThatNeedle an ideal candidate for real time extraction tasks from plain text. ThatNeedle NER can serve as an ideal text processing tool for big data scientists, data architects, semantic search solution providers, realtime natural language processing, large scale NLP etc
Out of the box, ThatNeedle could be used as an effective and faster NLP microservice alternative to Microsoft Luis, alternative to IBM watson, alternative to Wit.ai, alternative to api.ai, alternative to Natty etc
Even if you are using traditional specialized parsers like Natty for Java or any similar library for date extraction etc, you should compare the performance with ThatNeedle and decide which is superior for yourself! Thatneedle can provide great speed and accuracy because of its high precision high recall engineering.
*1. Based on tests conducted by ThatNeedle in 2020. Performance tests are conducted using specific computer systems and reflect the approximate performance of ThatNeedle library. This test has been performed on a 2012 MacBookPro with an Intel Core i5 processor and 4GB RAM.
*2. Based on tests conducted by ThatNeedle in 2020. Performance tests are conducted using specific computer systems and reflect the approximate performance of ThatNeedle library and other systems. Their performance might have improved since.