Category: Natural language processing and text mining

The area of natural language processing is the main focus of Inspiratron.org and Nikola Milosevic. This is the area of artificial intelligence dealing with human language understanding. The topics in this category may include views, opinions, methodologies and techniques regarding information extraction, information retrieval, named entity recognition, classification, sentiment analysis, etc.

Posted in Artificial intelligence Natural language processing and text mining Science Software Tales from life

Starting an open source effort to bring biomedical data and tools together

Motivation from the community A week ago I posted an blog post about my idea of creating Open Source organization that would have an goal to bring biomedical data and tools together to a single place. The original post can be read found here: https://inspiratron.org/blog/2015/12/12/open-source-effort-to-bring-all-open-biomedical-data-together/. Since I posted it, quite large…

Posted in Artificial intelligence Natural language processing and text mining

Open source effort to bring all open biomedical data together

Motivation There is a vast amount of openly available biomedical information on the web in the form of open access journal publications, biomedical, gene or drug databases, drug labels and more. We are talking about millions of full text articles (i.e. indexed in PMC and possibly some in services such as…

Posted in Artificial intelligence Natural language processing and text mining Tales from life

Expirience from Lisbon Machine Learning Summer School

I have participated on Lisbon Machine Learning Summer School (LxMLS), which took place on July 16-23  at Instituto Superior Técnico, a leading Engineering and Science school in Portugal. It is organized jointly by IST, the Instituto de Telecomunicações and the Spoken Language Systems Lab – L2F of INESC-ID. It was quite…

Posted in Artificial intelligence Natural language processing and text mining Science Work

Marvin – semantic annotator

We created Marvin, an semantic text annotation tool that uses external resources such as DBPedia and WordNet to annotate semantically text. Marvin is built in java and can be used as a standalone application or as a library. Marvin semantic annotator already has a lot of knowledge, which will make…

Posted in Artificial intelligence Natural language processing and text mining Software Work

xgoogle python library upgrade for google image search

Couple of days ago I realized that I need some library that will allow me search google and especially face images. I was previously working a bit with the Google API, however Google offers only 100 requests per day and there are some other limitations. It is very good API…

Posted in Artificial intelligence Natural language processing and text mining Software Work

What is the structure in data?

Have you ever wondered what is structured data? Why is this important? Well, from structured data we can extract semantic and know what is in the data, use the data. However, if we don’t know how data is structured, we will be unable to extract semantics and to understand it….

Posted in Artificial intelligence Natural language processing and text mining Software Work

Building a search engine (Lucene tutorial)

Since the Google took over lives and branded a verb for searching as Googling, making a search engine is considered cool thing. I have crossed over search engines several times in my life. Even I worked in a company that was I guess pretending to build search engine (I was…

Posted in Artificial intelligence Natural language processing and text mining Science Software Work

Brief introduction to Linked Data

My recent research brought me to linked data as quite interesting concept. I will here write some brief introduction and notes on linked data. Probably at some future point of time I will go deeper into standards and usage. Current state of the Web Internet have revolutionized the way we…

Posted in Artificial intelligence Natural language processing and text mining Software Work

Political bot (AI) fighting human bots (using NLP and OCR)

Probably I should write this on Serbian, but to keep consistency, English it is. Since soon elections gonna be held in Serbia, there is a lot of talk about political campaigns. And one of the major issue in the news are human bots applied in the political campaign on the internet….

Posted in Artificial intelligence Natural language processing and text mining Science Software Work

What is the big deal with natural language processing?

Recently here at Manchester University, at one class for all PhD students we realized that almost half of student in a group are doing some kind of natural language processing and almost everyone was doing something related with machine learning (even hardware guys are building neural network like multi-processor architecture)….

Relevance
Posted in Artificial intelligence Natural language processing and text mining Software Work

Personalized relevance classifier of sentences

In this article I would like just to pitch idea about personalized classifier, and I would like to hear your opinion if this approach could be good and what can be problems with it. So what is the problem? I would like to build personalized relevance classifier. Problem definition Every…

spell checking
Posted in Artificial intelligence Natural language processing and text mining Science Software Work

How to create spell check?

Have you ever wondered what the underlying technology and math behind a spell checker are? I did, so I even built my own spell checker. Also, with this technology in mind, you can do many other things. It is useful to apply it to a voice recognition api. If you…