Category: Natural language processing and text mining
The area of natural language processing is the main focus of Inspiratron.org and Nikola Milosevic. This is the area of artificial intelligence dealing with human language understanding. The topics in this category may include views, opinions, methodologies and techniques regarding information extraction, information retrieval, named entity recognition, classification, sentiment analysis, etc.
Marvin – A tool for semantic annotation released
During the last week I have released a version of Marvin – a tool for semantic annotations, that is able to annotate text using various sources, such as UMLS (using MetaMap), DBPedia, using some SPARQL interface, WordNet and probably most importantly SKOS (Simple Knowledge Organization System ) format for representing lexicons,…
Starting an open source effort to bring biomedical data and tools together
Motivation from the community A week ago I posted an blog post about my idea of creating Open Source organization that would have an goal to bring biomedical data and tools together to a single place. The original post can be read found here: https://inspiratron.org/blog/2015/12/12/open-source-effort-to-bring-all-open-biomedical-data-together/. Since I posted it, quite large…
Open source effort to bring all open biomedical data together
Motivation There is a vast amount of openly available biomedical information on the web in the form of open access journal publications, biomedical, gene or drug databases, drug labels and more. We are talking about millions of full text articles (i.e. indexed in PMC and possibly some in services such as…
Expirience from Lisbon Machine Learning Summer School
I have participated on Lisbon Machine Learning Summer School (LxMLS), which took place on July 16-23 at Instituto Superior Técnico, a leading Engineering and Science school in Portugal. It is organized jointly by IST, the Instituto de Telecomunicações and the Spoken Language Systems Lab – L2F of INESC-ID. It was quite…
Marvin – semantic annotator
We created Marvin, an semantic text annotation tool that uses external resources such as DBPedia and WordNet to annotate semantically text. Marvin is built in java and can be used as a standalone application or as a library. Marvin semantic annotator already has a lot of knowledge, which will make…
xgoogle python library upgrade for google image search
Couple of days ago I realized that I need some library that will allow me search google and especially face images. I was previously working a bit with the Google API, however Google offers only 100 requests per day and there are some other limitations. It is very good API…
What is the structure in data?
Have you ever wondered what is structured data? Why is this important? Well, from structured data we can extract semantic and know what is in the data, use the data. However, if we don’t know how data is structured, we will be unable to extract semantics and to understand it….
Building a search engine (Lucene tutorial)
Since the Google took over lives and branded a verb for searching as Googling, making a search engine is considered cool thing. I have crossed over search engines several times in my life. Even I worked in a company that was I guess pretending to build search engine (I was…
Brief introduction to Linked Data
My recent research brought me to linked data as quite interesting concept. I will here write some brief introduction and notes on linked data. Probably at some future point of time I will go deeper into standards and usage. Current state of the Web Internet have revolutionized the way we…
Political bot (AI) fighting human bots (using NLP and OCR)
Probably I should write this on Serbian, but to keep consistency, English it is. Since soon elections gonna be held in Serbia, there is a lot of talk about political campaigns. And one of the major issue in the news are human bots applied in the political campaign on the internet….
What is the big deal with natural language processing?
Recently here at Manchester University, at one class for all PhD students we realized that almost half of student in a group are doing some kind of natural language processing and almost everyone was doing something related with machine learning (even hardware guys are building neural network like multi-processor architecture)….
Personalized relevance classifier of sentences
In this article I would like just to pitch idea about personalized classifier, and I would like to hear your opinion if this approach could be good and what can be problems with it. So what is the problem? I would like to build personalized relevance classifier. Problem definition Every…