Natural language processing and text mining

0

Announcing new online courses

0

Might of the word embeddings

Three most important lessons about neural networks and word embeddings: 1. No free lunch, 2. Size matters, 3. Engineering matters

0

[New Paper] Information extraction from tables in literature

About two months ago, a paper that resulted from my Ph.D. work has been published in the International Journal of Document Analysis and Recognition. The paper is titled “A framework for information extraction from tables in biomedical literature”.

0

Building Named Entity Recognizer (NER) using Conditional Random Fields (CRF)

Named entity recognizer is a program that recognizes named entity in text. The named entities could be anything from locations, company or person’s names, drug or disease names, etc.

0

Ideas for the future

I would like to state a couple of ideas that I have been thinking in the past number of days regarding what I do, which generally is natural language processing and machine learning. They may be something I am already working on and some ideas for the future and future directions. Only time will tell which I will manage to tackle.

Named Entity recognition

Interesting topic, however, it seems it is moving more towards industry domain, rather than academia. However, there are still a lot of people working on this, especially in specialized domains, such as biomedicine. Lately, we are having at the University one big project related to anonymization of clinical health records and lab reports. As a first

1

Awarded best paper award on NLDB 2018 conference

A paper called “Classification of Intangible Social Innovation Concepts” that was submitted and accepted for presentation at 23rd International Conference on Natural Language & Information Systems (NLDB2018) and was held in Paris, France from 13th to 15th June 2018, received one of the best paper award. In total 3 papers were awarded as the best papers with no ranking or order between them. Papers also received monetary award.

NLDB is quite established (organised already for 23 years) and good conference in the area of natural language processing. Usually about 15-18% of papers submitted are accepted as long papers. It seems like some more papers are accepted as short papers and poster presentations, so the percentages of accepted papers is higher, but

0

Impressions from HealTAC2018 conference

On 18th and 19th April 2018, the first UK health text analytics conference (HealTAC) took place in Manchester.  The main conference venue was Pendulum hotel, located on Stackville street, close to the north University of Manchester campus, the former UMIST. I had a pleasure to participate and help with a certain organisational things, as a member of local organisation committee member.

On the first day of the conference, people started arriving between 8:30-9:00 for registration. During the registration people could have some coffee and pastry for breakfast. People having posters were directed to the poster room, so they can already hang their posters. The conference started at 9:00, with welcome speech, some health and safety procedures, which was followed

0

The art of trolling and its connection to innovation

Recently I watched the following talk from DefCon named “The art of trolling”, which reminded me of my earlier days and stuff I did:

Also, interestingly enough, I was on an event where we had a number of speakers (academics and policy makers) talking about innovation policy, innovation management and similar. However, from all these talks, one point was missing: very often innovation is made for Lulz.

Let me start with two examples of trolling I was involved with, which required quite some bit of work and let’s call it out of the box thinking, with some degree of innovation.

 

Introducing trolls

Trolling politically motivated news commenters

I

0

Moment when my idea became a web standard

This is the story how one schema I worked on as a side project suddenly found its place in W3C recommendation.

In November 2015, I went with my supervisor to Japan. In small cities of Mishima and Ito, about 1 hour train ride from Tokyo was held Biomedical Linked Annotation Hackathon (BLAH2) to which my supervisor was invited. He could not stay for the whole period, so he offered me to go, which I accepted. The event was organised by Japanese Database Center for Life Sciences (DBCLS).

On the first day was the conference, where people were presenting their work mainly on annotating biomedical literature. My PhD was related, kind of similar topic, it was about information extraction from tables

OWASP Seraphimdroid
0

Educational framework added to OWASP Seraphimdroid

OWASP Seraphimdroid is back after Google Summer of Code with a new version, this time, it will be 2.5 and some exciting changes, we thought that may help users protect their security and privacy. I would like to briefly write about our new features. The app is available on old good place: Google play (https://play.google.com/store/apps/details?id=org.owasp.seraphimdroid)

What is new?

Educational component

From the very beginning of OWASP Seraphimdroid project, as a project leader, I saw in it a project that contains an educational component. However, until now we aimed to develop a number of features that will on technical level protect users. I have been writing about the development previously:

OWASP SERAPHIMDROID ANDROID SECURITY PUBLISHED

NEW VERSION OF OWASP SERAPHIMDROID