Jobs!
Meetups by TopicMeetups by CityStart a MeetupWhat's Meetup
Home > All Topics > Internet & Technology > Semantic Web Meetups Everywhere > The New York Semantic Web Meetup Group
Organizer:

Join The New York Semantic Web Meetup Group!

We'll never share your email address without your permission. Already a Meetup member? Sign in.

Past meeting

Automatic Machine Translation & Natural Language Processing

« go to calendar view

Thursday, February 21, 2008 at 6:30 PM
KONA
New York, NY 10010

4.75 (12 ratings)

Meeting Description

Organized by

Marco Neumann

Details

6:30 Networking
7:00 An Introduction to ANTLR
7:45 Automatically Linking Structured and Unstructured Data
8:30 Open Discussion


An Introduction to ANTLR
Andy Tripp
President, Jazillian Inc
http://www.jazillian....

Automatic machine translation and language parsing. ANTLR, Another Tool for Language Recognition, is a language tool that provides a framework for constructing recognizers, interpreters, compilers, and translators from grammatical descriptions containing actions in a variety of target languages. The SPARQL query language grammar for ANTLR v3 was recently updated to version 1.1 and provides an implementation of the W3C SPARQL grammar specification.

http://www.antlr.org/...

Automatically Linking Structured and Unstructured Data: Connecting Databases to Text
Breck Baldwin
President, Alias-i Inc
http://www.alias-i.co...

Natural language processing for text analytics, text data mining and search. LingPipe is a state-of-the-art suite of natural language processing tools written in Java that performs tokenization, sentence detection, named entity detection, coreference resolution, classification, clustering, part-of-speech tagging, general chunking, fuzzy dictionary matching. These general tools support a range of applications.

Breck will discuss the thorny problem of linking entities in a database to text mentions of those entities.

The challenges are:

- The John Smith problem: You have a text mention of "John Smith" and many possible John Smiths in the database. How to pick?

- The name variant problem: Your database has an incomplete list of aliases for a gene. Serpina3 has the alias 'ACT', but is also called 'AACT' in the literature but you don't know that.

- The new entity problem: You want to discover new performers when they show up in your entertainment text sources. Those new performers are not in your database yet, how is that handled?

Breck will discuss how you can approach these problems using the LingPipe suite of tools in context of entertainment news and bioinformatics.

Talk About This Meeting

Pre-event comments
You must be a member to post a comment. Join or sign in.

Who Attended

The organizer estimated that 33 people attended.