Risk Focus is hosting the latest NYC Pyladies gathering for Python developers titled “Build a Language Identifier” at our New York offices. Gregory Marton will be presenting with a hands-on coding session to follow. To sign up visit: http://www.meetup.com/NYC-PyLadies/events/219867637/
Overview: When you start to analyze free text, often the first task is to figure out what language it’s in. The tools that help with that are also broadly used for other tasks in text understanding. We will touch on advanced algorithms, data structures, and numerical methods, but the workshop is meant to be accessible to anyone comfortable with dictionaries in python and basic probability.
Preparation: Know about python dictionaries and have familiarty with basic probability. Bring a laptop with python 3 and git. Please make sure your laptop is wireless enabled and fully charged.
Itinerary: 7-8pm Roll out your own character bigram language identifier. 8-9p Show and tell with scikit-learn, langid, and the Google Translate API.
Instructor Bio: Gregory Marton is a software engineer with about 200 interviews at Google and a few in other settings. He regularly teaches a class aimed at helping candidates prepare for Google on-site interviews. He has Master’s and Engineer’s degrees in EECS from MIT.