First Workshop on Computational Approaches to Code Switching

Saturday October 25th, 2014

Session 1: Workshop talks
9:00-9:10 Welcome Remarks
The organizers
9:10–9:30 Foreign Words and the
Automatic Processing of Arabic Social Media Text Written in
Roman Script

Ramy Eskander, Mohamed Al-Badrashiny, Nizar
Habash and Owen Rambow
9:30–9:50 Code Mixing: A Challenge
for Language Identification in the Language of Social Media


Utsab Barman, Amitava Das, Joachim Wagner and Jennifer Foster
9:50–10:10 Detecting Code-Switching
in a Multilingual Alpine Heritage Corpus

Martin Volk and
Simon Clematide
10:10–10:30 Exploration of the Impact
of Maximum Entropy in Recurrent Neural Network Language Models
for Code-Switching Speech

Ngoc Thang Vu and Tanja Schultz
10:30-11:00 Coffee Break

Session 2: Workshop Talks and Shared Task Systems
11:00–11:20 Predicting Code-switching
in Multilingual Communication for Immigrant Communities

Evangelos
Papalexakis, Dong Nguyen and A. Seza Dog ̆ruöz
11:20–11:40 Twitter Users #CodeSwitch
Hashtags! #MoltoImportante #wow

David Jurgens, Stefan
Dimitrov and Derek Ruths
11:40–11:50 Overview for the First
Shared Task on Language Identification in Code-Switched Data


Thamar Solorio, Elizabeth Blair, Suraj Maharjan, Steven Bethard,
Mona Diab, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi,
Julia Hirschberg, Alison Chang and Pascale Fung
11:50–12:10 Word-level Language
Identification using CRF: Code-switching Shared Task Report of
MSR India System

Gokul Chittaranjan, Yogarshi Vyas,
Kalika Bali and Monojit Choudhury
12:10–12:30 The CMU Submission for the
Shared Task on Language Identification in Code-Switched Data


Chu-Cheng Lin, Waleed Ammar, Lori Levin and Chris Dyer
12:30-14:00 Lunch break
Session 3: Shared Task and Next Steps
14:00–14:20 Language Identification in
Code-Switching Scenario

Naman Jain and Riyaz Ahmad Bhat
14:20–14:40 AIDA: Identifying Code
Switching in Informal Arabic Text

Heba Elfardy, Mohamed
Al-Badrashiny and Mona Diab
14:40–15:00 The IUCL+ System:
Word-Level Language Identification via Extended Markov Models


Levi King, Eric Baucom, Timur Gilmanov, Sandra Kübler, Dan
Whyatt, Wolfgang Maier and Paul Rodrigues
15:00-15:30 Panel Discussion: Next Steps in CS Research
Group Discussion
15:30-16:00 Coffee Break (Posters set up time)
Session 4: Poster Session
16:00-17:30 Workshop and Shared Task Posters
Multiple presenters
Mixed Language and
Code-Switching in the Canadian Hansard

Marine Carpuat
“I am borrowing ya mixing
?” An Analysis of English-Hindi Code Mixing in Facebook

Kalika
Bali, Jatin Sharma, Monojit Choudhury and Yogarshi Vyas
DCU-UVT: Word-Level
Language Classification with Code-Mixed Data

Utsab
Barman, Joachim Wagner, Grzegorz Chrupała and Jennifer Foster
Incremental N-gram
Approach for Language Identification in Code-Switched Text

Prajwol
Shrestha
The Tel Aviv University
System for the Code-Switching Workshop Shared Task

Kfir
Bar and Nachum Dershowitz
The CMU Submission for the
Shared Task on Language Identification in Code-Switched Data

Chu-Cheng
Lin, Waleed Ammar, Lori Levin and Chris Dyer
Word-level Language
Identification using CRF: Code-switching Shared Task Report of
MSR India System

Gokul Chittaranjan, Yogarshi Vyas,
Kalika Bali and Monojit Choudhury
Language Identification in
Code-Switching Scenario

Naman Jain and Riyaz Ahmad Bhat
AIDA: Identifying Code
Switching in Informal Arabic Text

Heba Elfardy, Mohamed
Al-Badrashiny and Mona Diab
The IUCL+ System:
Word-Level Language Identification via Extended Markov Models


Levi King, Eric Baucom, Timur Gilmanov, Sandra Kübler, Dan
Whyatt, Wolfgang Maier and Paul Rodrigues