Probability and Language Modeling

Will Styler - LIGN 6


We’ve got corpora now!


Today’s Plan


What is probability?


Probability


Sample Probabilities


Probabilities can be calculated from observation


Surprisal and Information


Some things in life are surprising


Defining Surprisal and information


What’s the surprisal of…


Now we can quantify how likely a given event is


We can also estimate our uncertainty


Conditional Probability


Conditional Probability

‘What is the probability of this event, given that this other event occurred?’


Probabilities are often conditional on other events


Differences in conditional probabilities are information!


Differences in conditional probability let us model language!


Using Probability for language modeling


Knowing probability of any individual word is helpful!


Knowing the mutual information of linguistic elements helps to solve problems!


Automatic Speech Recognition


Text-to-speech


Spelling correction


Document classification


Sentiment Analysis


Studying the racialization of language


Studying the racialization of language (cont.)


The most common use of word probability in our lives…


Predictive Text and Language Probability


Every word in language is informative about the next



Phrasal information decreases surprisal



Predictive text just formalizes this


Testing the first time and then it is fine but it’s a nice little app but it does have to a lot cheaper than it but it’s fun and it makes it very interesting and it is a great idea for a great time with good people to play for a bit and a great way home fun fun and good way cheaper cheaper and cheaper than a free version for a free app but


Hi I hope you’re doing alright girl you are so nice to you have fun I hope you’re having fun I’m sorry I’m not gonna was a nice night I just wanna


Swype/Swipe-to-type







Swype requires a language model!


Probability models are helpful for NLP!


How much data do you need?


How much data do you need to find…


Probability estimates get better with more data!


Wrapping up


Thank you!

… and thank you to Eric Meinhardt, on whose talk this is partially based