Sign up for our weekly newsletter!
Christopher Moody from StitchFix will be speaking in BIDS about novel techniques in Natural Language Processing.
Standard natural language processing (NLP) is a messy and difficult affair. It requires teaching a computer about English-specific word ambiguities as well as the hierarchical, sparse nature of words in sentences. At Stitch Fix, word vectors help computers learn from the raw text in customer notes. Our systems need to identify a medical professional when she writes that she 'used to wear scrubs to work', and distill 'taking a trip' into a Fix for vacation clothing. Applied appropriately, word vectors are dramatically more meaningful and more flexible than current techniques and let computers peer into text in a fundamentally new way. Chris will speak about word2vec, related techniques, and try to convince you that word vectors give us a simple and flexible platform for understanding text.
Word vectors allow us to capture the semantic "distance" between words at multiple scales, allowing greater flexibility than other contemporary techniques as well as fundamental changes in text understanding.