Turney & Littman's (2003) Semantic Orientation from Association

This is an implementation of the semantic/sentiment similarity method developed by Turney and Littman (2003). The underlying matrix is a word × word co-occurrence matrix extracted from the NYT section of English Gigaword. The rows are the Harvard Inquirer words and the columns are the full vocabulary; both are restricted to items with at least 60 tokens in the data. The notion of co-occurrence is a restricted one: two words co-occur only if they share a semantic dependency in a Stanford collapsed dependency representation. PPMI with contextual discounting was applied to the count matrix, truncated SVD (150 dimensions) was applied to the result, and pairwise cosine similarity was then computed for the row (Inquirer) vocabulary.
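The matrix pipeline described above (reweighting, dimensionality reduction, cosine similarity) can be sketched on a toy count matrix. This is a minimal illustration, not the code behind this page. The function names are hypothetical, and the discount used here is the one proposed by Pantel and Lin, which is an assumption about what "contextual discounting" refers to. The toy example reduces to 2 dimensions rather than 150.

```python
import numpy as np

def ppmi_discounted(counts):
    """Positive PMI with a Pantel-and-Lin-style discount (assumed variant)."""
    counts = np.asarray(counts, dtype=float)
    total = counts.sum()
    row = counts.sum(axis=1, keepdims=True)   # shape (m, 1)
    col = counts.sum(axis=0, keepdims=True)   # shape (1, n)
    expected = row * col / total              # counts expected under independence
    with np.errstate(divide="ignore", invalid="ignore"):
        pmi = np.log(counts / expected)
    pmi[~np.isfinite(pmi)] = 0.0              # zero counts contribute nothing
    ppmi = np.maximum(pmi, 0.0)
    smaller = np.minimum(row, col)            # broadcasts to shape (m, n)
    discount = (counts / (counts + 1.0)) * (smaller / (smaller + 1.0))
    return ppmi * discount

def reduce_svd(matrix, k):
    """Keep the first k dimensions of the SVD row representation."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :k] * s[:k]

def cosine_matrix(rows):
    """Pairwise cosine similarity between all row vectors."""
    norms = np.linalg.norm(rows, axis=1, keepdims=True)
    norms[norms == 0] = 1.0                   # avoid dividing by zero for empty rows
    unit = rows / norms
    return unit @ unit.T

# Toy 3 x 3 count matrix (made-up numbers, for illustration only).
toy_counts = [[10, 0, 2], [8, 1, 0], [0, 9, 7]]
toy_similarity = cosine_matrix(reduce_svd(ppmi_discounted(toy_counts), k=2))
```

In the setting described above, the rows would be the Inquirer words, the columns the full vocabulary, and `k` would be 150.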

Provide two seed sets drawn from the Harvard Inquirer vocabulary (each as a comma-separated string):

Seed set 1:

Seed set 2:

or select one of the following: the seven-word Turney–Littman seed sets, or random five-word subsets of some Harvard Inquirer oppositions:

Turney–Littman Pos: good, nice, excellent, positive, fortunate, correct, superior
Turney–Littman Neg: bad, nasty, poor, negative, unfortunate, wrong, inferior
Positiv: aptitude, daring, devotion, prosperity, inquisitive
Negativ: stifle, brazen, stormy, resent, toil
Strong: naval, say, audacity, roar, rebellion
Weak: beseech, fearful, tiny, absent, obey
Active: purchase, stood, near, say, shun
Passive: attention, worry, grow, expressive, coincidence
Pleasur: game, therapeutic, content, admiration, pride
Pain: myself, bitter, anxiousness, torturous, unhappy
Ovrst: thereby, heroic, immortal, simple, atrocious
Undrst: coincidence, hesitation, indistinguishable, trivial, unimportant
Yes: least, ya, alright, yea, absolute
No: disagree, no, nope, ugh, mean
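Given two seed sets such as those listed above, a word can be scored in the style of Turney and Littman's SO-A: its summed association with seed set 1 minus its summed association with seed set 2. The sketch below uses cosine similarity as the association measure. The helper names, the lowercasing of input, and the two-dimensional toy vectors are all hypothetical, and whether this page sums or averages the similarities is not stated, so treat the details as assumptions.

```python
import math

def parse_seeds(text):
    """Split a comma-separated seed string into clean, lowercased words."""
    return [w.strip().lower() for w in text.split(",") if w.strip()]

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def orientation(word, seeds1, seeds2, vectors):
    """Summed similarity to seed set 1 minus summed similarity to seed set 2.

    Seeds missing from `vectors` are skipped.
    """
    target = vectors[word]
    toward1 = sum(cosine(target, vectors[s]) for s in seeds1 if s in vectors)
    toward2 = sum(cosine(target, vectors[s]) for s in seeds2 if s in vectors)
    return toward1 - toward2

# Made-up 2-dimensional vectors, for illustration only.
toy_vectors = {
    "good": [1.0, 0.1], "nice": [0.9, 0.2],
    "bad": [-1.0, 0.1], "nasty": [-0.9, 0.3],
    "superb": [0.8, 0.1], "awful": [-0.8, 0.2],
}
set1 = parse_seeds("good, nice")
set2 = parse_seeds("bad, nasty")
score = orientation("superb", set1, set2, toy_vectors)
```

A positive score places the word closer to seed set 1, and a negative score places it closer to seed set 2.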