Describir: Multiple affordances of language corpora for data-driven learning /