FAQ
Hi all,

Just FYI, perhaps this is old news for you ... This large corpus is
freely available and it is pairwise sentence-aligned for all language
combinations. This looks like a good resource for linguistic
information, such as frequent words and phrases, n-gram profiles, etc.

http://wt.jrc.it/lt/Acquis/


--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Search Discussions

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupjava-user @
categorieslucene
postedJan 24, '08 at 9:03p
activeJan 24, '08 at 9:03p
posts1
users1
websitelucene.apache.org

1 user in discussion

Andrzej Bialecki: 1 post

People

Translate

site design / logo © 2022 Grokbase