1. Convert PDF to file with e.g xpdf
2. Insert parsed text to a table of your choice.
3. Make vectors from the text.
Actually, if you're not going to use the headline() function, you cna
just store it directly in a vector, cutting down on the size
requirements.
What size requirements ?
Just insert to the to_tsvector() result. The full text is
required for headline() though, so you can't cheat on that.

//Magnus

---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend

Search Discussions

Discussion Posts

Previous

Follow ups

Related Discussions

Discussion Navigation
viewthread | post
posts ‹ prev | 6 of 7 | next ›
Discussion Overview
grouppgsql-general @
categoriespostgresql
postedDec 11, '06 at 11:11a
activeDec 12, '06 at 7:51a
posts7
users4
websitepostgresql.org
irc#postgresql

People

Translate

site design / logo © 2022 Grokbase