Play with my stemmer.

Glenn Scheper glenn_scheper at earthlink.net
Tue Jan 30 09:19:30 CST 2007


This mornings work on WordsEx went very smoothly. Must have been good concurrent tantra.

You can click on works, and see what words the Porter Stemmer algorithm matches.

I can tell you the pages in memory containing the words,
but I have not written the fulltext search yet.

Maybe that will only be a couple days hence. A search results log:

Find:
awake

Thread started.

Search tokens, without stemming:
awake - 42 matches

Search tokens, matches after stemming:
awake - 42 matches
awaked - 8 matches
awaking - 1 match

Diagnostic: This text has matches:
file://\i\kn\01_gene.txt
Diagnostic: This text has matches:
file://\i\kn\07_judg.txt
Diagnostic: This text has matches:
file://\i\kn\09_1sam.txt
Diagnostic: This text has matches:
file://\i\kn\11_1kin.txt
Diagnostic: This text has matches:
file://\i\kn\12_2kin.txt
Diagnostic: This text has matches:
file://\i\kn\18_job_.txt
Diagnostic: This text has matches:
file://\i\kn\19_psal.txt
Diagnostic: This text has matches:
file://\i\kn\20_prov.txt
Diagnostic: This text has matches:
file://\i\kn\22_song.txt
Diagnostic: This text has matches:
file://\i\kn\23_isai.txt
Diagnostic: This text has matches:
file://\i\kn\24_jere.txt
Diagnostic: This text has matches:
file://\i\kn\27_dani.txt
Diagnostic: This text has matches:
file://\i\kn\29_joel.txt
Diagnostic: This text has matches:
file://\i\kn\35_haba.txt
Diagnostic: This text has matches:
file://\i\kn\38_zech.txt
Diagnostic: This text has matches:
file://\i\kn\41_mark.txt
Diagnostic: This text has matches:
file://\i\kn\42_luke.txt
Diagnostic: This text has matches:
file://\i\kn\43_john.txt
Diagnostic: This text has matches:
file://\i\kn\44_acts.txt
Diagnostic: This text has matches:
file://\i\kn\45_roma.txt
Diagnostic: This text has matches:
file://\i\kn\46_1cor.txt
Diagnostic: This text has matches:
file://\i\kn\49_ephe.txt

 ... The full text search part of this thread is not coded yet.

Thread ended.




More information about the Pynchon-l mailing list