Lucene Indexing
What is a Search Index
Requirements of a Search Index
Indexing Strategies
Lucene Indexing Fundamentals
Some Indexing Terms
Merge Algorithm
Lucene Indexing Algorithm
Search Algorithm
A Lucene Segment
[Figure: segments file layout. Fields in order: Format, Version, Name Counter, SegCount, then <SegName, SegSize> repeated SegCount times]
Per Index Files
Per Segment Files
File Format (contd.)
Per Segment Files (prefix = segname)
Per Segment Files (contd.)
Segment layout
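[Figure: segment layout. Labels recovered from the diagram:
.tii file (in memory): sampled terms Ti, Tj, each with an IndexDelta; random seeks
.tis file (on disk): terms Ti, Ti+1, ..., Tj, ..., Tj+k; contiguous reads; each entry holds Term, DocFreq, FreqDelta, ProxDelta, SkipDelta
.frq/posting file (on disk): entries for Tq, Tq+1; posting list (term freqs) TF1 ... TFd, followed by skip data SD1, SD2, ..., SDd/sk; each skip entry holds DocId, FreqSk, ProxSk (TFsk marks the skipped-to entry)
.prx file
Annotation: "Used to merge postings"]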
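The .tii (in memory) and .tis (on disk) labels in the segment layout suggest a two-level term lookup: a small sampled index is searched first, then a short contiguous run of the full term dictionary is scanned. The following is a minimal sketch of that idea only, using in-memory Python lists in place of the real files. The function names and the `index_interval` parameter are hypothetical and are not Lucene's API.

```python
from bisect import bisect_right


def build_term_index(sorted_terms, index_interval=128):
    """Sample every index_interval-th term, like a small in-memory .tii.

    Each entry is (term, position of that term in the full dictionary).
    """
    return [(sorted_terms[i], i) for i in range(0, len(sorted_terms), index_interval)]


def lookup(term, sorted_terms, term_index, index_interval=128):
    """Return the position of term in sorted_terms, or None if absent."""
    # Step 1: binary search over the sampled terms (the "random seek").
    keys = [t for t, _ in term_index]
    slot = bisect_right(keys, term) - 1
    if slot < 0:
        return None  # term sorts before the first dictionary entry
    start = term_index[slot][1]
    # Step 2: scan at most index_interval entries (the "contiguous read").
    end = min(start + index_interval, len(sorted_terms))
    for pos in range(start, end):
        if sorted_terms[pos] == term:
            return pos
        if sorted_terms[pos] > term:
            break  # passed the place where term would be
    return None
```

With 1,000 sorted terms and an interval of 128, the sampled index holds 8 entries, and any lookup touches at most 128 consecutive dictionary entries.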
Index Compression
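Two techniques commonly associated with Lucene index compression are delta encoding (storing the gap from the previous value, as the FreqDelta, ProxDelta and SkipDelta labels suggest) and variable-length integers (VInt: 7 payload bits per byte, low-order bits first, high bit set when more bytes follow). The sketch below combines the two for a list of document ids. It is a simplification: the function names are hypothetical, and the real .frq format also interleaves term frequencies, which is omitted here.

```python
def encode_vint(n):
    """Encode a non-negative integer as a VInt (low-order 7-bit groups first)."""
    if n < 0:
        raise ValueError("VInt encodes non-negative integers only")
    out = bytearray()
    while n >= 0x80:
        out.append((n & 0x7F) | 0x80)  # high bit set: more bytes follow
        n >>= 7
    out.append(n)  # final byte has the high bit clear
    return bytes(out)


def decode_vints(data):
    """Decode a byte string holding a sequence of VInts."""
    values, current, shift = [], 0, 0
    for byte in data:
        current |= (byte & 0x7F) << shift
        if byte & 0x80:
            shift += 7
        else:
            values.append(current)
            current, shift = 0, 0
    return values


def encode_postings(doc_ids):
    """Delta-encode ascending doc ids, then write each gap as a VInt."""
    out = bytearray()
    prev = 0
    for doc_id in doc_ids:
        out += encode_vint(doc_id - prev)
        prev = doc_id
    return bytes(out)


def decode_postings(data):
    """Invert encode_postings by accumulating the gaps."""
    doc_ids, prev = [], 0
    for gap in decode_vints(data):
        prev += gap
        doc_ids.append(prev)
    return doc_ids
```

For example, the doc ids 5, 130, 131, 100000 become the gaps 5, 125, 1, 99869, which take 6 bytes in total instead of 16 bytes as fixed 32-bit integers.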
Search Algorithm
Vector Space Model
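In the vector space model, documents and queries are represented as weighted term vectors and ranked by the cosine of the angle between them. The sketch below is a generic textbook tf-idf version under stated assumptions (weight = tf * (1 + ln(N / df)), hypothetical function names). It is not Lucene's scoring formula, which applies further factors not shown here.

```python
import math
from collections import Counter


def document_frequencies(docs):
    """Count, for each term, how many documents contain it."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    return df


def tfidf_vector(tokens, df, num_docs):
    """Map each known term to tf * (1 + ln(N / df)); unknown terms are dropped."""
    tf = Counter(tokens)
    return {
        term: freq * (1.0 + math.log(num_docs / df[term]))
        for term, freq in tf.items()
        if df.get(term)
    }


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query_tokens, docs):
    """Return (doc index, score) pairs sorted by descending cosine score."""
    df = document_frequencies(docs)
    n = len(docs)
    q = tfidf_vector(query_tokens, df, n)
    scored = [(i, cosine(q, tfidf_vector(d, df, n))) for i, d in enumerate(docs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Calling `rank(["lucene", "index"], docs)` on a small tokenized corpus returns the documents ordered by similarity to the query; a document sharing no terms with the query scores 0.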
Search Algorithm (contd.)
Others..
Open ended questions