Staged Logistic Regression
Pick a set of X feature types
- sum of frequencies of all terms in query x1
- sum of frequencies of all query terms in document x2
- query length x3
- document length x4
- sum of idf’s for all terms in query x5
Determine weights, c, to indicate how important each feature type is (use training examples)
To assign a score to the document:
- add up the feature weight times the term weight for each feature and each term in the query