xapian · uppinder · Jul 21, 2018 · Jul 21, 2018 · Aug 4, 2018 · Aug 11, 2018
diff --git a/conf.py b/conf.py
@@ -79,7 +79,7 @@
 _project = u'Getting Started with Xapian'
 _authors = u'Xapian Documentation Team & Contributors'
 project = u'%s %s' % ( _project, version)
-copyright = u'2003-2017 ' + _authors
+copyright = u'2003-2018 ' + _authors
 
 github_project_url = 'https://github.com/xapian/xapian-docsprint/blob/master'
 

diff --git a/glossary.rst b/glossary.rst
@@ -67,6 +67,9 @@ Retrieval, while others have a specific meaning in the context of Xapian.
  A family of probabilistic weighting schemes developed more recently than
  BM25.  Xapian 1.3 adds supports for a number of such schemes.
 
+**Diversification**
+ In order to increase user’s satisfaction, the presented result set should not only be relevant to the search topic, but should also present a variety of perspectives, that is, the results should be different from one another, especially for ambiguous queries. The effectiveness of web search and the satisfaction of users can be enhanced by providing various results of a search query in a certain order of relevance and concern, known as diversification.
+
 **Document ID**
  A unique positive integer identifying a document in a Xapian database.
 

diff --git a/howtos/diversification.rst b/howtos/diversification.rst
@@ -0,0 +1,37 @@
+Diversification of Search Results
+=================================
+
+.. contents:: Table of contents
+
+Introduction
+------------
+
+Xapian allows for diversification of documents which are stored in the form of an MSet.
+This feature is a well-known technique in information retrieval used to increase
+user satisfaction, especially for ambiguous queries.
+
+Xapian currently has an implementation of an *implict* method (using documents as features,
+as opposed to using query based features such as query logs) adapted from the C :sup:`2` - GLS method mentioned in Scalable and Efficient Web Search Results Diversification, Naini et al. 2016. This saves the cost of not having to provide external features such as query
+logs, while still achieving the desired diversification effect, which according to
+the paper is reasonable enough for practical uses as tested on the public data set - ClueWeb09 with TREC Web 09/10 queries.
+
+API
+---
+
+Diversification on an MSet of results can be achieved by using the
+:xapian-method:`Diversify` class, e.g.::
+
+    // Query a database and get 10 results, where 'enq' is an instantiated
+    // Enquire object over a database
+    matches = enq.get_mset(0, 10)
+
+Now, cluster the 10 candidate documents into 4 clusters and use (at most) top-2
+documents from each cluster for diversification::    
+
+    k, r = 4, 2
+    // Instantiate Diversify object
+    d = xapian.Diversify(k, r)
+
+Perform diversification over 'matches' and obtain an ordered list of documents::
+
+    dset = d.get_dmset(matches)
diff --git a/howtos/index.rst b/howtos/index.rst
@@ -13,3 +13,4 @@ How To...
    synonyms
    weighting_scheme
    iterate_all_docs
+   diversification