How to add HTML and plain text files to a Sphinx index?


The Sphinx documentation says: "The data to be indexed can generally come from very different sources: SQL databases, plain text files, HTML files, mailboxes, and so on".


But the quick Sphinx usage tour only shows how to configure Sphinx for a MySQL database. How do I configure it to work with HTML and plain text files?

1 Answer

You will need the xmlpipe2 data source:
sphinxsearch.com/docs/1.10/xmlpipe2.html
You will also have to write a script that generates XML (in the format described there) from the HTML or plain text files, though a ready-made one may already exist, so it is worth searching for first.
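
A minimal sketch of such a script, in case it helps. It assumes the files are UTF-8, that only .txt, .html and .htm files should be indexed, and that two full-text fields named filename and content are enough; those field names, the EXTENSIONS tuple and the function names are illustrative choices, not anything Sphinx prescribes. Document ids are simply assigned in sorted path order, so they change if files are added or removed.

```python
import re
import sys
from html.parser import HTMLParser
from pathlib import Path
from xml.sax.saxutils import escape

# Control characters that are not allowed in XML 1.0 documents.
_INVALID_XML = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")
# Assumption: which files count as indexable; adjust to your data.
EXTENSIONS = (".txt", ".html", ".htm")


class TextExtractor(HTMLParser):
    """Collects text nodes, skipping <script> and <style> bodies."""

    def __init__(self):
        super().__init__()
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self._chunks.append(data)

    def text(self):
        # Join the pieces and collapse runs of whitespace.
        return " ".join(" ".join(self._chunks).split())


def html_to_text(markup):
    parser = TextExtractor()
    parser.feed(markup)
    parser.close()
    return parser.text()


def build_docset(root):
    """Return an xmlpipe2 docset (as a string) for the files under root."""
    lines = [
        '<?xml version="1.0" encoding="utf-8"?>',
        "<sphinx:docset>",
        "<sphinx:schema>",
        '<sphinx:field name="filename"/>',  # hypothetical field name
        '<sphinx:field name="content"/>',   # hypothetical field name
        "</sphinx:schema>",
    ]
    paths = sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() in EXTENSIONS
    )
    # Sphinx needs a unique positive integer id per document.
    for doc_id, path in enumerate(paths, start=1):
        raw = path.read_text(encoding="utf-8", errors="replace")
        body = raw if path.suffix.lower() == ".txt" else html_to_text(raw)
        body = _INVALID_XML.sub(" ", body)
        lines.append('<sphinx:document id="%d">' % doc_id)
        lines.append("<filename>%s</filename>" % escape(path.name))
        lines.append("<content>%s</content>" % escape(body))
        lines.append("</sphinx:document>")
    lines.append("</sphinx:docset>")
    return "\n".join(lines) + "\n"


# Emit the docset on stdout only when a directory is passed on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    sys.stdout.write(build_docset(sys.argv[1]))
```

In sphinx.conf the source would then use type = xmlpipe2 and an xmlpipe_command that runs the script on your directory, since the indexer reads the XML from the command's standard output. The script path and directory there are whatever you choose.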
An example of indexing MemcacheDB:
nutrun.com/weblog/distributed-key-value-store-indexing/
Here even PDF files are indexed:
www.sphinxsearch.com/forum/view.html?id=338