Think through the system and following up tasks. Should:
1. To store about a million html files
2. The same text files
3. zip, pdf files
4. It is necessary to search for text and html files
If it matters, I have some experience on the use of bundles mysql+sphinx.
Scalability need about 10 million html and many text files.
What solutions can you recommend?
1. Where and how should I store the html and txt files?
2. Where and how to store files and pdf?
3. As the data is stored, for example, search engines? Where to read?