As on Habre to find a test in your review?


Warning: count(): Parameter must be an array or an object that implements Countable in /home/styllloz/public_html/qa-theme/donut-theme/qa-donut-layer.php on line 274
0 like 0 dislike
4 views
I have 1200 reviews. I want to find among them those which have the word "write". How to do it?
-Search the site for this word gives a lot of options on all users, not just me
-Search in Google\\Yandex on the website Habra by my nick and this word gives a bunch of pages with other people's comments with this word and without him my
-Search in Google\\Yandex this word on the website tangro.habrahabr.ru/comments/ betrays nothing
-Look through the comments on the pages and search on each search browser a little boring (lots of pages). Discover all the comments one can not (well, I don't know how).

There are some appropriate ways (than "write a spider to collect all the pages with comments and looked at them")?
by | 4 views

6 Answers

0 like 0 dislike
tangro &&/+3 "write" site:habrahabr.ru — for Yandex.
+3 means a distance of not more than three sentences in a forward direction. Gives, in my opinion, for the most part your comments.
by
0 like 0 dislike
in robots.txt (http://%username%.habrahabr.ru/robots.txt) is prohibited indexation just to be on the subdomain a user, so that search engines can't find anything
\r
User-agent: *
Disallow: /
Host: %username%.habrahabr.ru
by
0 like 0 dislike
You have described all the possible ways. It only remains a trendy option for geeks — find the SQL-Injection bug, and search through the database :)
by
0 like 0 dislike
Comments are on the page %USERNAME%.habrahabr.ru/comments/page%NUMBER%/
The number of the last page we find the hands — on direct the mouse arrow.
\r
Further options:
\r
( 0) YQL, unfortunately, there is no ban on indexing in robots.txt )
1) shell script that can call wget with a delay. Discover khtml N-nicknames can be found.
For Windows, if there is no wget or reluctance to write the batch file, it is possible in VBScript/JScript is too long.
2) JavaScript-one-liner in the address bar of the browser that a delay will add to the page N of the iframe.
In the browser disable pictures and flash, get naked text page, Ctrl + F steers.
\r
If it does not fall under the definition of a "writing spider", I think — quite a way out.
by
0 like 0 dislike
And here's the answers:
\r
1) Venda/wget in one line doesn't fit, plus c delay CMD in tight:
\r
\r
set MAXPSTO=30 set HABRUSER=tangro for /L %i in (1,1,%MAXPSTO%) DO @echo http://%HABRUSER%.habrahabr.ru/comments/page%i/ >> tmp.url wget-w 5 tmp.url 

\r
2) Knicks/wget — no comment:
\r
\r
$ for i in {1..30} ; do wget http://tangro.habrahabr.ru/comments/page$i/ && sleep 5 ; done 

\r
3) cross platform — JavaScript in the address bar of the browser, in Opera, and chrome sort of works, the main thing — in advance to cut off images and plugins. During found strange behavior of setInterval, and in General, some code in the format "javascript: code" seems to work only from exile, but not from the address bar, so the script has grown dramatically.
\r
\rpastebin.com/EXc7DFQC
\r
The legs kicking is pointless, coleone the solution :)
by
0 like 0 dislike
Perverted version:
1) put the AutoPager extension for chrome or Fox.
2) Go to the page with the comments and press "End". Browser machine pulls the next page and displays it below.
3) Poweram P2. until you get the "sheet" with all their creativity
4) Looking for tools browser
by

Related questions

0 like 0 dislike
3 answers
asked Mar 24, 2019 by xlamys
0 like 0 dislike
1 answer
asked Mar 25, 2019 by Fastto
0 like 0 dislike
4 answers
0 like 0 dislike
1 answer
0 like 0 dislike
4 answers
110,608 questions
257,186 answers
0 comments
27,899 users