author Tatsuya Kinoshita <tats@debian.org> 2020-12-19 04:20:49 +0000
committer Tatsuya Kinoshita <tats@debian.org> 2020-12-19 04:25:29 +0000
commit 1d967a4d423c0571a86d319ac16d704bbf29e51a
tree a36d9d7a43e048f908320fd68bd43ff08fe5d4cb
parent Update ChangeLog
Add examples of siteconf, set user_agent to Googlebot for Twitter
-rw-r--r-- doc-jp/README.siteconf 6
-rw-r--r-- doc/README.siteconf 6
2 files changed, 12 insertions, 0 deletions
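
The url pattern added by this commit can be sanity-checked outside w3m. The following is a minimal sketch, assuming that Python's re module treats this particular pattern the same way w3m's own matcher does (the pattern only uses anchors, an optional group, and a character class). The function name and the sample URLs are hypothetical and only serve as illustration; the m!...! delimiters are siteconf syntax and are dropped here.

```python
import re

# Pattern body copied from the siteconf entry in this commit.
# Assumption: Python's `re` behaves like w3m's matcher for these constructs.
TWITTER_URL = re.compile(r"^https?://([a-z]+\.)?twitter\.com/")


def matches_siteconf_pattern(url: str) -> bool:
    """Return True if `url` would be selected by the Twitter url line."""
    return TWITTER_URL.search(url) is not None


if __name__ == "__main__":
    # Hypothetical sample URLs, chosen to probe the edges of the pattern.
    samples = [
        "https://twitter.com/w3m",           # bare domain over https
        "http://mobile.twitter.com/",        # one lowercase subdomain label
        "https://twitter.com",               # no trailing slash
        "https://api.v2.twitter.com/",       # two labels, one with a digit
        "https://twitter.com.example.org/",  # look-alike host
    ]
    for url in samples:
        print(f"{url}: {matches_siteconf_pattern(url)}")
```

Under that assumption, the pattern covers the bare domain and a single all-lowercase subdomain label over http or https, and it requires a slash right after the host name, so look-alike hosts such as twitter.com.example.org are not selected.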
diff --git a/doc-jp/README.siteconf b/doc-jp/README.siteconf
index 4189712..d638a9a 100644
--- a/doc-jp/README.siteconf
+++ b/doc-jp/README.siteconf
@@ -48,6 +48,12 @@ user_agent "Lynx/2.8.8dev.3 libwww-FM/2.14 SSL-MM/1.4.1"
Google に Lynx であると告げます。(これによりテキストブラウザ向けページが
返ります)
+url m!^https?://([a-z]+\.)?twitter\.com/!
+user_agent "Googlebot/2.1"
+
+Twitter に Googlebot であると告げます。(これによりJavaScriptが無効の
+ブラウザが拒否されずにページが返ります)
+
===== 正規表現について =====
次の正規表現はいずれも同じ意味を表します。
diff --git a/doc/README.siteconf b/doc/README.siteconf
index 0369926..8514edf 100644
--- a/doc/README.siteconf
+++ b/doc/README.siteconf
@@ -47,6 +47,12 @@ user_agent "Lynx/2.8.8dev.3 libwww-FM/2.14 SSL-MM/1.4.1"
Tell Google we're actually Lynx. (So they send us a text-browser friendly
results page.)
+url m!^https?://([a-z]+\.)?twitter\.com/!
+user_agent "Googlebot/2.1"
+
+Tell Twitter we're actually Googlebot. (So they send us a page without
+rejection of a JavaScript disabled browser.)
+
===== Regular expressions notes =====
Following expressions are all equivalent: