{"id":2446,"date":"2021-03-10T17:52:12","date_gmt":"2021-03-10T17:52:12","guid":{"rendered":"https:\/\/seonorth.ca\/?page_id=2446"},"modified":"2025-02-15T21:17:48","modified_gmt":"2025-02-15T21:17:48","slug":"custom-extraction","status":"publish","type":"page","link":"https:\/\/seonorth.ca\/zh\/screaming-frog\/custom-extraction\/","title":{"rendered":"\u5c16\u53eb\u86d9\u81ea\u5b9a\u4e49\u63d0\u53d6\u3002\u63d0\u53d6\u722c\u884c\u6570\u636e\u7684\u6307\u5357"},"content":{"rendered":"

Screaming Frog\uff08screamingfrog.co.uk\uff09\u662f\u4e00\u6b3e\u529f\u80fd\u5f3a\u5927\u7684\u641c\u7d22\u5f15\u64ce\u4f18\u5316\u5de5\u5177\uff0c\u5177\u6709\u8bb8\u591a\u641c\u7d22\u5f15\u64ce\u4f18\u5316\u529f\u80fd\uff0c\u5176\u4e2d\u5305\u62ec\u81ea\u5b9a\u4e49\u63d0\u53d6\u529f\u80fd\uff0c\u53ef\u4ee5\u8ba9\u60a8\u8f7b\u677e\u5730\u4ece\u6293\u53d6\u4e2d\u63d0\u53d6\u6570\u636e\u3002\u672c\u535a\u6587\u5c06\u8ba8\u8bba Screaming Frog \u81ea\u5b9a\u4e49\u63d0\u53d6\u529f\u80fd\u7684\u5de5\u4f5c\u539f\u7406\uff0c\u4ee5\u53ca\u4e3a\u4ec0\u4e48\u5b83\u53ef\u4ee5\u5e2e\u52a9\u6539\u8fdb\u641c\u7d22\u5f15\u64ce\u4f18\u5316\u5de5\u4f5c\u3001\u7535\u5b50\u5546\u52a1\u6570\u5b57\u8425\u9500\u548c\u7d22\u5f15\u7b56\u7565\u3002<\/p>\n\n\n\n

\"\u5c16\u53eb\u86d9\u5b9a\u5236\u63d0\u53d6\"<\/figure>\n\n\n\n

\u7f51\u7ad9\u4e0a\u6709\u5927\u91cf\u6709\u7528\u7684\u4fe1\u606f--\u5927\u591a\u6570\u60c5\u51b5\u4e0b\uff0c\u8981\u8bbf\u95ee\u7f51\u7ad9\u4e0a\u7684\u6bcf\u4e2a\u9875\u9762\uff0c\u5c06\u4ea7\u54c1\u6570\u636e\u3001\u5143\u6570\u636e\u3001\u6807\u9898\u6807\u7b7e\u548c\u951a\u6587\u672c\u590d\u5236\u5230\u7535\u5b50\u8868\u683c\u4e2d\uff0c\u65e2\u8d39\u529b\u53c8\u590d\u6742\u3002\u5728\u8fd9\u79cd\u60c5\u51b5\u4e0b\uff0cScreaming Frog \u4f7f\u7528 API \u548c\u6b63\u5219\u8868\u8fbe\u5f0f\u6765\u81ea\u52a8\u5b8c\u6210\u81ea\u5b9a\u4e49\u641c\u7d22\u6570\u636e\u63d0\u53d6\u3002\u81ea\u5b9a\u4e49\u63d0\u53d6\u662f\u4e00\u79cd\u7f51\u7edc\u641c\u522e\u3001\u7f51\u7edc\u91c7\u96c6\u6216\u7f51\u7edc\u6570\u636e\u63d0\u53d6\u5f62\u5f0f\uff0c\u7528\u4e8e\u4ece\u7f51\u7ad9\u4e0a\u641c\u522e\u548c\u63d0\u53d6\u6570\u636e\uff0c\u4f7f\u60a8\u53ef\u4ee5\u5c06\u6570\u636e\u5b58\u50a8\u5728\u672c\u5730\u8ba1\u7b97\u673a\u4e0a\u3002<\/p>\n\n\n\n

\u5bf9\u4e8e\u521d\u5b66\u8005\uff0c\u4f60\u53ef\u80fd\u6709\u4e00\u4e9b\u95ee\u9898\u3002<\/p>\n\n\n\n

\u4ec0\u4e48\u662f <\/strong>\u5c16\u53eb\u86d9SEO\u8718\u86db<\/strong>?<\/strong><\/h2>\n\n\n\n

Screaming Frog SEO Spider \u8f6f\u4ef6\u662f\u4e00\u6b3e\u7f51\u7ad9\u722c\u866b\uff0c\u53ef\u901a\u8fc7\u56fe\u5f62\u7528\u6237\u754c\u9762\uff08GUI\uff09\u63d0\u53d6\u548c\u5206\u6790\u7f51\u7ad9\u7684\u7ed3\u6784\u5316\u6570\u636e\uff0c\u6709\u6548\u5904\u7406 XML \u548c JavaScript \u6e32\u67d3\u7684\u5185\u5bb9\uff0c\u4ece\u800c\u63d0\u9ad8\u7f51\u7ad9\u7684\u641c\u7d22\u5f15\u64ce\u4f18\u5316\u6c34\u5e73\u3002<\/p>\n\n\n\n

\u4ec0\u4e48\u662f <\/strong>\u5b9a\u5236\u62d4\u7259<\/strong>?<\/strong><\/h2>\n\n\n\n

\u81ea\u5b9a\u4e49\u63d0\u53d6\u662f Screaming Frog \u7684 SEO \u8718\u86db\u4ece\u7f51\u9875\u4e2d\u63d0\u53d6\u660e\u786e\u4fe1\u606f\u7684\u529f\u80fd\u3002\u8fd9\u4e9b\u63d0\u53d6\u4fe1\u606f\u6709\u52a9\u4e8e\u4f18\u5316\u60a8\u7684\u7f51\u7ad9\uff0c\u4ee5\u4fbf\u8fdb\u884c\u6280\u672f\u6027\u641c\u7d22\u5f15\u64ce\u4f18\u5316\u5ba1\u6838\uff0c\u5305\u62ec\u641c\u7d22\u7ed3\u679c\u3001\u6536\u96c6\u6709\u5173\u526f\u672c\u7684\u91cd\u8981\u6570\u636e\uff0c\u4ee5\u53ca\u5e2e\u52a9\u5b9a\u4f4d\u548c\u4fee\u590d\u6807\u9898\u548c\u5176\u4ed6\u5143\u7d20\u4e2d\u7684\u9519\u8bef\u3002<\/p>\n\n\n\n

\u6570\u636e\u63d0\u53d6\u662f\u5982\u4f55\u8fdb\u884c\u7684\uff1f<\/strong><\/h2>\n\n\n\n

\u5982\u679c\u60a8\u60f3\u8fdb\u884c\u6570\u636e\u63d0\u53d6\uff0c\u5373\u4ece\u60a8\u7684\u7f51\u7ad9\u4e0a\u63d0\u53d6\u6240\u9700\u7684\u6570\u636e\uff0c\u8bf7\u4f7f\u7528 Screaming Frog\u3002\u8fd9\u4e9b\u4fe1\u606f\u4fdd\u5b58\u5728 Screaming Frog \u7684\u5185\u5b58\u4e2d\uff0c\u60a8\u53ef\u4ee5\u9009\u62e9\u5c06\u626b\u63cf\u7ed3\u679c\u5bfc\u51fa\u5230 Excel \u6216 Google Sheets\uff0c\u4ee5\u4fbf\u8fdb\u4e00\u6b65\u5ba1\u67e5\u3002\u8fd9\u53ef\u80fd\u5305\u62ec\u4e0b\u62c9\u83dc\u5355\u548c\u5185\u90e8\u94fe\u63a5\u7ed3\u6784\u4e2d\u7684\u6570\u636e\u3002<\/p>\n\n\n\n

\u4e3a\u4ec0\u4e48\u6570\u636e\u63d0\u53d6\u81f3\u5173\u91cd\u8981\uff1f<\/strong><\/h2>\n\n\n\n

\u6570\u636e\u63d0\u53d6\u53ef\u8ba9\u60a8\u5feb\u901f\u9ad8\u6548\u5730\u83b7\u53d6\u5927\u91cf\u6570\u636e\u3002\u8fd9\u79cd\u81ea\u52a8\u5316\u53ef\u4e3a\u60a8\u63d0\u4f9b\u7f51\u7edc\u67b6\u6784\u7684\u5373\u65f6\u7ed3\u679c\u3002\u8fd9\u4e00\u8fc7\u7a0b\u53ef\u8282\u7701\u60a8\u7684\u65f6\u95f4\u548c\u8d44\u6e90\uff0c\u540c\u65f6\u4e3a\u60a8\u63d0\u4f9b\u89c4\u5212\u548c\u5236\u5b9a\u641c\u7d22\u5f15\u64ce\u4f18\u5316\u6218\u7565\u6240\u9700\u7684\u5b9d\u8d35\u6570\u636e\u3002Screaming Frog \u662f\u641c\u7d22\u5f15\u64ce\u4f18\u5316\u4eba\u5458\u7684\u9996\u9009\u7f51\u7edc\u6293\u53d6\u5de5\u5177\u548c\u6570\u636e\u63d0\u53d6\u5de5\u5177\u3002\u5b83\u6709\u65e0\u7a77\u65e0\u5c3d\u7684\u9009\u9879\uff1b\u8fd9\u91cc\u6709\u5927\u91cf\u81ea\u5b9a\u4e49\u7f51\u7edc\u6293\u53d6\u8bed\u6cd5\u3002\u8bf7\u67e5\u770b\u4e0b\u9762\u7684\u6559\u7a0b\u3002<\/p>\n\n\n\n

\u5982\u4f55\u4f7f\u7528Screaming Frog\u63d0\u53d6\u81ea\u5b9a\u4e49\u6570\u636e<\/h2>\n\n\n\n

\u5982\u679c\u60a8\u60f3\u8fdb\u884c\u6570\u636e\u63d0\u53d6\uff0c\u5373\u4ece\u60a8\u7684\u7f51\u7ad9\u4e0a\u63d0\u53d6\u6240\u9700\u7684\u6570\u636e\uff0c\u8bf7\u4f7f\u7528 Screaming Frog\u3002\u8fd9\u4e9b\u4fe1\u606f\u4fdd\u5b58\u5728 Screaming Frog \u7684\u5185\u5b58\u4e2d\uff0c\u60a8\u53ef\u4ee5\u9009\u62e9\u5c06\u626b\u63cf\u7ed3\u679c\u5bfc\u51fa\u5230 Excel \u6216 Google Sheets \u4e2d\uff0c\u4ee5\u4fbf\u8fdb\u4e00\u6b65\u67e5\u770b\u3002\u5bf9\u4e8e\u66f4\u9ad8\u7ea7\u7684\u9700\u6c42\uff0c\u60a8\u53ef\u4ee5\u4f7f\u7528\u6b63\u5219\u8868\u8fbe\u5f0f\u4ece HTML \u6216 JavaScript \u6e32\u67d3\u7684\u5185\u5bb9\uff08\u5305\u62ec\u8282\u70b9\u548c\u7247\u6bb5\uff09\u4e2d\u7cbe\u786e\u5b9a\u4f4d\u548c\u63d0\u53d6\u7279\u5b9a\u6a21\u5f0f\u3002<\/p>\n\n\n\n

\u901a\u8fc7\u6574\u5408\u8fd9\u4e9b\u6280\u672f\uff0c\u60a8\u53ef\u4ee5\u6709\u6548\u4f18\u5316\u641c\u7d22\u5f15\u64ce\u4f18\u5316\u7b56\u7565\uff0c\u5229\u7528 Screaming Frog \u7b49\u5de5\u5177\u7684\u5f3a\u5927\u529f\u80fd\uff0c\u751a\u81f3\u5229\u7528 ChatGPT \u7b49\u4eba\u5de5\u667a\u80fd\u6280\u672f\u83b7\u5f97\u66f4\u6df1\u5165\u7684\u89c1\u89e3\u3002<\/p>\n\n\n\n

1.\u5728ScreamingFrog\u4e2d\uff0c\u8f6c\u5230 \u914d\u7f6e>\u81ea\u5b9a\u4e49>\u63d0\u53d6\u3002<\/strong><\/p>\n\n\n\n

\"\u5c16\u53eb\u86d9\u5b9a\u5236\u63d0\u53d6\"
\u5c16\u53eb\u86d9\u5b9a\u5236\u63d0\u53d6<\/figcaption><\/figure>\n\n\n\n

2.\u63a5\u4e0b\u6765\uff0c\u4f60\u5c06\u9700\u8981 +\u6dfb\u52a0<\/strong> \u5e76\u8bbe\u7f6e\u4f60\u7684\u63d0\u53d6\u89c4\u5219\u3002<\/p>\n\n\n\n

\"\u81ea\u5b9a\u4e49\u63d0\u53d6\u8bbe\u7f6e\"
\u4f7f\u7528\u81ea\u5b9a\u4e49\u63d0\u53d6\u6807\u7b7e\u9009\u62e9\u5185\u90e8HTML\u7684\u5143\u7d20<\/figcaption><\/figure>\n\n\n\n

3.\u52a0\u5165\u4e00\u4e2a \u6807\u9898<\/strong>,
4.\u9009\u62e9\u4f60\u662f\u5426\u9700\u8981 CSSPath\u3002 XPath\r\n \r\n \r\n \r\n <\/g>\r\n \r\n \r\n \r\n <\/clippath>\r\n <\/defs><\/svg><\/span><\/a>\uff0c\u6216 Regex<\/use><\/svg><\/span><\/a><\/strong>,
5.\u52a0\u5165\u4f60\u7684 \u641c\u7d22\u529f\u80fd<\/strong>. <\/p>\n\n\n\n

\u5982\u679c\u60a8\u4e0d\u786e\u5b9a\u9700\u8981\u54ea\u79cd\u9009\u62e9\u5668\u6216\u51fd\u6570\uff0c\u8bf7\u67e5\u770b\u4e0b\u9762\u7684\u793a\u4f8b\uff0c\u6216\u4f7f\u7528\u4e0b\u5217\u6587\u4ef6\u4e2d\u7684\u68c0\u67e5\u5143\u7d20\u51fd\u6570 \u8c37\u6b4c\u6d4f\u89c8\u5668\u5f00\u53d1\u5de5\u5177<\/a>.\u60a8\u53ef\u4ee5\u5728 Google Chrome \u6d4f\u89c8\u5668\u4e2d\u4f7f\u7528 \"\u53f3\u952e\u5355\u51fb \"\u6253\u5f00 \"\u5f00\u53d1\u5de5\u5177\"\u3002<\/p>\n\n\n\n

\u4f8b\u5b50\u3002<\/h3>\n\n\n\n

\u4e0b\u9762\u4e3e\u4f8b\u8bf4\u660e \u522e\u524a<\/a> \u83b7\u53d6 Facebook \u50cf\u7d20 ID<\/p>\n\n\n\n

\"Facebook\u50cf\u7d20ID\u63d0\u53d6\"
Facebook\u50cf\u7d20ID\u63d0\u53d6<\/figcaption><\/figure>\n\n\n\n

\u5728 \u7ed3\u679c<\/strong>\u4f60\u53ef\u4ee5\u770b\u5230\uff0c\u6211\u7684\u4e00\u4e2a\u9875\u9762\u7f3a\u5c11\u4e00\u4e2aFacebook Pixel\u3002<\/p>\n\n\n\n

\"\u4e22\u5931\u7684Facebook
\u4e22\u5931\u7684Facebook ID<\/figcaption><\/figure>\n\n\n\n

\u4e0b\u9762\u662f\u9884\u5b9a\u4e49\u7684\u81ea\u5b9a\u4e49\u63d0\u53d6\u6570\u636e\u96c6\uff0c\u53ef\u4ee5\u8ba9\u4f60\u5f00\u59cb\u3002<\/p>\n\n\n\n

\u4f7f\u7528XPath\u7f51\u7edc\u522e\u524a\u7684\u57fa\u672c\u8bed\u6cd5<\/h2>\n\n\n\n
SYNTAX<\/th>\u529f\u80fd\u4ecb\u7ecd<\/th><\/tr><\/thead>
\/\/<\/code><\/td>\u5728\u6587\u4ef6\u7684\u4efb\u4f55\u5730\u65b9\u8fdb\u884c\u641c\u7d22<\/td><\/tr>
\/<\/code><\/td>\u7684\u6839\u90e8\u5185\u641c\u7d22\u3002 \u7f51\u7ad9<\/use><\/svg><\/span><\/a><\/td><\/tr>
@<\/code><\/td>
\u9009\u62e9\u4e00\u4e2a\u5143\u7d20\u7684\u7279\u5b9a\u5c5e\u6027<\/td><\/tr>
*<\/code><\/td>\u901a\u914d\u7b26\u7528\u4e8e\u9009\u62e9\u4efb\u4f55\u5143\u7d20<\/td><\/tr>
[ ]<\/code><\/td>\u627e\u5230\u4e00\u4e2a\u7279\u5b9a\u7684\u5143\u7d20<\/td><\/tr>
.<\/code><\/td>\u6307\u5b9a\u5f53\u524d\u5143\u7d20<\/td><\/tr>
..<\/code><\/td>\u6307\u5b9a\u7236\u5143\u7d20<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n
\n\n\n\n

XPath<\/strong> \u804c\u80fd<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/h1<\/code><\/td>\u63d0\u53d6\u6240\u6709H1\u6807\u7b7e<\/td><\/tr>
\/\/h2[1]<\/code><\/td>\u63d0\u53d6\u7b2c\u4e00\u4e2aH2\u6807\u7b7e<\/td><\/tr>
\/\/h2[2]<\/code><\/td>\u63d0\u53d6\u7b2c\u4e8c\u4e2aH2\u6807\u7b7e<\/td><\/tr>
\/\/div\/p<\/code><\/td>\u63d0\u53d6\u4efb\u4f55 <p> \u5305\u542b\u5728\u4e00\u4e2a <div><\/td><\/tr>
\/\/div[@class='author']<\/code><\/td>\u63d0\u53d6\u4efb\u4f55 <div> \u4e0e\u7c7b "\u4f5c\u8005"<\/td><\/tr>
\/\/p[@class='content']<\/code><\/td>\u63d0\u53d6\u4efb\u4f55 <p> \u4e0e "\u5185\u5bb9 "\u7c7b<\/td><\/tr>
\/\/*[@class='content']<\/code><\/td>\u63d0\u53d6\u4efb\u4f55\u5177\u6709 \"content \"\u7c7b\u7684\u5143\u7d20<\/td><\/tr>
\/\/ul\/li[last()]<\/code><\/td>\u63d0\u53d6
    \u4e2d\u7684\u6700\u540e\u4e00\u4e2a
  • \u3002<\/td><\/tr>
\/\/ol[@class='cat']\/li[1]\u3002<\/code><\/td>\u63d0\u53d6\u7c7b\u4e3a \"cat \"\u7684
    \u4e2d\u7684\u7b2c\u4e00\u4e2a
  1. \u3002<\/td><\/tr>
count(\/\/h2)<\/code><\/td>\u8ba1\u7b97H2\u7684\u6570\u91cf\uff08\u8bbe\u7f6e\u63d0\u53d6\u8fc7\u6ee4\u5668\u4e3a \"\u51fd\u6570\u503c\"\uff09\u3002<\/td><\/tr>
\/a[\u5305\u542b(.,'\u4e86\u89e3\u66f4\u591a')]<\/code><\/td>\u63d0\u53d6\u4efb\u4f55\u542b\u6709 \"\u4e86\u89e3\u66f4\u591a \"\u951a\u6587\u672c\u7684\u94fe\u63a5<\/td><\/tr>
\/a[\u4ee5@title,'written by'\u5f00\u5934]<\/code><\/td>\u63d0\u53d6\u4efb\u4f55\u6807\u9898\u4ee5 \"\u64b0\u5199\u8005 \"\u5f00\u5934\u7684\u94fe\u63a5\u3002<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u5982\u4f55\u63d0\u53d6\u5e38\u89c1\u7684HTML\u5143\u7d20<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/@href<\/code><\/td>\u63d0\u53d6\u6240\u6709\u94fe\u63a5<\/td><\/tr>
\/\/a[starts-with(@href,'mailto')]\/@href<\/code><\/td>\u63d0\u53d6\u4ee5 \"mailto:\"\uff08\u7535\u5b50\u90ae\u4ef6\u5730\u5740\uff09\u5f00\u5934\u7684\u94fe\u63a5\u3002<\/td><\/tr>
\/\/a[starts-with(@href,'tel')]\/@href<\/code><\/td>\u63d0\u53d6\u4ee5 \"tel:\"\uff08\u7535\u8bdd\u53f7\u7801\uff09\u5f00\u5934\u7684\u94fe\u63a5<\/td><\/tr>
\/\/img\/@src<\/code><\/td>\u63d0\u53d6\u6240\u6709\u56fe\u50cf\u6e90URL<\/td><\/tr>
\/\/img[\u5305\u542b(@class,'aligncenter')]\/@src<\/code><\/td>\u63d0\u53d6\u5305\u542b\u7c7b\u540d \"aligncenter \"\u7684\u56fe\u50cf\u7684\u6240\u6709\u56fe\u50cf\u6e90URL\u3002<\/td><\/tr>
\/\/link[@rel='alternate']<\/code><\/td>\u63d0\u53d6rel\u5c5e\u6027\u8bbe\u7f6e\u4e3a \"alternate \"\u7684\u5143\u7d20\u3002<\/td><\/tr>
\/\/@hreflang<\/code><\/td>\u63d0\u53d6\u6240\u6709hreflang\u503c<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u5143\u6807\u7b7e\uff08\u4f7f\u7528\u5185\u90e8HTML\u5143\u7d20\uff09<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/meta[@property='article:published_time']\/@content<\/code><\/td>\u63d0\u53d6\u6587\u7ae0\u53d1\u5e03\u65e5\u671f\uff08WordPress\u7f51\u7ad9\u4e0a\u5e38\u89c1\u7684\u5143\u6807\u7b7e\uff09\u3002<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u5f00\u653e\u56fe\u8c31<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/meta[@property='og:type']\/@content<\/code><\/td>\u63d0\u53d6Open Graph\u7c7b\u578b\u7684\u5bf9\u8c61<\/td><\/tr>
\/\/meta[@property='og:image']\/@content<\/code><\/td>\u63d0\u53d6Open Graph\u7279\u8272\u56fe\u7247\u7684URL<\/td><\/tr>
\/\/meta[@property='og:uped_time']\/@content<\/code><\/td>\u63d0\u53d6Open Graph\u7684\u66f4\u65b0\u65f6\u95f4<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6Twitter\u5361\u7247<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/meta[@name='twitter:card']\/@content<\/code><\/td>\u63d0\u53d6Twitter\u5361\u7684\u7c7b\u578b<\/td><\/tr>
\/\/meta[@name='twitter:title']\/@content<\/code><\/td>\u63d0\u53d6Twitter\u5361\u7247\u7684\u6807\u9898<\/td><\/tr>
\/\/meta[@name='twitter:site']\/@content<\/code><\/td>\u63d0\u53d6Twitter\u5361\u7247\u7ad9\u70b9\u5bf9\u8c61\uff08Twitter\u624b\u67c4\uff09\u3002<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u6a21\u5f0f\u7c7b\u578b<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/*[@itemtype]\/@itemtype<\/code><\/td>\u63d0\u53d6\u4e00\u4e2a\u9875\u9762\u4e0a\u6240\u6709\u7c7b\u578b\u7684\u6a21\u5f0f\u6807\u8bb0<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u9762\u5305\u5c51\u6a21\u5f0f<\/h2>\n\n\n\n

\u8fd9\u91cc\u662f\u4f60\u7528\u6765\u68c0\u67e5\u9762\u5305\u5c51\u7684\u81ea\u5b9a\u4e49\u63d0\u53d6\uff0c\u5728 \u5c16\u53eb\u7684\u9752\u86d9<\/a>.<\/p>\n\n\n\n

XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/*[\u5305\u542b(@itemtype,'BreadcrumbList')]\/*[@itemprop]\/a\/@href<\/code><\/td>\u63d0\u53d6\u6240\u6709\u9762\u5305\u5c51\u94fe\u63a5<\/td><\/tr>
\/\/*[\u5305\u542b\uff08@itemtype,'BreadcrumbList'\uff09]\/*[@itemprop][1]\/a\/@href<\/code><\/td>\u63d0\u53d6\u7b2c\u4e00\u4e2a\u9762\u5305\u5c51\u94fe\u63a5<\/td><\/tr>
\/\/*[\u5305\u542b\uff08@itemtype,'BreadcrumbList'\uff09]\/*[@itemprop]<\/code><\/td>\u63d0\u53d6\u9762\u5305\u5c51\u540d\u79f0\uff08\u8bbe\u7f6e\u63d0\u53d6\u8fc7\u6ee4\u5668\u4e3a \"\u63d0\u53d6\u6587\u672c\"\uff09\u3002<\/td><\/tr>
count(\/\/*[\u5305\u542b(@itemtype,'BreadcrumbList')]\/*[@itemprop])<\/code><\/td>\u8ba1\u7b97\u9762\u5305\u5c51\u5217\u8868\u9879\u76ee\u7684\u6570\u91cf\uff08\u8bbe\u7f6e\u63d0\u53d6\u8fc7\u6ee4\u5668\u4e3a \"\u529f\u80fd\u503c\"\uff09\u3002<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u4ea7\u54c1\u6a21\u5f0f<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/*[@itemprop='name']\/@content<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u540d\u79f0<\/td><\/tr>
\/\/*[@itemprop='description']\/@content<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u63cf\u8ff0<\/td><\/tr>
\/\/*[@itemprop='price']\/@content<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u4ef7\u683c<\/td><\/tr>
\/\/*[@itemprop='priceCurrency']\/@content<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u8d27\u5e01<\/td><\/tr>
\/\/*[@itemprop='\u53ef\u7528\u6027']\/@href<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u7684\u53ef\u7528\u6027<\/td><\/tr>
\/\/*[@itemprop='sku']\/@content<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1SKU<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u5ba1\u67e5\u6a21\u5f0f<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/*[@itemprop='reviewCount']<\/code><\/td>\u63d0\u53d6\u5ba1\u67e5\u6570<\/td><\/tr>
\/\/*[@itemprop='ratingValue']<\/code><\/td>\u63d0\u53d6\u8bc4\u7ea7\u503c<\/td><\/tr>
\/\/*[@itemprop='bestRating']<\/code><\/td>\u63d0\u53d6\u6700\u4f73\u8bc4\u8bba\u8bc4\u7ea7<\/td><\/tr>
\/\/*[@itemprop='\u56de\u987e']\/*[@itemprop='\u540d\u79f0']<\/code><\/td>\u63d0\u53d6\u5ba1\u67e5\u540d\u79f0<\/td><\/tr>
\/\/*[@itemprop='\u8bc4\u8bba']\/*[@itemprop='\u4f5c\u8005']<\/code><\/td>\u6458\u5f55\u8bc4\u8bba\u4f5c\u8005<\/td><\/tr>
\/\/*[@itemprop='review']\/*[@itemprop='datePublished']\/@content<\/code><\/td>\u63d0\u53d6\u8bc4\u8bba\u7684\u53d1\u5e03\u65e5\u671f<\/td><\/tr>
\/\/*[@itemprop='review']\/*[@itemprop='reviewBody']<\/code><\/td>\u63d0\u53d6\u8bc4\u8bba\u7684\u6b63\u6587\u5185\u5bb9<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u672c\u5730\u4f01\u4e1a\u548c\u7ec4\u7ec7\u6a21\u5f0f<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/*[\u5305\u542b(@itemtype,'Organization')]\/*[@itemprop='name']<\/code><\/td>\u63d0\u53d6\u8be5\u7ec4\u7ec7\u7684\u540d\u79f0<\/td><\/tr>
\/\/*[@itemprop='\u5730\u5740']\/*[@itemprop='\u8857\u9053\u5730\u5740']<\/code><\/td>\u63d0\u53d6\u8857\u9053\u5730\u5740<\/td><\/tr>
\/\/*[@itemprop='address']\/*[@itemprop='addressLocality']<\/code><\/td>\u63d0\u53d6\u5730\u5740\u4f4d\u7f6e<\/td><\/tr>
\/\/*[@itemprop='\u5730\u5740']\/*[@itemprop='\u5730\u5740\u533a\u57df']<\/code><\/td>\u63d0\u53d6\u5730\u5740\u533a\u57df<\/td><\/tr>
\/\/*[@itemprop='\u7535\u8bdd']<\/code><\/td>\u63d0\u53d6 \u7535\u8bdd\u53f7\u7801<\/a><\/td><\/tr>
\/\/*[@itemprop='sameAs']\/@href<\/code><\/td>\u63d0\u53d6 \"\u540c\u4e3a \"\u94fe\u63a5<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u6587\u7ae0\u6a21\u5f0f<\/h2>\n\n\n\n
XPATH<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
\/\/*[\u5305\u542b(@itemtype,'Article')]\/*[@itemprop='headline']<\/code><\/td>\u63d0\u53d6\u6587\u7ae0\u7684\u6807\u9898<\/td><\/tr>
\/\/*[@itemprop='author']\/*[@itemprop='name']\/@content<\/code><\/td>\u63d0\u53d6\u4f5c\u8005\u59d3\u540d<\/td><\/tr>
\/\/*[@itemprop='\u51fa\u7248\u5546']\/*[@itemprop='\u59d3\u540d']\/@\u5185\u5bb9<\/code><\/td>\u63d0\u53d6\u51fa\u7248\u5546\u540d\u79f0<\/td><\/tr>
\/\/*[@itemprop='datePublished']\/@content<\/code><\/td>\u6458\u5f55\u51fa\u7248\u65e5\u671f<\/td><\/tr>
\/\/*[@itemprop='dateModified']\/@content<\/code><\/td>\u63d0\u53d6\u4fee\u6539\u65e5\u671f<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n
\n\n\n\n

\u81ea\u5b9a\u4e49\u6570\u636e\u63d0\u53d6\u4e0e Regex<\/strong><\/h2>\n\n\n\n

\u91ce\u751f\u52a8\u7269<\/h3>\n\n\n\n
SYNTAX<\/th>\u529f\u80fd\u4ecb\u7ecd<\/th><\/tr><\/thead>
.<\/code><\/td>\u5339\u914d\u4efb\u4f551\u4e2a\u5b57\u7b26<\/td><\/tr>
*<\/code><\/td>\u5339\u914d\u524d\u9762\u7684\u5b57\u7b260\u6b21\u6216\u66f4\u591a\u6b21<\/td><\/tr>
?<\/code><\/td>\u5339\u914d\u524d\u9762\u7684\u5b57\u7b260\u62161\u6b21<\/td><\/tr>
+<\/code><\/td>\u5339\u914d\u524d\u9762\u7684\u5b57\u7b261\u6b21\u6216\u66f4\u591a\u6b21<\/td><\/tr>
|<\/code><\/td>\u6216<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u951a\u70b9<\/h3>\n\n\n\n
SYNTAX<\/th>\u529f\u80fd\u4ecb\u7ecd<\/th><\/tr><\/thead>
^<\/code><\/td>\u5b57\u7b26\u4e32\u4ece\u540e\u7eed\u7684\u5b57\u7b26\u5f00\u59cb\u3002<\/td><\/tr>
$<\/code><\/td>\u8be5\u5b57\u7b26\u4e32\u4ee5\u524d\u9762\u7684\u5b57\u7b26\u7ed3\u675f\u3002<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u7fa4\u4f53<\/h3>\n\n\n\n
SYNTAX<\/th>\u529f\u80fd\u4ecb\u7ecd<\/th><\/tr><\/thead>
( )<\/code><\/td>\u6309\u7167\u51c6\u786e\u7684\u987a\u5e8f\u5339\u914d\u6240\u9644\u7684\u5b57\u7b26<\/td><\/tr>
[ ]<\/code><\/td>\u4ee5\u4efb\u4f55\u987a\u5e8f\u5339\u914d\u6240\u5305\u56f4\u7684\u5b57\u7b26<\/td><\/tr>
-<\/code><\/td>\u5339\u914d\u6307\u5b9a\u8303\u56f4\u5185\u7684\u4efb\u4f55\u5b57\u7b26<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u9003\u79bb<\/h3>\n\n\n\n
SYNTAX<\/th>\u529f\u80fd\u4ecb\u7ecd<\/th><\/tr><\/thead>
\\<\/code><\/td>\u6309\u5b57\u9762\u610f\u601d\u5904\u7406\u5b57\u7b26\uff0c\u800c\u4e0d\u662f\u4f5c\u4e3aregex\u3002<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

Regex\u81ea\u5b9a\u4e49\u6570\u636e\u63d0\u53d6<\/h2>\n\n\n\n
REGEX<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
[\"'](ua-.*?) [\"']<\/code><\/td>\u63d0\u53d6\u8c37\u6b4c\u5206\u6790\u7684\u8ddf\u8e2aID<\/td><\/tr>
[\"'](G-.*?)[\"']<\/code><\/td>\u63d0\u53d6\u8c37\u6b4c\u5206\u67904\uff08GA4\uff09\u7684\u8ddf\u8e2aID<\/td><\/tr>
[\"'](aw-.*?) [\"']<\/code><\/td>\u63d0\u53d6\u8c37\u6b4c\u5e7f\u544a\u8f6c\u6362ID\u548c\/\u6216\u518d\u8425\u9500\u6807\u7b7e<\/td><\/tr>
[\"'](gtm-.*?)[\"']<\/code><\/td>\u63d0\u53d6\u8c37\u6b4c\u6807\u7b7e\u7ba1\u7406\u5668\u548c\/\u6216\u8c37\u6b4c\u4f18\u5316\u7684ID<\/td><\/tr>
fbq\\([\"']init[\"'], [\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6Facebook Pixel ID<\/td><\/tr>
\\{ti:[\"'](.*?)[\"']}<\/code><\/td>\u63d0\u53d6Bing Ads\u7684UET\u6807\u7b7e<\/td><\/tr>
adroll_adv_id = [\"'](.*?) [\"']<\/code><\/td>\u63d0\u53d6AdRoll\u5e7f\u544a\u5546ID<\/td><\/tr>
adroll_pix_id = [\"'](.*?) [\"']<\/code><\/td>\u63d0\u53d6AdRoll Pixel ID<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u6240\u6709\u6a21\u5f0f\u6807\u8bb0\u548c\u6a21\u5f0f\u7c7b\u578b<\/h2>\n\n\n\n
REGEX<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
[\"']application\/ld\/+json[\"']>(.*?)\/script><\/code><\/td>\u63d0\u53d6\u6240\u6709\u7684JSON-LD\u6a21\u5f0f\u6807\u8bb0<\/td><\/tr>
[\"']@type[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4e00\u4e2a\u9875\u9762\u4e0a\u6240\u6709\u7c7b\u578b\u7684JSON-LD\u6a21\u5f0f\u6807\u8bb0<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u9762\u5305\u5c51\u6a21\u5f0f<\/h3>\n\n\n\n
REGEX<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
[\"']\u9879\u76ee[\"']\u3002*{[\"']@id[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u9762\u5305\u5c51\u94fe\u63a5<\/td><\/tr>
[\"']\u9879\u76ee[\"']\u3002*{[\"']@id[\"']\u3002*[\"'].*?[\"'], *[\"']name[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u9762\u5305\u5c51\u540d\u79f0<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u4ea7\u54c1\u6a21\u5f0f<\/h3>\n\n\n\n
REGEX<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
[\"']@type[\"']\u3002*[\"']Product[\"'].*?[\"']name[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u540d\u79f0<\/td><\/tr>
[\"']@type[\"']\u3002*[\"']Product[\"'].*?[\"']description[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u63cf\u8ff0<\/td><\/tr>
[\"']@type[\"']\u3002*[\"']Product[\"'].*?[\"']price[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u4ef7\u683c<\/td><\/tr>
[\"']@type[\"']\u3002*[\"']Product[\"'].*?[\"']priceCurrency[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u8d27\u5e01<\/td><\/tr>
[\"']@type[\"']\u3002*[\"']Product[\"'].*?[\"']availability[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1\u7684\u53ef\u7528\u6027<\/td><\/tr>
[\"']@type[\"']\u3002*[\"']Product[\"'].*?[\"']sku[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4ea7\u54c1SKU<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u5ba1\u67e5\u6a21\u5f0f<\/h3>\n\n\n\n
REGEX<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
[\"']reviewCount[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u5ba1\u67e5\u6570<\/td><\/tr>
[\"']ratingValue[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u8bc4\u7ea7\u503c<\/td><\/tr>
[\"']bestRating[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u6700\u4f73\u8bc4\u7ea7<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u672c\u5730\u4f01\u4e1a\u548c\u7ec4\u7ec7\u6a21\u5f0f<\/h3>\n\n\n\n
REGEX<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
[\"']@\u7c7b\u578b[\"']\u3002*[\"']Organization[\"'].*?[\"']name[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u7ec4\u7ec7\u540d\u79f0<\/td><\/tr>
[\"']streetAddress[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u8857\u9053\u5730\u5740<\/td><\/tr>
[\"']addressLocality[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u5730\u5740\u4f4d\u7f6e<\/td><\/tr>
[\"']addressRegion[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u5730\u5740\u533a\u57df<\/td><\/tr>
[\"']\u7535\u8bdd[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u7535\u8bdd\u53f7\u7801<\/td><\/tr>
[\"']sameAs[\"']\u3002*\\[(.*?)\\]<\/code><\/td>\u63d0\u53d6 \"\u540c\u4e3a \"\u94fe\u63a5<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u63d0\u53d6\u6587\u7ae0\u6216BlogPosting\u6a21\u5f0f<\/h3>\n\n\n\n
REGEX<\/th>\u8f93\u51fa<\/th><\/tr><\/thead>
[\"']\u5934\u6761[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u6458\u5f55\u6587\u7ae0\u6807\u9898<\/td><\/tr>
[\"']author[\"'].*?[\"']name[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4f5c\u8005\u59d3\u540d<\/td><\/tr>
[\"']publisher[\"'].*?[\"']name[\"']:*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u51fa\u7248\u5546\u540d\u79f0<\/td><\/tr>
[\"']datePublished[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u6458\u5f55\u51fa\u7248\u65e5\u671f<\/td><\/tr>
[\"']dateModified[\"']\u3002*[\"'](.*?)[\"']<\/code><\/td>\u63d0\u53d6\u4fee\u6539\u65e5\u671f<\/td><\/tr><\/tbody>
<\/td><\/td><\/tr><\/tfoot><\/table><\/figure>\n\n\n\n

\u8fd9\u79cd\u53ef\u80fd\u6027\u662f\u65e0\u7a77\u65e0\u5c3d\u7684\uff1b\u5982\u679c\u4f60\u60f3\u5728\u8fd9\u4e2a\u5217\u8868\u4e2d\u52a0\u5165\u4efb\u4f55\u63d0\u53d6\u7269\uff0c\u8bf7\u8ba9\u6211\u77e5\u9053\u3002<\/p>\n\n\n