nutch中的 nutch-site.xml 配置

前端之家收集整理的这篇文章主要介绍了nutch中的 nutch-site.xml 配置前端之家小编觉得挺不错的,现在分享给大家,也给大家做个参考。
  1. <?xmlversion="1.0"?>
  2. <?xml-stylesheettype="text/xsl"href="configuration.xsl"?>
  3.  
  4. <!--Putsite-specificpropertyoverridesinthisfile.-->
  5.  
  6. <configuration>
  7.  
  8. <property>
  9. <name>plugin.folders</name>
  10. <value>./src/plugin</value>
  11. <description>Directorieswherenutchpluginsarelocated.Each
  12. elementmaybearelativeorabsolutepath.Ifabsolute,itisused
  13. asis.Ifrelative,itissearchedforontheclasspath.</description>
  14. </property>
  15. <property>
  16. <name>http.agent.name</name>
  17. <value>YourNutchSpider</value>
  18. </property>
  19. <property>
  20. <name>http.accept.language</name>
  21. <value>ja-jp,en-us,en-gb,en;q=0.7,*;q=0.3</value>
  22. <description>ValueoftheAccept-Languagerequestheaderfield.
  23. Thisallowsselectingnon-Englishlanguageasdefaultonetoretrieve.
  24. Itisausefulsettingforsearchenginesbuildforcertainnationalgroup.</description>
  25. </property>
  26. <property>
  27. <name>storage.data.store.class</name>
  28. <value>org.apache.gora.sql.store.sqlStore</value>
  29. <description>TheGoraDataStoreclassforstoringandretrievingdata.
  30. Currentlythefollowingstoresareavailable:.</description>
  31. </property>
  32. <property>
  33. <name>parser.character.encoding.default</name>
  34. <value>utf-8</value>
  35. <description>Thecharacterencodingtofallbacktowhennootherinformation
  36. isavailable</description>
  37. </property>
  38. <property>
  39. <name>generate.batch.id</name>
  40. <value>*</value>
  41. </property>
  42. </configuration>

猜你在找的XML相关文章