package.html
1 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN"> 2 <html> 3 <head> 4 <!-- 5 HTMLParser Library $Name: v1_6_20060319 $ - A java-based parser for HTML 6 http://sourceforge.org/projects/htmlparser 7 Copyright (C) 2004 Somik Raha 8 9 Revision Control Information 10 11 $Source: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/package.html,v $ 12 $Author: derrickoswald $ 13 $Date: 2005/04/12 11:27:42 $ 14 $Revision: 1.20 $ 15 16 This library is free software; you can redistribute it and/or 17 modify it under the terms of the GNU Lesser General Public 18 License as published by the Free Software Foundation; either 19 version 2.1 of the License, or (at your option) any later version. 20 21 This library is distributed in the hope that it will be useful, 22 but WITHOUT ANY WARRANTY; without even the implied warranty of 23 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU 24 Lesser General Public License for more details. 25 26 You should have received a copy of the GNU Lesser General Public 27 License along with this library; if not, write to the Free Software 28 Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA 29 --> 30 </head> 31 <body bgcolor="white"> 32 Example applications. 33 <p> 34 <table width="94%" border="0"> 35 <tr> 36 <td valign="top"> 37 <strong>Link Extractor</strong><br> 38 </td> 39 <td> 40 <i>Extract links/mail addresses from a web page.</i><br> 41 42 <a href="javadoc/org/htmlparser/parserapplications/LinkExtractor.html" target="_parent">org.htmlparser.parserapplications.LinkExtractor</a> 43 <pre> 44 <code>bin/linkextractor http://website_url [-maillinks]</code> 45 the optional -maillinks argument causes mailto: links to be printed 46 </pre> 47 </td> 48 </tr> 49 <tr> 50 <td valign="top"> 51 52 <strong>String Extractor</strong><br> 53 </td> 54 <td> 55 <i>Extract text from a web page.</i><br> 56 <a href="javadoc/org/htmlparser/parserapplications/StringExtractor.html" target="_parent">org.htmlparser.parserapplications.StringExtractor</a> 57 <pre> 58 <code>bin/stringextractor http://website_url [-links]</code> 59 60 the optional -links argument causes hyperlinks to be shown within the text 61 </pre> 62 </td> 63 </tr> 64 <tr> 65 <td valign="top"> 66 <strong>Site Capturer</strong><br> 67 </td> 68 <td> 69 70 <i>Save a web site locally.</i><br> 71 <a href="javadoc/org/htmlparser/parserapplications/SiteCapturer.html" target="_parent">org.htmlparser.parserapplications.SiteCapturer</a> 72 <pre> 73 <code>bin/sitecapturer http://source_website /target_directory/ [true|false]</code> 74 75 the optional boolean argument determines whether resources such as images, 76 audio and video are to be captured 77 </pre> 78 </td> 79 </tr> 80 <tr> 81 <td valign="top"> 82 <strong>Wiki Capturer</strong><br> 83 </td> 84 <td> 85 86 <i>Save a wiki locally.</i><br> 87 <a href="javadoc/org/htmlparser/parserapplications/WikiCapturer.html" target="_parent">org.htmlparser.parserapplications.WikiCapturer</a> 88 Subclass of SiteCapturer (see above) that eliminates specific Wiki pages. 89 </td> 90 </tr> 91 </table> 92 <!-- Put @see and @since tags down here. --> 93 94 </body> 95 </html>