<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<html>
<head>
<!--
 HTMLParser Library $Name: v1_6_20060319 $ - A java-based parser for HTML
 http://sourceforge.org/projects/htmlparser
 Copyright (C) 2004 Somik Raha

 Revision Control Information

 $Source: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/package.html,v $
 $Author: derrickoswald $
 $Date: 2005/04/12 11:27:42 $
 $Revision: 1.20 $

 This library is free software; you can redistribute it and/or
 modify it under the terms of the GNU Lesser General Public
 License as published by the Free Software Foundation; either
 version 2.1 of the License, or (at your option) any later version.

 This library is distributed in the hope that it will be useful,
 but WITHOUT ANY WARRANTY; without even the implied warranty of
 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
 Lesser General Public License for more details.

 You should have received a copy of the GNU Lesser General Public
 License along with this library; if not, write to the Free Software
 Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
-->
</head>
<body bgcolor="white">
Example applications.
<p>
<table width="94%" border="0">
  <tr>
    <td valign="top">
    <strong>Link Extractor</strong><br>
    </td>
    <td>
    <i>Extract links/mail addresses from a web page.</i><br>
    <a href="javadoc/org/htmlparser/parserapplications/LinkExtractor.html" target="_parent">org.htmlparser.parserapplications.LinkExtractor</a>
    <pre>
    <code>bin/linkextractor http://website_url [-maillinks]</code>

    The optional -maillinks argument causes mailto: links to be printed.
    </pre>
    </td>
  </tr>
  <tr>
    <td valign="top">
    <strong>String Extractor</strong><br>
    </td>
    <td>
    <i>Extract text from a web page.</i><br>
    <a href="javadoc/org/htmlparser/parserapplications/StringExtractor.html" target="_parent">org.htmlparser.parserapplications.StringExtractor</a>
    <pre>
    <code>bin/stringextractor http://website_url [-links]</code>

    The optional -links argument causes hyperlinks to be shown within the text.
    </pre>
    </td>
  </tr>
  <tr>
    <td valign="top">
    <strong>Site Capturer</strong><br>
    </td>
    <td>
    <i>Save a web site locally.</i><br>
    <a href="javadoc/org/htmlparser/parserapplications/SiteCapturer.html" target="_parent">org.htmlparser.parserapplications.SiteCapturer</a>
    <pre>
    <code>bin/sitecapturer http://source_website /target_directory/ [true|false]</code>

    The optional boolean argument determines whether resources such as images,
    audio, and video are captured.
    </pre>
    </td>
  </tr>
  <tr>
    <td valign="top">
    <strong>Wiki Capturer</strong><br>
    </td>
    <td>
    <i>Save a wiki locally.</i><br>
    <a href="javadoc/org/htmlparser/parserapplications/WikiCapturer.html" target="_parent">org.htmlparser.parserapplications.WikiCapturer</a><br>
    A subclass of SiteCapturer (see above) that excludes specific wiki pages from the capture.
    </td>
  </tr>
</table>
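<p>
As a rough illustration of the distinction the Link Extractor draws between
ordinary links and mailto: links, the following stand-alone sketch pulls
<code>href</code> values out of anchor tags and labels each one. It does not
use the HTMLParser API and is not how LinkExtractor is implemented: the class
<code>LinkSketch</code>, its methods, and the regular expression are
hypothetical names chosen for this example, and a regular expression is only
a simplistic stand-in for a real HTML parser.
</p>

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of the HTMLParser library.
class LinkSketch {
    // Matches the quoted href value of an <a ...> tag (deliberately simplistic).
    private static final Pattern HREF = Pattern.compile(
            "<a\\b[^>]*?\\bhref\\s*=\\s*[\"']([^\"']*)[\"']",
            Pattern.CASE_INSENSITIVE);

    // Collects every href value found in anchor tags, in document order.
    static List<String> hrefs(String html) {
        List<String> found = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            found.add(m.group(1).trim());
        }
        return found;
    }

    // True when the link target uses the mailto: scheme (case-insensitive).
    static boolean isMailLink(String href) {
        return href.regionMatches(true, 0, "mailto:", 0, 7);
    }

    public static void main(String[] args) {
        String html = "<a href=\"http://example.com/\">site</a> "
                + "<a href='mailto:someone@example.com'>mail</a>";
        for (String href : hrefs(html)) {
            System.out.println((isMailLink(href) ? "mail: " : "link: ") + href);
        }
    }
}
```

<p>
Run on the sample string in <code>main</code>, the sketch labels the first
target as an ordinary link and the second as a mail link. Which of the two
kinds the real tool prints for a given command line is governed by the
-maillinks argument described above.
</p>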
<!-- Put @see and @since tags down here. -->

</body>
</html>