HTM2TXT v 1.0, Aug. 03, 1996 by Otto Räder

Description:

   HTM2TXT.CMD is a REXX script which removes the HTML tags from
   .HTML files, as used on the World Wide Web, and stores
   the remaining text in an ASCII file.

Group:

   HTM2TXT belongs to group: os2 - textutil

Freeware:

   HTM2TXT may be distributed freely under the following conditions.
   Copyright notices must NOT be removed, all files contained in the file
   inventory below must be distributed together (you may not remove any
   files), and you may not charge for the program.

   If you find the program useful, please send a postcard (a picture
   of the place where you live) to:

       Otto Räder
       Hauptstrasse 61B/13
       A3001 Mauerbach
       ---------------
       Austria

Prerequisites:

   HTM2TXT requires OS/2 and REXX.

   It has been developed and tested under OS/2 Warp;
   there is no intention to port it to other platforms.

Distribution:

   The following files are contained in HTM2TXT1.ZIP:

    HTM2TXT.CMD       96-08-03 the REXX command
    HTM2TXT.ICO       96-07-10 an icon file contributed by Gerard Pinkas, pinkas@en.com
    MAKEOBJ.CMD       96-07-12 a command to create a desktop program object
    README.TXT                 documentation, this file
    FILE_ID.DIZ                Id-file

Installation:

   To install HTM2TXT, just UNZIP the HTM2TXT1.ZIP file and place
   HTM2TXT.CMD in a directory listed in your CONFIG.SYS PATH= statement.

   You may use MAKEOBJ.CMD to create a desktop object for HTM2TXT.CMD.
   Run MAKEOBJ.CMD from the directory where HTM2TXT.CMD and
   HTM2TXT.ICO are installed.

Usage:

     From an OS/2 command line start HTM2TXT:

       htm2txt filename.htm

     Make sure filename.htm is in the current directory.

   or

     Drag and drop a .HTML object onto the HTM2TXT object if you
     have created one with MAKEOBJ.CMD.

   HTM2TXT creates an output file 'filename.txt' and then
   starts an editor to display this file.

   Note: HTM2TXT follows <a href="..."> tags and tries
         to resolve the given link address. If it can be
         accessed, it is included in the .txt file.

   The following statements in HTM2TXT.CMD may be changed to customize
   the result:

     line 32: linemax=72    maximum line length in the output file.
                            Longer text is split into output lines
                            of at most 'linemax' characters.

     line 33: pixlbyt=6     pixels per character. For
                            <td width="nnnPIX">, the column width
                            in tables is computed as nnn/pixlbyt.

     line 34: editor='e'    the name of an ASCII editor used to display
                            the result file. It may be changed to the
                            installation's preferred editor.

                            editor='' causes no editor to be called

     line 45: consts=       a table of variables used to substitute
                            special characters. I could not find an
                            official table on any web site, so this
                            table is a collection of the variables I
                            have seen up to today.
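
   As a rough illustration of what linemax and pixlbyt do, here is a
   small Python sketch. It is not the REXX code in HTM2TXT.CMD; the
   function names wrap_line and column_width are made up for this
   example, and both the break-at-spaces behaviour and the rounding
   down of nnn/pixlbyt are assumptions.

```python
def wrap_line(text, linemax=72):
    """Split text into lines of at most linemax characters.

    Assumption: lines are broken at spaces; a single word longer
    than linemax is cut into linemax-sized pieces.
    """
    lines, current = [], ""
    for word in text.split():
        # Cut up any word that cannot fit on a line by itself.
        while len(word) > linemax:
            if current:
                lines.append(current)
                current = ""
            lines.append(word[:linemax])
            word = word[linemax:]
        candidate = word if not current else current + " " + word
        if len(candidate) <= linemax:
            current = candidate
        else:
            lines.append(current)
            current = word
    if current:
        lines.append(current)
    return lines


def column_width(pixels, pixlbyt=6):
    """Convert a <td width="nnn"> pixel value to a width in characters.

    Assumption: the division nnn/pixlbyt is rounded down.
    """
    return pixels // pixlbyt
```

   Under these assumptions, a table cell declared 120 pixels wide
   would get 120/6 = 20 characters with the default pixlbyt=6, and
   120/8 = 15 characters with pixlbyt=8.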

   You may temporarily override these parameters by adding options
   when starting htm2txt from an OS/2 command line:

     htm2txt filename.html l 80    to set linemax to 80 characters
     htm2txt filename.html p  8    to set pixlbyt to  8 pixels/char
     htm2txt filename.html e te    to set editor  to tinyedit

   These options may appear in any order after the filename:

     htm2txt filename.html e te p 8 l 80
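
   The option syntax above (a filename followed by letter/value
   pairs in any order) could be handled roughly as in the Python
   sketch below. This is only an illustration, not the REXX code in
   HTM2TXT.CMD: the function name parse_options is made up, and the
   case-insensitive option letters and the error handling are
   assumptions.

```python
def parse_options(args, linemax=72, pixlbyt=6, editor="e"):
    """Parse 'filename [l N] [p N] [e NAME]' with options in any order."""
    tokens = args.split()
    if not tokens:
        raise ValueError("missing filename")
    settings = {"filename": tokens[0], "linemax": linemax,
                "pixlbyt": pixlbyt, "editor": editor}
    rest = tokens[1:]
    if len(rest) % 2 != 0:
        raise ValueError("each option letter needs a value")
    # Walk through the remaining tokens as (letter, value) pairs.
    for key, value in zip(rest[0::2], rest[1::2]):
        key = key.lower()  # assumption: option letters ignore case
        if key == "l":
            settings["linemax"] = int(value)
        elif key == "p":
            settings["pixlbyt"] = int(value)
        elif key == "e":
            settings["editor"] = value
        else:
            raise ValueError("unknown option: " + key)
    return settings
```

   Options that are not given keep the defaults from lines 32 to 34
   of the script (linemax=72, pixlbyt=6, editor='e').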

Warranty:

   The program is distributed on an as-is basis.
   It tries to extract as much text as possible;
   however, I am sure there are some special forms
   of tags which I missed.
   Normally such tags are simply ignored.

   There is no guarantee of particular results,
   nor any guarantee against damage to existing files.

   Note: The program will overwrite any file in the current
         directory that has the name of the input file and
         the file extension .TXT, e.g. filename.TXT!
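
   To check beforehand whether a file is at risk, the name of the
   output file can be worked out from the input name. The Python
   sketch below is only an illustration, not part of HTM2TXT; the
   function names output_name and would_overwrite are made up, and
   it assumes the output name is simply the input name with its
   extension replaced by .txt.

```python
import ntpath
import os


def output_name(input_name):
    """Return the assumed output file name for a given input file.

    ntpath is used so that DOS/OS/2 style paths (drive letters,
    backslashes) are split correctly on any platform.
    """
    base = ntpath.splitext(ntpath.basename(input_name))[0]
    return base + ".txt"


def would_overwrite(input_name):
    """True if the assumed output file already exists in the current directory."""
    return os.path.exists(output_name(input_name))
```

   For example, output_name("filename.htm") gives "filename.txt",
   so an existing filename.txt in the current directory should be
   renamed or copied before htm2txt is run.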

Comments:

   Please send comments and recommendations to:

         oraeder@ibm.net
