HTML parse unit v1.0
Delphi 3.0, 4.0, 5.0

Register Now ( $19) and you get full source code of HTML Parser v2.0



This unit allows you to parse HTML code, extract HTML elements and NAME=Value pairs. Unit contains THtmlParser class and 3 functions

THtmlParser = class( TPersistent)
Properties
BaseURL need to expand HTML reference values


Public procedures
Clear clears buffer
LoadFromStream load buffer from stream
Tags Get HTML tags list, adds to List tags with NAME=VALUE elements
Parameters Add to List NAME=VALUE pairs
SelectedParameters Add to ParList NAME=VALUE pairs where NAME=NameStr,  if Pairs is false then adds only VALUE string
HRefParameters Expand HTML references using BaseUrl
Unit functions
UrlType  Return specific URL type 
TUrlType = ( utUnknow, utHtmlLink, utText, utImage, utOther);
SplitURL Split URL string in to it components: Protocol, Domain, Port, Directory, Document
ExpandURL Expands URL depends on BaseURL


Demo program shows you THtmlParser and public procedures usage
 
Press retrieve button and load source HTML code into memo
Extract tags from HTML source with NAME=VALUE elements
Study NAME=VALUE elements
Is there references to other documents?
Unit functions


Components, Programs, units  Jump center