Java HTML Parsing Primer Tutorial

Java HTML Parsing Primer Tutorial

Java HTML Parsing Primer Tutorial

Here is a tutorial that parses HTML in a Java console application overseen by the Eclipse IDE. For this tutorial, where we parse an HTML url via a parent HTML element of a designated HTML element type and class, and then drill down to display any text elements within that parent HTML element, we needed to get from the brilliant SourceForge, and all its brilliant Open Source contributors, an external jar Java archive file for the HTML parsing HtmlCleaner library. During the tutorial you’ll see one of the advantages of IDEs for Java work … making the use of external jar files easier to organize (than with command line javac/java usage).

You may be interested by the use of command line argument checking code, and the associated Run Configuration settings within the Eclipse Run->Run Configurations… to make use of this possibility, and if it is not used, the interactive Scanner methods to accept interactive input from a Java user.

Link to Java programming source code which you should rename to AnalyzeHtml.java

Really really really weally (yes weally) really really hope you enjoy the tutorial. By the way, a strange thing happened while making this tutorial … lunar coding.

Did you know?

Tim Berners-Lee invented HTML.

If this was interesting you may be interested in this too.

This entry was posted in Eclipse, eLearning, Tutorials and tagged , , , , , , . Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>