ARTICLE

Working with XML

Posted by Dinesh Beniwal Articles | XML in VB.NET November 15, 2009
This article begins with basic definitions of Hypertext Markup Language (HTML), XML, and other Web-related technologies.
 
Reader Level:

THE PROGRAMMING WORLD IS moving more and more toward the Web, and Extensible Markup Language (XML) is an essential part of Web-based programming.

This article begins with basic definitions of Hypertext Markup Language (HTML), XML, and other Web-related technologies in coming articles. Then you'll take a look at the .NET Framework Library namespaces and classes that provide XML functionality in the .NET Framework.

I'll explain how to read, write and navigate XML documents using XML and Document Object Model (DOM) .NET classes. I'll also discuss XML transformations. This article also covers the
relationship between ADO.NET and XML and shows how to mix them up and use rich ADO.NET database components to display and manipulate XML data. At the end of this article I'll cover the XPathNavigator class, which you can use to navigate through XML documents.

Defining XML - Related Terminology

The ADO.NET and XML.NET Framework Application Programming Interface (API) combination provides a unified way to work with XML in the Microsoft .NET Framework. There are two ways to represent data using XML: in a tagged-text format metalanguage similar to HTML and in a relational table format. You use ADO .NET to access relational table formats. You would use DOM to access the text format.

Before talking about the role of XML in the .NET Framework and how to work with it, it's important you understand the basic building blocks of XML and its related terminology. You'll learn the basic definitions of Standard Generalized Markup Language (SGML) and HTML in the following sections. If you're already familiar with these languages, you can skip to the "XML Overview" section.

Standard Generalized markup Language (SGML)

In 1986, Standard Generalized Markup Language (SGML) because the international standards for representing electronic documents in a unified way. SGML provides a standard format for designing your own markup schemes. Markup is a way to represent some information about data.

Later Hypertext Markup Language (HTML) became the international standard for representing documents on the Web in a unified way.

Hyper text Markup Language (HTML)

The HTML file format is text format that contains, rather heavily. Markup tags. A tag is a section of a program that starts with < and ends with > such as <name>. (An element consists of a pair of tags, starting with <name> and ending with </name>). The language defines all of the markup tags. All browsers support HTML tags, which tell a browser how to display the text of an HTML document. You can create an HTML file using a simple text editor such as Notepad. After typing text in a text editor, you save the file with an.htm or .html extension.

Note: An HTML document is also called HTML pages or HTML file.

Listing 6-1 shows an example of an HTML file, type the following in a text editor, and save it myfile.htm.

Listing 6-1. A simple HTML file


<
html>
<
head>
    <title>A Test HTML Page </title>

</
head>
<
body>
    Here is the body part.

</
body>
</
html>

If you view this field in a browser, you'll see the text Here is the body part. In Listing 6-1, your HTML file starts with the <html> tag and ends with the </html> tag. The <html> tag tells a browser that this is the starting point of an HTML document. The </html> tag tells a browser that this is the ending point of an HTML documents. These tags are required in all HTML documents. The <head> tag is header information of a document and is not displayed in the browser. The <body> and</body> tags, which are required, makeup the main content of a document. As you can see, all tags ends with a</> tag.

Note: HTML tags are not case sensitive. However, the World Wide Web Consortium (W3C) recommends using lowercase tags in HTML4. The next generation of HTML, XHTML, doesn't support uppercase tags. (The W3C promotes the web worldwide and makes it more it more useful. You can find more information on the W3C at http://www.w3c.org.)

Tags can have attributes, which provide additional information about the tags. Attributes are part of the starting tag. For example:

<table border ="0">

In this example the <table> tag has an attribute border and its value is 0. This value applies to the entire <table> tag, ending with the </table> tag. Table 6-1 describes some common HTML tags.

Table 6-1: Common HTML Tags

TAG

DESCRIPTION

<html>

Indicates start and end of an HTML document

<title>

Contains the title of the page

<body>

Contains the main content, or body, of the page

<h1...h6>

Creates headings (from level 1 to 6)

<p>

Starts a new paragraph

<br>

Insert a single line break

<hr>

Defines a horizontal rule

<!-->

Defines a comment tag in a document

<b>

Defines bold text

<i>

Defines italic text

<strong>

Defines strong text

<table>

Defines a table

<tr>

Defines a row of a table

<td>

Defines a cell of a table row

<font>

Defines a font name and size


There are comes tags beyond those described in table 6-1. In fact the W3C's HTML 4 specification is quite extensive. However, discussing all of the HTML tags is beyond the scope of this article. Before moving to the next topic, you'll take a look at one more HTML example using the tags discussed in the table. Listing 6-2 shows you another HTML document example.

Listing 6-2: HTML tag their usage


<
html>
<
head>
    <title>A Test HTML Page</title>

</
head>
<!- -
This is a comment - ->
<
body>
    <h1>
        Heading 1</h1>
    <h2>
        Heading 2</h2>
    <p>
        <b><i><font size="4">Bold and Italic Text. </font></i></b>
    </p>
    <table border="1" width="43%">
        <tr>
            <td width="50%">
                Row1, Column1
            </td>
            <td width="50%">
                Row1, column2
            </td>
        </tr>
        <tr>
            <td width="50%">
                Row2, Column1
            </td>
            <td width="50%">
                Row2, Column2
            </td>
        </tr>
    </table>

</
body>
</
html>

Note:
In Listing 6-2, the <font> and <td> tags contain size and width attributes, respectively. The size attribute tells the browser to display the size of the font, which is 4 in this example, and the width attribute tells the browser to display the table cell as 50 percent of the browser window.

Login to add your contents and source code to this article
share this article :
post comment
 
Team Foundation Server Hosting
Become a Sponsor
PREMIUM SPONSORS
  • ceTE software specializes in components for dynamic PDF generation and manipulation. The DynamicPDF™ product line allows you to dynamically generate PDF documents, merge PDF documents and new content to existing PDF documents from within your applications.
    Finally – a virtual platform that delivers next-generation Windows Server 2008 Hyper-V virtualization technology from a managed hosting partner you can truly depend on. Visit www.maximumasp.com/max for a FREE 30 day trial. Hurry offer ends soon. Climb aboard the MaxV platform and take advantage of High Availability, Intelligent Monitoring, Recurrent Backups, and Scalability – with no hassle or hidden fees. As a managed hosting partner focused solely on Microsoft technologies since 2000, MaximumASP is uniquely qualified to provide the superior support that our business is built on. Unparalleled expertise with Microsoft technologies lead to working directly with Microsoft as first to offer IIS 7 and SQL 2008 betas in a hosted environment; partnering in the Go Live Program for Hyper-V; and product co-launches built on WS 2008 with Hyper-V technology.
6 Months Free & No Setup Fees ASP.NET Hosting!
Become a Sponsor