Skip to main content
Glossary Term

URL

History - Uniform Resource Locators (URLs) were defined in RFC1738 in 1994 by Tim Berners-Lee and the URI working group of the Internet Engineering Task Force (IETF). - Early collaborators proposed the use of Universal Document Identifiers (UDIs) before settling on URLs. - The term 'universal' was originally preferred over 'uniform' in the expansion of URL, but it was later changed. - Berners-Lee expressed regret at using dots to separate parts of the domain name within URIs and wished he had used slashes throughout. Syntax - Every HTTP URL conforms to the syntax of a generic URI. - The URI generic syntax consists of five components: scheme, authority, path, query, and fragment. - URI schemes should be registered with the Internet Assigned Numbers Authority (IANA). - The authority component can include userinfo, host, and port subcomponents. - The path component consists of path segments separated by a slash and can resemble a file system path. Usage - URLs are commonly used to reference web pages (HTTP/HTTPS). - URLs are also used for file transfer (FTP), email (mailto), database access (JDBC), and other applications. - Most web browsers display the URL of a web page in an address bar. - A typical URL includes a protocol, hostname, and file name. - In http and https URIs, the last part of the path is often named pathinfo and is used to select dynamic content. Internationalized URL - An Internationalized Resource Identifier (IRI) is a form of URL that includes Unicode characters. - The domain name in an IRI is known as an Internationalized Domain Name (IDN). - Web and Internet software automatically convert IDNs into punycode usable by the Domain Name System. - The URL path name can be specified in the user's local writing system and is converted to UTF-8. - Characters not part of the basic URL character set are escaped using percent-encoding. Related Standards - URL is a specific type of Uniform Resource Identifier (URI). - Other related standards include URI and URN. - The World Wide Web relies on the use of URLs to locate resources. - The URL specification is licensed under CC BY 4.0. - Organizations such as whatwg.org are involved in maintaining and developing URL standards.