INTERNET-DRAFT                                             Eric A. Hall 
  Document: draft-hall-dns-data-00.txt                           May 2003 
  Expires: December, 2003                                                 
  Category: Informational                                                 
      
                   Considerations for DNS Resource Records 
      
      
     Status of this Memo 
      
     This document is an Internet-Draft and is in full conformance with 
     all provisions of Section 10 of RFC 2026. 
      
     Internet-Drafts are working documents of the Internet Engineering 
     Task Force (IETF), its areas, and its working groups. Note that 
     other groups may also distribute working documents as Internet-
     Drafts. 
      
     Internet-Drafts are draft documents valid for a maximum of six 
     months and may be updated, replaced, or obsoleted by other 
     documents at any time. It is inappropriate to use Internet-Drafts 
     as reference material or to cite them other than as "work in 
     progress." 
      
     The list of current Internet-Drafts can be accessed at 
     http://www.ietf.org/ietf/1id-abstracts.txt 
      
     The list of Internet-Draft Shadow Directories can be accessed at 
     http://www.ietf.org/shadow.html. 
      
      
     Copyright Notice 
      
     Copyright (C) The Internet Society (2003).  All Rights Reserved. 
      
      
     Abstract 
      
     This document discusses some common issues that should be taken 
     into consideration whenever a new service proposes to extend the 
     Domain Name System (DNS). 
      
   
   
  Internet Draft        draft-hall-dns-data-00.txt            May 2003 
   
   
      
     Table of Contents 
      
     1.   Introduction
     2.   Prerequisites and Terminology
     3.   DNS Architectural Principles
       3.1.  Resource Records
       3.2.  Hierarchical Partitioning
       3.3.  Minimalist Messages
       3.4.  Built-In Record Caching
     4.   Inherent Design Limitations
       4.1.  Domain Name Length
       4.2.  Ambiguity
       4.3.  Incomplete Answer Sets
       4.4.  Lookups Only
       4.5.  UDP and TCP Restriction
       4.6.  Compression
       4.7.  Cache Overflow
       4.8.  Cache Lag
       4.9.  World-Readable Data
     5.   Design Conclusion
     6.   Going Standards-Track
     7.   Security Considerations
     8.   IANA Considerations
     9.   Author's Address
     10.  Normative References
     11.  Acknowledgments
     12.  Full Copyright Statement
      
  1.      Introduction 
      
     In terms of deployment, the Domain Name System (DNS) [STD13] is an 
     extremely successful network service, having perhaps the widest 
     installed base and usage of any Internet service. Unfortunately, 
     this omnipresence makes DNS a favorite target for well-intentioned 
     but often-misguided efforts to extend the service into roles it is 
     unsuited for, owing to its specialized nature. This document 
     attempts to itemize the issues which stand in the way of such 
     expansion so that future developers and planners can be made aware 
     of the limitations early in the development cycle. 
      
     Note that this document does not define any formal rules or 
     restrictions of any kind. Instead, the sole purpose of this 
     document is to itemize the common reasons why various extension 
     efforts have been rejected by the DNS community in the past, and 
     why other efforts may be rejected in the future. It is entirely 
   
  Hall                  I-D Expires: December 2003             [page 2] 
     possible for a usage model to be embraced by the DNS community 
     even though all of the principles listed within this document are 
     violated (although it is extremely unlikely), and as such, this 
     document should not be considered as a governing device of any 
     kind. Instead, this document should only be viewed as a planning 
     aid for developers and planners to use when considering the 
     creation of new uses for the DNS. 
      
  2.      Prerequisites and Terminology 
      
     Readers of this document are expected to be familiar with the 
     following specifications: 
      
          [RFC1034]     Mockapetris, P. "Domain names - concepts and 
                         facilities", STD 13, RFC 1034, November 1987. 
      
          [RFC1035]     Mockapetris, P. "Domain names - implementation 
                         and specification", STD 13, RFC 1035, November 
                         1987. 
      
          [RFC1123]     Braden, R. "Requirements for Internet Hosts - 
                         Application and Support", STD 3, RFC 1123, 
                         October 1989. 
      
          [RFC2181]     Elz, R., and Bush, R. "Clarifications to the 
                         DNS Specification", RFC 2181, July 1997. 
      
     The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL 
     NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" 
     in this document are to be interpreted as described in RFC 2119. 
      
      
  3.      DNS Architectural Principles 
      
     The current collection of DNS specifications defines a lightweight 
     lookup service which provides anonymous access to structured 
     information about named entries from distributed database 
     partitions ("zones"). The service is specifically optimized for 
     "lookup by name" datagram transactions, distributed caches of 
     previous lookup answer sets, and non-authenticated access. 
      
  3.1.    Resource Records 
      
     All data stored in DNS uses a common record format, consisting of 
     six common fields (although one of these fields is a generic 
     "data" field which varies in size and shape according to the type 
     of data being provided). Four of these fields ("domain name", 
     "type", "class" and "data") provide attributes which collectively 
     form a unique identifier for a piece of data. Any three of these 
     four fields may be identical across multiple resource records; for 
     example, multiple resource records may exist with the same domain 
     name, type and class, but they must have different data values in 
     order to represent unique records within the global DNS. 
      
     For the purposes of this document, the most important of these 
     fields is the domain name field, which provides a non-unique 
     identifier for every record in the database. All queries must 
     explicitly identify the domain name of the entry they are looking 
     for, and also carry type and class values, either of which may be 
     a wildcard. If a query results in multiple matches, then all of 
     the matching records must be returned. 
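
     As a rough illustration of the record model described above, the 
     following Python sketch represents a resource record and a 
     lookup by name, type and class. The class name, field names and 
     sample data are hypothetical and are not taken from any DNS 
     specification or implementation. 

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ResourceRecord:
    name: str      # owner domain name
    rtype: str     # record type, e.g. "A" or "TXT"
    rclass: str    # network class, e.g. "IN"
    ttl: int       # time-to-live in seconds
    rdata: bytes   # type-specific data

    @property
    def rdlength(self) -> int:
        # The sixth field (data length) is implied by the data itself.
        return len(self.rdata)

    def identity(self) -> tuple:
        # The four fields that together identify a unique record.
        return (self.name.lower(), self.rtype, self.rclass, self.rdata)


def lookup(records, name, rtype, rclass="IN"):
    """Return every record matching the three query qualifiers."""
    key = (name.lower(), rtype, rclass)
    return [r for r in records
            if (r.name.lower(), r.rtype, r.rclass) == key]


records = [
    ResourceRecord("www.example.com", "A", "IN", 3600, bytes([192, 0, 2, 1])),
    ResourceRecord("www.example.com", "A", "IN", 3600, bytes([192, 0, 2, 2])),
    ResourceRecord("www.example.com", "TXT", "IN", 3600, b"\x05hello"),
]

answers = lookup(records, "WWW.example.com", "A")
```

     In this sketch the lookup returns both address records, since 
     they differ only in their data field and the query has no way to 
     name one of them individually. 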
      
  3.2.    Hierarchical Partitioning 
      
     From a high-level perspective, the DNS database is distributed 
     across multiple partitions called "zones", each of which has 
     ownership of a specific subset of domain names. Zones are linked 
     in a hierarchical tree, with the top-level zones having zones 
     directly beneath them, and with some of those having additional 
     subordinate zones, and so forth. Although the zones are structured 
     in a hierarchical tree, each zone acts as an independent entity, 
     and is only concerned with the records that it controls directly. 
      
     The hierarchical partitioning structure is traversed whenever the 
     DNS protocol needs to locate the zone which is authoritative for a 
     named resource record. When a resolver asks for the resource 
     records associated with a specific domain name, the zone hierarchy 
     is followed until either an answer or an error is returned. In 
     this regard, the domain name of a resource record provides a 
     lookup key which is used by the protocol to navigate the zone 
     structure itself. 
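
     The way a domain name doubles as a navigation key can be 
     sketched as follows. This is a simplification: it assumes a 
     hypothetical global set of zone names, whereas a real resolver 
     discovers zones one referral at a time. The function names are 
     illustrative only. 

```python
def labels(name: str) -> list:
    """Split a domain name into lowercase labels, ignoring a trailing dot."""
    name = name.rstrip(".").lower()
    return name.split(".") if name else []


def closest_zone(name: str, zones: set) -> str:
    """Walk down from the root and return the deepest zone enclosing name."""
    parts = labels(name)
    best = "."                        # the root zone encloses every name
    # Try successively longer suffixes: "com", "example.com", and so on.
    for i in range(len(parts) - 1, -1, -1):
        candidate = ".".join(parts[i:])
        if candidate in zones:
            best = candidate
    return best


# Hypothetical set of zones, for illustration only.
zones = {".", "com", "example.com", "eng.example.com"}

deep = closest_zone("www.eng.example.com", zones)
shallow = closest_zone("www.example.com.", zones)
unknown = closest_zone("host.example.org", zones)
```

     Here the first name resolves to the "eng.example.com" zone, the 
     second to "example.com", and the third falls back to the root, 
     since no more specific zone is known. 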
      
  3.3.    Minimalist Messages 
      
     The DNS protocol uses a binary message format which is designed 
     specifically for lookup transactions. There are very few spurious 
     bits or fields in the DNS message (there is no "version" field, 
     for example). Among these optimizations are protocol-specific 
     compression techniques which reduce message sizes, and the 
     preferential use of UDP datagrams for the lookup transactions. 
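
     To give a sense of how compact the format is, the following 
     sketch builds a minimal query message by hand, following the 
     layout in RFC 1035. The function name and the message 
     identifier are arbitrary choices made for this example. 

```python
import struct


def build_query(name: str, qtype: int = 1, qclass: int = 1,
                msg_id: int = 0x1234) -> bytes:
    """Build a minimal DNS query (type 1 is A, class 1 is IN)."""
    flags = 0x0100                     # standard query, recursion desired
    # Header: ID, flags, and four section counts, each 16 bits wide.
    header = struct.pack("!HHHHHH", msg_id, flags, 1, 0, 0, 0)
    # Each label is preceded by a single length octet.
    qname = b"".join(
        bytes([len(label)]) + label.encode("ascii")
        for label in name.rstrip(".").split(".")
    ) + b"\x00"                        # zero-length root label ends the name
    question = qname + struct.pack("!HH", qtype, qclass)
    return header + question


query = build_query("www.example.com")
print(len(query))
```

     The complete query for "www.example.com" occupies 33 octets: a 
     12-octet header, a 17-octet name, and 4 octets of type and 
     class. 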
      
   
  3.4.    Built-In Record Caching 
      
     Further contributing to the lookup-centric design objective, DNS 
     resolvers and servers are allowed to cache resource records that 
     they have discovered, so that subsequent queries for the same 
     data may be answered without having to reissue a complex query. 
      
  4.      Inherent Design Limitations 
      
     As a result of the highly-optimized lookup model, DNS has several 
     critical built-in limitations. For example, DNS does not provide 
     any functions to "search by value", nor does it provide 
     mechanisms for cache overrides, user authentication, access 
     control, or most of the other services that are typically 
     associated with richer (and slower) distributed directory or 
     database services. 
      
     Although DNS could be extended to accommodate some of these 
     usages, such an effort would require a significant amount of 
     design effort, and would likely require a complete redeployment of 
     the associated software agents. Furthermore, there is a 
     significant danger of overloading DNS with excessive features and 
     data such that the service itself would be incapable of performing 
     lightweight lookups for named entries quickly and efficiently. 
      
  4.1.    Domain Name Length 
      
     Domain names are restricted to a maximum length of 255 octets, 
     and each label within a name to 63 octets. Since a domain name is 
     the primary identifier for a resource record -- and since the 
     domain name of a record also identifies the zone where a record 
     is stored -- the length of a domain name can be a significant 
     restriction. 
      
     For example, the owner name of a resource record in a zone that 
     is nested several layers deep may have to use significantly 
     shorter labels than the name of the same kind of resource record 
     in a top-level zone in order to comply with the length 
     restriction. As a result, data models which require 
     application-specific labels or sequences can be problematic for 
     some users and should generally be avoided. 
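
     The effect of zone depth on the available name space can be 
     checked with a small calculation. The sketch below measures a 
     name as it is encoded in a message (one length octet per label, 
     plus a terminating root label) against the limits of RFC 1035. 
     The helper names and the sample prefix are hypothetical. 

```python
MAX_NAME_OCTETS = 255     # limit on an encoded domain name
MAX_LABEL_OCTETS = 63     # limit on a single label


def wire_length(name: str) -> int:
    """Octets needed to encode name, including the root label."""
    parts = [p for p in name.rstrip(".").split(".") if p]
    for p in parts:
        if len(p.encode("ascii")) > MAX_LABEL_OCTETS:
            raise ValueError(f"label too long: {p!r}")
    return sum(1 + len(p.encode("ascii")) for p in parts) + 1


def fits(owner_prefix: str, zone: str) -> bool:
    """True if owner_prefix placed under zone stays within the limit."""
    return wire_length(f"{owner_prefix}.{zone}") <= MAX_NAME_OCTETS


# A four-label application-specific prefix of 60 characters per label.
prefix = ".".join(["x" * 60] * 4)

under_tld = fits(prefix, "com")
under_deep_zone = fits(prefix, "dept.region.example.com")
```

     With these sample values the prefix fits beneath "com" (249 
     octets in total) but not beneath "dept.region.example.com" (269 
     octets), even though the application data is identical. 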
      
  4.2.    Ambiguity 
      
     Although resource records provide six common fields, only three of 
     these fields can be specified in a lookup query (domain name, 
     record type, and network class). However, if multiple resource 
     records exist with identical values for these fields (but with 
     different values in the data field), then all of those records 
     will be returned. As such, it is not possible to explicitly 
     request an exact resource record from among a set, unless only one 
     instance of that record type exists at that domain name. 
      
     However, it is not possible to guarantee that a particular 
     resource record type will only exist in the singular form at any 
     given time. Although it is possible to demand that administrators 
     "MUST NOT" enter a particular resource record more than once for 
     any domain name, such demands are at the whims of the systems in 
     the query path, and are generally unenforceable. 
      
     In short, it is not possible to guarantee that a newly-defined 
     resource record will only exist in the singular form. Data models 
     which depend on singular instances of a particular record should 
     be designed with this issue in mind. 
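
     One way an application might defend itself is to accept that an 
     answer set can hold several values and to choose among them 
     deterministically. The following sketch is only one possible 
     policy; the function name and the tie-break rule (lowest value 
     wins) are assumptions made for illustration. 

```python
def select_one(rdata_values):
    """Pick one value from an answer set that was expected to be singular.

    Returns (value, was_ambiguous). Choosing the lowest value is an
    arbitrary policy, used here only because it is deterministic.
    """
    values = sorted(set(rdata_values))
    if not values:
        raise LookupError("empty answer set")
    return values[0], len(values) > 1


value, ambiguous = select_one([b"server-b", b"server-a"])
```

     An application using such a helper can log or reject ambiguous 
     answers instead of silently depending on the order in which 
     records happen to arrive. 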
      
  4.3.    Incomplete Answer Sets 
      
     Just as it is not possible to extract a single resource record 
     from a set, it is not always possible to be sure that you will 
     receive all of the resource records in a set. Specifically, the 
     original DNS specifications allowed each resource record in a set 
     to have different time-to-live values, and this allowed (in 
     theory) each record in the set to be aged out of a cache at 
     different times. Furthermore, there have been some bugs in some 
     implementations which resulted in incomplete answer sets being 
     sent and subsequently cached by other nodes. 
      
     Although these problems have mostly been addressed over time, it 
     is still not possible to guarantee with absolute certainty that 
     all of the records in a set will always be returned. Data models 
     which depend on spreading answer data over multiple resource 
     records in a set should be designed with this in mind. 
      
  4.4.    Lookups Only 
      
     DNS currently only provides a lookup query, using the domain name 
     of the query as an index value. DNS does not provide any queries 
     which would allow a resolver to search all of the resource records 
     in the entire distributed database for a data value, but instead 
     only provides lookup queries which match against the three 
     qualifier fields. Although the original DNS specifications did 
     provide a mechanism to search a specific server for matching 
     data-values (the inverse query), this feature was never widely 
     deployed, and the capability has since been deprecated. 
      
     In theory, it would be possible to create a super-index of all 
     zones in the entire distributed database and search against that 
     index, although nobody has built such an index as yet. 
      
     Regardless, applications must be aware that all queries use the 
     domain name as a lookup key, and it is not possible to search for 
     resource records by their data-values. 
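
     The difference between a lookup and a search can be sketched 
     with ordinary dictionaries. The zone contents below are 
     invented, and the value index is a hypothetical construct: 
     building it requires reading every record of every zone, which 
     is exactly what the DNS protocol does not offer. 

```python
from collections import defaultdict

# Hypothetical zone contents: {zone: {(name, type): [data values]}}
zones = {
    "example.com": {("www.example.com", "A"): ["192.0.2.1"],
                    ("ftp.example.com", "A"): ["192.0.2.1"]},
    "example.net": {("www.example.net", "A"): ["192.0.2.9"]},
}


def lookup(name, rtype):
    """DNS-style lookup: the name and type form the key."""
    for zone_records in zones.values():
        if (name, rtype) in zone_records:
            return zone_records[(name, rtype)]
    return []


def build_value_index(all_zones):
    """Invert every zone into a map from data value to owner names."""
    index = defaultdict(set)
    for zone_records in all_zones.values():
        for (name, rtype), values in zone_records.items():
            for value in values:
                index[(rtype, value)].add(name)
    return index


index = build_value_index(zones)
owners = index[("A", "192.0.2.1")]
```

     The lookup needs only the key, while the search for every name 
     holding the address 192.0.2.1 is possible only after the whole 
     data set has been copied and inverted. 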
      
  4.5.    UDP and TCP Restriction 
      
     DNS messages which are sent over UDP have a maximum message size 
     of 512 bytes (unless both ends support the EDNS0 extension 
     defined in RFC 2671). If a lookup results in a response message 
     that exceeds this size, then the query process must be restarted 
     using TCP. However, the two-byte length field that prefixes each 
     DNS message sent over TCP limits those messages to a maximum 
     size of 65,535 bytes. Answer data that exceeds this threshold 
     cannot be retrieved using DNS at all. In short, UDP overflows 
     penalize performance, while TCP overflows cause the lookup 
     process to fail entirely. Furthermore, not all servers support 
     TCP, and in those cases, UDP messages which overflow the 512 
     byte limit will also be fatal. 
      
     In those cases where falling back to TCP works as expected, there 
     can be additional penalties apart from the longer setup time. For 
     example, TCP session management typically consumes more resources 
     than UDP datagrams, significantly limiting the number of queries 
     which a server can process at any given time. 
      
     For all of these reasons, planners and developers are strongly 
     encouraged to limit resource record data to sizes that will not 
     cause UDP overflow. In those cases where this is unavoidable, they 
     should be prepared for a variety of problems, including 
     performance issues and outright failure. 
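
     A planner can estimate in advance whether a proposed record 
     design is likely to overflow. The sketch below is a rough 
     budget, not an exact encoder: it assumes every answer has the 
     same data length and that each owner name is compressed to a 
     two-octet pointer. All names in the code are hypothetical. 

```python
UDP_LIMIT = 512        # classic maximum size of a DNS message over UDP
TCP_LIMIT = 65535      # maximum size of a DNS message over TCP
HEADER_LEN = 12        # fixed DNS message header
RR_FIXED_LEN = 12      # compressed owner name (2) + type (2) + class (2)
                       # + TTL (4) + data length (2)


def transport_for(response_size, server_supports_tcp=True):
    """Classify the outcome for a response of the given size."""
    if response_size <= UDP_LIMIT:
        return "udp"
    if response_size <= TCP_LIMIT and server_supports_tcp:
        return "tcp-retry"
    return "fail"


def max_answers_udp(question_len, rdata_len):
    """Estimate how many equally sized answers fit in one UDP response."""
    room = UDP_LIMIT - HEADER_LEN - question_len
    return max(room // (RR_FIXED_LEN + rdata_len), 0)


small = max_answers_udp(question_len=21, rdata_len=4)
large = max_answers_udp(question_len=21, rdata_len=200)
```

     Under these assumptions, a question of 21 octets leaves room for 
     29 answers carrying 4 octets of data each, but for only 2 
     answers carrying 200 octets each. 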
      
  4.6.    Compression 
      
     The DNS specifications provide a compression mechanism which can 
     be used to substitute label sequences with pointers to previous 
     occurrences of those sequences. However, this mechanism only works 
     with well-known resource records. New resource record types cannot 
     make use of the pointer mechanism, since caches will not be aware 
     of the resource record's data-structure, and therefore will not be 
     able to tell that the data value is a domain name pointer which is 
     supposed to reference some other sequence of labels. 
   
     This is an especially important consideration to keep in mind when 
     considering large data structures; while it is tempting to believe 
     that domain names embedded in the data of a new resource record 
     type can be compressed, this simply is not true. 
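
     To show what is being given up, the following sketch implements 
     the pointer mechanism of RFC 1035 for names in positions that 
     every implementation understands. The function name and the 
     bookkeeping dictionary are assumptions of this example, not 
     part of any particular implementation. 

```python
def encode_name(name: str, message: bytearray, offsets: dict) -> None:
    """Append name to message, replacing any previously seen suffix
    with a two-octet compression pointer."""
    parts = name.rstrip(".").lower().split(".")
    for i in range(len(parts)):
        suffix = ".".join(parts[i:])
        if suffix in offsets:
            # The two high bits set mark the field as a pointer.
            pointer = 0xC000 | offsets[suffix]
            message += pointer.to_bytes(2, "big")
            return
        if len(message) < 0x4000:          # pointers hold 14-bit offsets
            offsets[suffix] = len(message)
        label = parts[i].encode("ascii")
        message += bytes([len(label)]) + label
    message += b"\x00"                     # root label ends the name


msg = bytearray(12)                        # placeholder for the header
seen = {}
encode_name("www.example.com", msg, seen)
first = len(msg) - 12
encode_name("mail.example.com", msg, seen)
second = len(msg) - 12 - first
```

     The first name costs 17 octets and the second only 7, because 
     "example.com" is replaced by a pointer. A domain name carried 
     inside the data of an unrecognized record type must be written 
     out in full every time, so no such saving is available. 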
      
  4.7.    Cache Overflow 
      
     Another issue related to data size is the amount of memory 
     available to a particular cache. All caches have fixed amounts of 
     available memory, and when that memory is consumed, some data will 
     have to be expired from the cache. In these cases, the cache will 
     have to query for the data again (causing performance penalties), 
     and will then have to bump some other data from the memory pool in 
     order to make room for the data again. In heavily loaded 
     environments (such as a very busy ISP), this can result in a 
     constant churning of the memory pool. 
      
     This is obviously a good reason to limit the size of the resource 
     records in use, but it is also a good reason for limiting the 
     total number of resource records in use with a particular 
     application. Since each entry will have to consume memory in a 
     cache somewhere, excess records or excessively large records will 
     both contribute to the potential for cache churning. 
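
     Cache churn is easy to reproduce in a small simulation. The 
     sketch below assumes a least-recently-used eviction policy, 
     which is only one of several policies a real cache might use, 
     and counts how often the cache has to query again. The class 
     and function names are hypothetical. 

```python
from collections import OrderedDict


class BoundedCache:
    """Fixed-capacity cache that evicts the least recently used entry."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.entries = OrderedDict()
        self.misses = 0

    def get(self, key, fetch):
        if key in self.entries:
            self.entries.move_to_end(key)       # mark as recently used
            return self.entries[key]
        self.misses += 1                        # must query upstream again
        value = fetch(key)
        self.entries[key] = value
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)    # bump the oldest entry
        return value


def run(capacity: int, names: list, rounds: int) -> int:
    """Query every name in turn, rounds times, and count the misses."""
    cache = BoundedCache(capacity)
    for _ in range(rounds):
        for name in names:
            cache.get(name, fetch=lambda n: f"data for {n}")
    return cache.misses


names = [f"host{i}.example.com" for i in range(5)]
roomy = run(capacity=5, names=names, rounds=10)
tight = run(capacity=4, names=names, rounds=10)
```

     With room for all five names the cache misses 5 times in 50 
     queries. With room for only four, this access pattern misses on 
     all 50 queries, so a single extra record is enough to turn the 
     cache into pure overhead. 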
      
  4.8.    Cache Lag 
      
     Since DNS is optimized for lookups, the use of intermediary and 
     end-node caches allows lookups to be held in memory at a location 
     that is "closer" to the user, which generally improves performance 
     over having to follow a complex delegation chain for every query. 
     However, caching can be somewhat hostile towards general-purpose 
     database models, particularly in light of the fact that DNS 
     provides no mechanisms for forcing a system to flush its cache of 
     previously discovered records. 
      
     In particular, caches prevent data from being validated against an 
     authoritative source. While this is normally beneficial for lookup 
     activities, it can be a devastating feature for data models that 
     require data-integrity at all times. For example, a resource 
     record which recorded the user who was currently logged on at a 
     terminal might seem to be a useful feature, but cache lag would 
     tend to make the data inaccurate more often than accurate, thereby 
     making it useless for its intended purpose. 
      
     Although DNS servers can dictate the length of time that a 
     resource record is to be held in a cache, this feature depends on 
     several additional requirements. Furthermore, data models which 
     require the use of low time-to-live settings are generally frowned 
     upon by the DNS community, as these resource records place a 
     disproportionate burden on the lookup infrastructure. For these 
     reasons, DNS is inappropriate for data models which require full-
     time and instantaneous data integrity. 
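
     The logged-on-user example can be simulated directly. The 
     sketch below uses a hypothetical cache with an injected clock 
     so that the passage of time can be controlled; the names and 
     the 300-second time-to-live are illustrative only. 

```python
class TtlCache:
    """Caches answers until their time-to-live expires."""

    def __init__(self, clock):
        self.clock = clock        # callable returning the time in seconds
        self.entries = {}         # name -> (value, expiry time)

    def resolve(self, name, authoritative, ttl):
        entry = self.entries.get(name)
        if entry and entry[1] > self.clock():
            return entry[0]       # served from cache, possibly stale
        value = authoritative[name]
        self.entries[name] = (value, self.clock() + ttl)
        return value


now = [0]
cache = TtlCache(clock=lambda: now[0])
zone = {"tty1.example.com": "alice"}      # hypothetical "who is logged on"

first = cache.resolve("tty1.example.com", zone, ttl=300)
zone["tty1.example.com"] = "bob"          # the authoritative data changes
now[0] = 120
stale = cache.resolve("tty1.example.com", zone, ttl=300)
now[0] = 301
fresh = cache.resolve("tty1.example.com", zone, ttl=300)
```

     Two minutes after the change the cache still answers "alice", 
     and the new value is seen only once the time-to-live has run 
     out. Nothing available to the zone administrator can shorten 
     that window after the record has been cached. 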
      
  4.9.    World-Readable Data 
      
     DNS does not provide any mechanisms for authenticating users 
     during the lookup process, nor does it provide any standardized 
     mechanisms for linking access controls to a resource record. 
     Without these features, DNS is unsuitable for queries which 
     require authenticated access on a per-user basis. 
      
     For example, if an application wanted to store contact information 
     for employees in DNS, access to the data would likely be 
     restricted to certain people (perhaps allowing the general public 
     to see some level of anonymous data, while allowing internal 
     personnel to see greater levels of detail, while allowing the 
     supervisor to see all of the data). However, this model requires 
     user-specific authentication for each lookup process, and it also 
     requires that each resource record have an attribute list that 
     determines who is allowed to see the data. 
      
     DNS does not provide any mechanisms for authentication within 
     the lookup process, and adding them would be a massive 
     undertaking, which is not very likely given that there are many 
     other protocols already in place which provide similar 
     mechanisms. Similarly, the DNS 
     protocol does not provide any mechanisms for storing and 
     exchanging access lists along with resource records. Adding this 
     information to the standardized resource record structure is not a 
     simple task, and would likely result in a substantial increase in 
     message overflow. 
      
     Although some DNS servers currently provide mechanisms for 
     restricting access based on qualifiers such as the IP address of 
     the client, it is important to point out that once the resource 
     records get into a cache outside of the protected scope, the 
     information is only as secure as that cache. In this regard, a 
     caching server that resides outside of a firewall can be just as 
     informative as the DNS servers inside the firewall. In the end, 
     there is no such thing as "private" information with DNS. All data 
     which is stored in DNS should be treated as if it were public 
     data, visible to all users. 
   
  5.      Design Conclusion 
      
     Due to the architectural tradeoffs inherent in the DNS lookup 
     model, some usage models are better suited to DNS than others. In 
     particular, DNS is highly efficient at lookups of compact, public 
     and relatively stable data. Conversely, DNS is unsuitable for 
     value-based queries or searches, restricted-access data, highly-
     dynamic data, or large records and arrays. 
      
     For usage models which require access to those kinds of data, 
     application protocols such as LDAP or HTTP would be more 
     appropriate, and would provide greater rewards. 
      
  6.      Going Standards-Track 
      
     In most cases, planners and developers can define 
     their own resource record types as part of another standards-track 
     specification without interference from the DNS community as long 
     as the functional scope is limited to defining data-structures for 
     those resource record types. However, there are some cases where 
     it may be useful or necessary for the DNS community to be involved 
     with the standardization process. 
      
     In particular, if a DNS resource record type requires a server to 
     perform some kind of extra processing beyond echoing resource 
     record data from a database into a message, then the DNS community 
     should be consulted. For example, requiring that servers provide 
     additional data outside the answer section of the response message 
     should be vetted with the community. 
      
     Similarly, if a specification requires special structuring of the 
     message for the benefit of a single service, then the DNS 
     community should definitely be involved in the discussion, since 
     any changes to the highly-optimized (binary) message format could 
     be disastrous in non-obvious ways.  
      
     Requests to reserve portions of the namespace for the use of a 
     single network service should also be brought to the DNS community 
     for discussion. 
      
     Finally, if a resource record goes against more than two of the 
     good-use guidelines put forth throughout this document, then it 
     would probably be a good idea to consult with the DNS community 
     over any alternatives which may be available. 
      
   
     In all cases, IANA must be involved in assigning resource record 
     type codes and mnemonics. 
      
  7.      Security Considerations 
      
     This document does not create any security considerations. 
      
  8.      IANA Considerations 
      
     This document does not create any IANA considerations. 
      
  9.      Author's Address 
      
     Eric A. Hall 
     ehall@ehsco.com 
      
  10.     Normative References 
      
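          [RFC1123]     Braden, R. "Requirements for Internet Hosts - 
                         Application and Support", STD 3, RFC 1123, 
                         October 1989. 
      
          [RFC2119]     Bradner, S. "Key words for use in RFCs to 
                         Indicate Requirement Levels", BCP 14, RFC 
                         2119, March 1997. 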
      
          [RFC2181]     Elz, R., and Bush, R. "Clarifications to the 
                         DNS Specification", RFC 2181, July 1997. 
      
          [STD13]       Mockapetris, P. "Domain names - concepts and 
                         facilities", STD 13, RFC 1034 and "Domain 
                         names - implementation and specification", STD 
                         13, RFC 1035, November 1987. 
      
  11.     Acknowledgments 
      
     Funding for the RFC editor function is currently provided by the 
     Internet Society. 
      
     Edward Lewis provided valuable feedback during the development of 
     this document. 
      
  12.     Full Copyright Statement 
      
     Copyright (C) The Internet Society (2003). All Rights Reserved. 
      
     This document and translations of it may be copied and furnished 
     to others, and derivative works that comment on or otherwise 
     explain it or assist in its implementation may be prepared, 
     copied, published and distributed, in whole or in part, without 
     restriction of any kind, provided that the above copyright notice 
     and this paragraph are included on all such copies and derivative 
     works. However, this document itself may not be modified in any 
     way, such as by removing the copyright notice or references to the 
     Internet Society or other Internet organizations, except as needed 
     for the purpose of developing Internet standards in which case the 
     procedures for copyrights defined in the Internet Standards 
     process must be followed, or as required to translate it into 
     languages other than English. 
      
     The limited permissions granted above are perpetual and will not 
     be revoked by the Internet Society or its successors or assigns. 
      
     This document and the information contained herein is provided on 
     an "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET 
     ENGINEERING TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR 
     IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF 
     THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED 
     WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. 
      
   