File Allocation Table How It Seems To Work Thomas
Kjoernes, Introduction In this article I will talk about FAT. No, not the flabby stuff that sits around your waist, but the MS-DOS file system with the same name. The reason for writing this is of course to let everybody know that I know quite a bit about the FAT file system. It cleanses my soul; with all the devil worshipping Ive done in the past I sure need it! Seriously, I find writing technical articles about programming and related matters very educational. Im using this as an excuse to learn more about FAT for the purpose of writing a FAT file system driver for my OS project. I hope somebody will find this information useful. I will start out by explaining a little bit about the various flavours of FAT as used by MS-DOS, then Ill move on to explaining the various parts that make up the FAT file system and describe the structures Some FAT source code and information:
I will try to update this
document with sample code and descriptions of how to
interpret the FAT and perform common operations, such as
searching for and reading/writing files. FAT Types Today FAT comes in three different flavours FAT12, FAT16 and FAT32. The names refer to the number of bits used by the entries in table that gave the file system its name! The "File Allocation Table" itself is actually one of the structures inside the FAT file system as seen on-disk. The purpose of this table is to keep track of which areas of the disk are available and which areas are in use. Another important part about FAT is the "Long File Name" extension to FAT sometimes referred to as VFAT. The terms LFN and VFAT are closely related, but VFAT really means just Virtual FAT. FAT Overview Its time to get slightly technical. Ill first just mention all the structures almost in the order in which they usually appear inside the partition. When talking about the order of things Im referring to order as seen through the Logical Block Address of a particular structure. Cluster This term is very fundamental for FAT. A cluster is a group of sectors on the FAT media. Only the part of the partition called the "data area" is divided into clusters. The rest of the partition is simply sectors. Files and directories store their data in these clusters. The size of one cluster is specified in a structure called the Boot Record and can range from a single sector to 128 sector(s). Boot Record All the three flavours of FAT have a Boot Record, which is located within an area of reserved sectors. The DOS format program reserves 1 sector for FAT12 and FAT16 and usually 32 sectors for FAT32. Microsoft has recently made a document available that attempts to teach us about FAT. The document is good as a reference if you already know something about FAT, but its not very good at describing how to actually use the information. The document can be found at - http://www.microsoft.com/hwdev. I will not attempt to duplicate all the technical documentation presented in that document. I suggest you get it and read if you a very detailed explanation of every single field in the Boot Record for instance. File Allocate Table The actual "File Allocation Table" structure is a relatively simple structure, as are all of the FAT structures really. The FAT is a simple array of 12-bit, 16-bit or 32-bit data elements. Usually there will be two identical copies of the FAT. There is a field in the Boot Record that specifies the number of FAT copies. With FAT12 and FAT16, MS-DOS uses only the first copy, but the other copies are kept in sync. FAT32 was enhanced to specify which FAT copy is the active one in a 4-bit value part of a "Flags" field. Its quite common to think of the FAT as a singly linked list. Each of the chains in the FAT specify which parts of the disk belong to a given file or directory. Root Directory The Root Directory is formatted like any other directory except it does not contain the "dot" and "dot-dot" entries. See the details section for more information. The root directory can always be found immediately following the file allocation table(s) for FAT12 and FAT16 volumes. Data Area Time has come to describe the user data area. What is there to say really? The user data area (or just data area if you like) is where the contents of files and directories are stored. Simple as that See the formulas above for how to calculate the size of the data area. And yes, the data area is divided into sector groups called clusters. All the clusters in a single FAT volume have the same size. To further educate you, the term slack space refers to any unused space at the end of a cluster and cannot be used by any other file or directory. Note that directories are not known to suffer from slack space problems. This is simply because the exact size in bytes of a directory is not recorded as with files and generally no one seem to care anyway. The data area section will not be explained in detail. There is simply nothing more to say about it. Information on how to access files and directories is the closest we get to data area details. Wasted Sectors If the number of data sectors is not evenly divisible by the cluster size you end up with a few wasted data sectors. Also if the partition as declared in the partition table is larger than what is claimed in the Boot Record the volume can be said to have wasted sectors. If you are not familiar with the term partition table, I suggest that you go to Hale Landis web site and look for the How It Works series of documents at - http://www.ata-atapi.com. Boot Record Details The Boot Record is located at the very beginning of a FAT volume. FAT12 and FAT16 boot sectors occupy a single sector while FAT32 boot sectors are generally said to consist of three sectors. The first sector of the volume or the first few sectors are also known as the "reserved" sectors or the reserved area. The Reserved_Sectors field in the boot record tells us how large this area is. Note that the first FAT follows directly after the reserved area. As mentioned in the overview section, the DOS format program reserves 1 sector for FAT12 and FAT16 and usually 32 sectors for FAT32. The reserved area for FAT32 contains not only the boot record but also a backup copy of the boot record. The boot record includes a field describing the sector size for the media and it is possible to have a boot record size different than the 512-byte commonly seen on harddisks and diskettes (usually seen on RAM-disks). Inside the boot record there is a structure called the BPB or BIOS Parameter Block. This structure is what really makes up the boot record. To further complicate things there are different versions of the boot record as well. Basically, as storage technology developed and disks got bigger, a few new fields were added to the boot record to support larger disk sizes. Of course being a lazy person, I will completely ignore the older boot record layouts and concentrate on the most current implementation. The structure definitions that follows come from one of the include files used by my OS FAT.INC. I only attempt to show you the layout of the BPB and remaining boot sector structures. If you want details of the purpose of each field and what they may and may not contain I urge you to read the Microsoft FAT overview document, which describes this quite well. Note that the BPB and Boot Sector layouts for FAT12 and FAT16 are identical. For this reason a show only the FAT12 and FAT32 structures. ; BPB for FAT12 and FAT16 volumes
; BPB for FAT32 volumes
; Boot Sector layout for FAT12 and FAT16
;Boot Sector layout for FAT32
File Allocation Table Details Its time to mention clusters again. The 0th and 1st FAT entries are reserved and contain some special information. The 2nd entry and up tells you the state of the corresponding cluster in the data area. A value of zero indicates that the cluster represented by that FAT entry is available and can be used when allocating new space for a file or directory. A hexadecimal value in the range FFF8-FFFF indicates that the cluster is the last in that chain of clusters. Yes, as you might have guessed, this example is only valid for FAT16. Check out the table below for cluster values for all FAT types: Cluster Values
If you have a closer look at the FAT32 values, youll see that only they are actually 28-bits long. This is actually correct. The upper nibble is stated by Microsoft as being reserved and might have a special meaning in future FAT32 implementations. They also tell you that
you should mask the bits off when interpreting the FAT.
Also, care should be taken to preserve the upper bits
when changing the FAT. The MS document tells us that the FAT type is determined by how many clusters there is room for in the user data area. Have a look at the table below: FAT Type Determination
As seen in the previous table, 11 values were reserved for other purposes, which tells us to simply subtract that number from the maximum number of values represented by a 12-bit, 16-bit and 28-bit number respectively (remember that the upper 4-bits are reserved in FAT32). This does not match what the MS document says unfortunately. The MS numbers are one less that when just taking account for the reserved values. Since the document explicitly states that THIS INFORMATION IS CORRECT and the NUMBERS ARE CORRECT we must obey and use those numbers instead of the ones I really want to use. If you take a look at another of "my" documents FAT32.HTM, which was actually written by Microsoft, they use "my" numbers. The actual statement is:
The size of the data area
is determined using the horrible pseudo-formulas shown
below. I split the formula into smaller logical parts,
simply because it looked horrible all in one go. FAT_Sectors = Number_Of_FATs * Sectors_Per_FAT Data_Sectors = Total_Sectors (Reserved_Sectors + FAT_Sectors + Root_Sectors) Total_Clusters
= Data_Sectors / Sectors_Per_Cluster To locate the start sector
of the data area you can use the following formula: Root Directory Details As stated in the overview section, the root directory is formatted just like any other directory. See the directory and LFN details section for information about the directory entry format. FAT12 and FAT16 volumes
have the root directory located immediately following the
file allocation table(s). The following formula gives you
the starting sector number for the root directory: See the formula in the previous example for the FAT_Sectors details. On FAT32 volumes the root
directory is made up of an ordinary cluster chain. A
field in the Boot Record will tell you the initial
cluster number. Once youve got the initial cluster
number you can easily get the starting sector number for
FAT32 as well: This is exactly the same
way you would find the starting sector number for a file
or a directory. The Data_Area_Start formula
includes Root_Sectors, but that does not apply to
this formula. The Boot Record for a FAT32 volume will
have the field holding number of entries in the root
directory set to zero anyway. Directory and LFN details I feel it is time to describe how directories are implemented in the FAT file system. All directories except the root directory for FAT12 and FAT16 drives are actually files. A file is a stream of bytes located in the data area portion of the volume made up by one or more clusters. The exact size of the file is recorded in the directory structure. The size of directory files is not recorded anywhere. Directories are always treated as being a multiple of the cluster size. Directories will be expanded when a new entry is added and the directory is already full. Note that the root directory for FAT12 and FAT16 drives cannot be expanded. The size of the root directory for these FAT types was determined when the volume was formatted. The directory is divided into small structures called Directory Entries. Each directory entry is always exactly 32 bytes long. The entry holds the name, attribute, size, date, time and initial cluster number for the file or directory. The Long File Names supported with VFAT are stored in addition to the short/normal name in entries preceding the short name. The LFN entry is hidden from older utility software (basically older MS-DOS versions) by using a traditionally illegal attribute value. Long names are stored in UNICODE, which is the 16-bit successor of ASCII. Each UNICODE character normally appears as an ASCII character followed by a null byte. Each LFN entry can hold up to 13 UNCODE characters. If the long name is longer than that additional entries are stored in the directory. Also note that the LFN entries are stored in reverse order with the first part of the long name appearing just before the short name entry. Normal/Short Entry
Format
The time and date fields use the following "packed" storage format: Packed Time:
Packed Date:
I havent seen any official MS documentation on LFN entries, but after studying directory sectors and reading lots about it from various sources, including Ralf Browns Interrupt List, which you BTW, should get from this location <insert WWW page here>, Ive come to the conclusion that this information is very accurate. Also note that its
merely a guess that the new file access/creation time/date
fields added with VFAT are stored in the same format as
the standard time/date fields. Long File Name Entry
Format
The Long File Name entries include every single character in the long name including the "dot" which is implied by short names. The very last LFN entry will have a null-terminator if less than 13 characters are stored in that long name entry. If only 12 characters or less are stored the remainder of the long name parts are filled with FFFFs. TK. |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||