Oracle Database 10g’s mantra is simpler, more automated database management. We DBAs cannot help but get excited about the new technology Oracle routinely releases, and with a volume manager and file system now built into the database kernel, that enthusiasm is hard to overstate. While Oracle 10g delivers numerous interesting new features, we will examine just one, Automatic Storage Management (ASM), which makes disk space management in Linux and UNIX database environments a snap.
What Exactly Is Automatic Storage Management?
ASM is Oracle’s storage management system. It directly manages potentially thousands of Oracle database files by letting the DBA create disk groups, each composed of disks and the files that reside on them. The DBA then needs to manage only a small number of disk groups rather than individual files and devices.
The task of planning, initializing, allocating, and managing many disks for several databases, or even a single large database, quickly becomes unwieldy. Capitulation is often inevitable: many companies simply treat the disk storage farm as a black box, abstracting that complexity away from the database. Very often, this approach leads to database I/O bottlenecks that are time-consuming to diagnose. At the other extreme, in enterprises where the DBA’s duties do include disk management, those tasks consume far too much precious time. Neither scenario should be acceptable.
Oracle Database 10g’s new ASM feature directly addresses these real-world scenarios by providing an effective and simple middle ground. Instead of spending inordinate amounts of time managing disk complexities, or abstracting them away from the database and hoping all is well, we can simply let ASM manage it all. We merely allocate disks to Oracle, state our striping and mirroring preferences via templates, and let ASM manage the space. This eliminates the need for traditional tools such as Veritas Volume Manager (VxVM) and the VxFS file system, along with the numerous commands and GUI tools (such as VxSA) required to administer that storage. As a result, Linux and UNIX database servers can be deployed more easily and quickly to meet ever-growing database size requirements, and with much greater I/O efficiency than before.
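To make the template idea concrete, here is a minimal sketch, assuming a hypothetical disk group named dgroup1 (disk groups are created later in this article) and a made-up template name; the FINE attribute requests fine-grained striping for any file created with that template:

ALTER DISKGROUP dgroup1 ADD TEMPLATE hot_stuff ATTRIBUTES (FINE);
CREATE TABLESPACE hot_ts DATAFILE '+dgroup1(hot_stuff)' SIZE 1G;

Files created without naming a template simply pick up the default attributes for their file type.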
It is important to note that ASM is not a general-purpose file system and is not intended to replace the operating system’s file system. For example, if ASM were used to manage background and user trace files, it could be difficult for the DBA to access the information they contain. ASM files cannot be accessed directly by the operating system; as a result, if the operating system cannot open the alert.log file, the DBA cannot view its contents in a text editor to diagnose database problems. These types of files should instead be managed through the operating system’s file system.
Easier to Set Up
ASM is just plain easier to set up and administer than a traditional file system and volume manager combination. Let’s assume we have a simple database to create: just the required tablespaces plus one for user data. Let’s also assume the following hardware and software constraints:
- RAID 0—stripe everything across all drives
- Stripe Width = 4
- Stripe Length = 64 KB
- Four 20 GB IDE disks, each with a single partition
- Linux ext3 file system (2 GB file size limit)
- Five Tablespaces:
  - SYSTEM, 2GB, 1 data file
  - SYSAUX, 2GB, 1 data file
  - UNDO, 8GB, 4 data files
  - TEMP, 8GB, 4 data files
  - USER, 60GB, 30 data files
- One Physical Volume (PV) per disk drive
- One Volume Group (VG)—VG01
- Four Logical Volumes (LV):
  - LV01, 4GB, SYSTEM and SYSAUX
  - LV02, 8GB, UNDO
  - LV03, 8GB, TEMP
  - LV04, 60GB, USER
Now let’s compare the actual steps to create the database in each environment. Note that in the LVM example I purposely ignore the various overhead issues that affect all of the size settings, in order to keep the example simple. However, be warned: for optimal disk space usage, you will have to tackle those issues as well. They are yet another example of the complex storage planning and management work you can avoid by using ASM.
Here are the steps using cooked files with an LVM. (Note that the IDE disk drives are assumed to be the second through fifth drives in the example, hence the b-e device name designations of /dev/hdb through /dev/hde):
1. fdisk /dev/hdb set its type to 0x8e (LVM partition)
2. fdisk /dev/hdc set its type to 0x8e (LVM partition)
3. fdisk /dev/hdd set its type to 0x8e (LVM partition)
4. fdisk /dev/hde set its type to 0x8e (LVM partition)
5. pvcreate /dev/hdb /dev/hdc /dev/hdd /dev/hde
6. vgcreate VG01 /dev/hdb /dev/hdc /dev/hdd /dev/hde
7. lvcreate -L 4G -i 4 -I 64 -n LV01 VG01
8. lvcreate -L 8G -i 4 -I 64 -n LV02 VG01
9. lvcreate -L 8G -i 4 -I 64 -n LV03 VG01
10. lvcreate -L 60G -i 4 -I 64 -n LV04 VG01
11. mkfs -t ext3 /dev/VG01/LV01
12. mkfs -t ext3 /dev/VG01/LV02
13. mkfs -t ext3 /dev/VG01/LV03
14. mkfs -t ext3 /dev/VG01/LV04
15. mount /dev/VG01/LV01 /home/oracle/oradata/LVMDB/system
16. mount /dev/VG01/LV02 /home/oracle/oradata/LVMDB/undo
17. mount /dev/VG01/LV03 /home/oracle/oradata/LVMDB/temp
18. mount /dev/VG01/LV04 /home/oracle/oradata/LVMDB/user1
19. Edit /etc/fstab and add the new mount point entries
20. Create initLVMDB.ora file
o INSTANCE_TYPE = RDBMS
21. SQL Plus connect as SYSDBA for SID=LVMDB
22. STARTUP NOMOUNT PFILE=initLVMDB.ora
23. CREATE SPFILE FROM PFILE='initLVMDB.ora'
24. Create Oracle database and user tablespace using SQL code.
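To make step 24 concrete, here is a rough sketch of the SQL, with hypothetical file names, the user tablespace called users, and every file capped at 2000MB because of the ext3 file size limit assumed above. Only the first two files of each multi-file tablespace are shown; the USER tablespace would actually need 30 such files to reach 60GB:

CREATE DATABASE LVMDB
  DATAFILE '/home/oracle/oradata/LVMDB/system/system01.dbf' SIZE 2000M
  SYSAUX DATAFILE '/home/oracle/oradata/LVMDB/system/sysaux01.dbf' SIZE 2000M
  UNDO TABLESPACE undotbs DATAFILE
    '/home/oracle/oradata/LVMDB/undo/undo01.dbf' SIZE 2000M,
    '/home/oracle/oradata/LVMDB/undo/undo02.dbf' SIZE 2000M
  DEFAULT TEMPORARY TABLESPACE temp TEMPFILE
    '/home/oracle/oradata/LVMDB/temp/temp01.dbf' SIZE 2000M,
    '/home/oracle/oradata/LVMDB/temp/temp02.dbf' SIZE 2000M;
CREATE TABLESPACE users DATAFILE
  '/home/oracle/oradata/LVMDB/user1/users01.dbf' SIZE 2000M,
  '/home/oracle/oradata/LVMDB/user1/users02.dbf' SIZE 2000M;

Every file name, size, and mount point has to be spelled out by hand, which is exactly the bookkeeping burden described next.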
Now simply imagine that you will need dozens of such user tablespaces, each with lots and lots of data files. It is not hard to see why this example does not scale well. Keeping track of all the tablespaces, data files, logical volumes, physical volumes, and actual disk drives (for finding hot spots) is just too difficult.
Now here is the much simpler ASM example:
1. Create initASM.ora file
o INSTANCE_TYPE = ASM
2. SQL Plus connect as SYSDBA for SID=ASM
3. STARTUP NOMOUNT PFILE=initASM.ora
4. CREATE SPFILE FROM PFILE='initASM.ora'
5. CREATE DISKGROUP dgroup1 EXTERNAL REDUNDANCY DISK
'/dev/hdb', '/dev/hdc', '/dev/hdd', '/dev/hde'
6. Create initASMDB.ora file
o INSTANCE_TYPE = RDBMS
o DB_CREATE_FILE_DEST = '+dgroup1'
7. SQL Plus connect as SYSDBA for SID=ASMDB
8. STARTUP NOMOUNT PFILE=initASMDB.ora
9. Create Oracle database and user tablespace using SQL
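If we want step 9 to explicitly control the allocation sizes, a sketch of the SQL (sizes and tablespace names are illustrative; note that each file specification is now nothing more than the disk group name) might be:

CREATE DATABASE ASMDB
  DATAFILE '+dgroup1' SIZE 2000M
  SYSAUX DATAFILE '+dgroup1' SIZE 2000M
  UNDO TABLESPACE undotbs DATAFILE '+dgroup1' SIZE 8000M
  DEFAULT TEMPORARY TABLESPACE temp TEMPFILE '+dgroup1' SIZE 8000M;
CREATE TABLESPACE users DATAFILE '+dgroup1' SIZE 30G, '+dgroup1' SIZE 30G;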
And even that is only if we still want to explicitly control the data allocation sizes (which are really no longer a concern, since we are now working at the disk group level). We could instead let Oracle handle all of the internal space needs and issue the much simpler statement:
create database ASMDB;
Not only is this much shorter and easier to read, but note that our redo logs were also created in the striped disk group. (In the prior example, they were simply placed on the file system.) The main point is that laying out the storage and creating the database becomes so much simpler that, if you are managing lots and lots of disk drives (e.g., in a SAN or NAS environment), you cannot go wrong upgrading to Oracle Database 10g for ASM alone.
Furthermore, ASM automatically rebalances both the striping and the mirroring of a disk group whenever disks are added, dropped, or fail, and does so with the database completely online. Oracle therefore automatically keeps all of your objects fully striped. That is why ASM can claim to provide near-optimal I/O balancing without any manual tuning: it simply internalizes and automates what DBAs have been doing manually for years, namely trying to eliminate hot spots by spreading data across as many drives as possible. Note that you can control when and how aggressively Oracle performs that rebalancing via the ASM_POWER_LIMIT and related parameters.
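For example, assuming the production parameter name ASM_POWER_LIMIT and the dgroup1 disk group created above, the rebalance speed can be set globally or overridden for a single operation from the ASM instance:

ALTER SYSTEM SET ASM_POWER_LIMIT = 5;      -- default power used by automatic rebalances
ALTER DISKGROUP dgroup1 REBALANCE POWER 8; -- override the power for one manual rebalance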
Easier to Change
Of course, you only create the database once, so you might argue that the savings shown above are not reason enough to undertake any changes. So now, let’s examine what would happen in each scenario if we add four disks. This is where ASM really shines.
Let’s assume that our single USER tablespace is nearly full, containing just 10 tables and 10 indexes, where each table consumes 4GB and each index consumes 2GB. If we now need to create another table and index, we do not have enough room. Therefore, we are given four more disks identical to the first four to add to our storage design in order to accommodate additional space requests. In other words, we are going to add 80GB to our single USER tablespace. Sounds easy enough, right?
In the LVM example, we have three primary options:
- Create a new Volume Group VG02 with a new Logical Volume LV05
- Extend existing Volume Group VG01 with a new Logical Volume LV05
- Extend existing Volume Group VG01 by extending Logical Volume LV04.
[Note: Since we are assuming that all 80GB is to be dedicated to the USER tablespace, there is no need to create more than a single new Logical Volume for the first two options.]
Most people will choose the third option because we are merely trying to add space to our existing storage design. The steps to implement that choice are shown below.
1. fdisk /dev/hdf set its type to 0x8e (LVM partition)
2. fdisk /dev/hdg set its type to 0x8e (LVM partition)
3. fdisk /dev/hdh set its type to 0x8e (LVM partition)
4. fdisk /dev/hdi set its type to 0x8e (LVM partition)
5. pvcreate /dev/hdf /dev/hdg /dev/hdh /dev/hdi
6. vgextend VG01 /dev/hdf /dev/hdg /dev/hdh /dev/hdi
7. lvextend -L +80G /dev/VG01/LV04
8. ext2online /dev/VG01/LV04
9. SQL Plus connect as SYSDBA for SID=LVMDB
10. Add new space to the tablespace using SQL code.
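Step 10 is then the familiar routine of adding data files to the user tablespace (called users here), still in 2000MB pieces because of the ext3 file size limit; a sketch with hypothetical file names:

ALTER TABLESPACE users ADD DATAFILE
  '/home/oracle/oradata/LVMDB/user1/users31.dbf' SIZE 2000M,
  '/home/oracle/oradata/LVMDB/user1/users32.dbf' SIZE 2000M;

And so on, until the new 80GB has been allocated.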
Not only is this procedure long, it has several problems as well. First, online file system resizing is a very tricky business. The author of the ext2online utility states that “resizing a mounted file system is inherently dangerous and may corrupt file systems.” Moreover, ext2online can only enlarge a mounted file system; to shrink or enlarge a file system one would otherwise use ext2resize, which works only on unmounted file systems. That, of course, would require taking the tablespace offline.
The LVM approach has some not-so-obvious drawbacks as well. We very likely expected that this solution would result in our data being striped across all eight of our drives—not true. While we can add space to a Logical Volume, we cannot change its striping nature on Linux (although some UNIX LVMs do provide such capabilities). So our 10 old tables and 10 old indexes are striped across drives b-e, while our new table and index are striped across drives f-i (because the USER tablespace was already full, new objects will be created in the new space). Even if we exported the tablespace objects, dropped them, coalesced the tablespace, and then imported them back into the tablespace, the Logical Volume is still set for four-way striping. We would have to manually do the following if we really wanted eight-way striping:
- Export the objects in that tablespace (database in restricted session, to be safe)
- Drop the tablespace
- Drop the Logical Volume
- Create a new Logical Volume (with striping parameter set as -i 8)
- Create the tablespace (this would have lots of data file lines for all 140 GB)
- Import the objects into the tablespace.
That is where ASM steps in and makes life easy. The steps are shorter:
- SQL Plus connect as SYSDBA for SID=ASM
- ALTER DISKGROUP dgroup1 ADD DISK '/dev/hdf', '/dev/hdg', '/dev/hdh', '/dev/hdi'
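ASM then rebalances the existing data onto the new disks on its own. If you want to verify that the disks have joined the group and watch the rebalance finish, two quick queries against the ASM instance suffice (a sketch, with abbreviated column lists):

SELECT name, path, state, total_mb, free_mb FROM V$ASM_DISK;
SELECT operation, state, sofar, est_work, est_minutes FROM V$ASM_OPERATION;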
Good Performance, Too…
All of the above exemplifies why, for many shops, ASM alone might justify an upgrade to Oracle Database 10g. Although we are eliminating the LVM and file system, we are nonetheless using more Oracle technology to do essentially the same work, so the initial expectation is fairly similar or slightly better performance, with the advantage once again being far greater simplicity of creation and management. Roughly a 10 to 15 percent improvement from a feature that also makes the DBA’s life easier is not a bad return on investment for the cost of doing an upgrade.
Points to note:
- ASM provides raw-device-style access, so the Oracle kernel itself manages concurrency
- ASM can cache metadata (data about the ASM disk groups)
- ASM disk space can be expanded on the fly
- ASM automatically stripes at the extent level, improving I/O performance
These capabilities sound appealing. However, how does ASM work, and for what kinds of files?