ASM 翻译系列第三十二弹：自制数据抽取小工具

查看原文

其他

ASM 翻译系列第三十二弹：自制数据抽取小工具

Original 2017-02-23 邱大龙 沃趣科技

原作者：Bane Radulovic

译者：邱大龙

审核：魏兴华

DBGeeK社区联合出品

Find block in ASM

在本系列文章【 Where is my data】中，我已经演示了如何从ASM磁盘中定位和抽取一个Oracle的block，为了让这件事做起来不那么复杂，我又写了一个perl脚本find_block.pl来简化整个操作，只需要提供数据文件的名称和需要提取的block，这个脚本就可以输出从ASM磁盘组中抽取块的命令。

find_block.pl

find_block.pl是一个perl脚本，脚本里集成了dd或kfed命令来从ASM磁盘中抽取一个块，脚本可以在Linux和Unix的ASM版本下工作，且不管是单实例还是RAC环境。（不能是FLEX ASM）

脚本需要以Grid软件owner的身份来运行，而且要确保perl的二进制文件来自于Oracle安装软件的home目录下。在集群环境下，这个脚本可以运行在任意节点上，在运行脚本前，请检查ASM的环境变量，确定ORACLE_SID, ORACLE_HOME, LD_LIBRARY_PATH设定正确，而且对于10G和11GR1版本，需要设置PERL5LIB环境变量：

export PERL5LIB=$ORACLE_HOME/perl/lib/5.8.3:$ORACLE_HOME/perl/lib/site_perl

可以以如下的方式运行脚本：

$ORACLE_HOME/perl/bin/perl find_block.pl filename block

其中： filename是要抽取的块所在的文件名，对于数据文件来说，这个文件名可以从V$DATAFILE的NAME字段获取到，block代表要从ASM抽取的块号，这个块号是数据库的块号，而不是ASM的块号。

这个脚本的输出看起来像下面这样：

dd if=[ASM disk path] ... of=block_N.dd

在Exadata中是这样：

kfed read dev=[ASM disk path] ... > block_N.txt

对于数据文件来说，如果文件的冗余度是external外部冗余模式，这个脚本将产生一条单一的命令，对于是normal冗余，这个脚本将产生2个命令，对于high冗余，将产生3条命令。

Example with ASM version 10.2.0.1

第一个例子是单实例10.2.0.1的ASM版本，首先我在数据库中创建了一张表，插入一些数据。

[oracle@cat10g ~]$ sqlplus / as sysdba

SQL*Plus: Release 10.2.0.1.0 - Production on [date]

SQL> create table TAB1 (name varchar2(16)) tablespace USERS;

Table created.

SQL> insert into TAB1 values ('CAT');

1 row created.

SQL> insert into TAB1 values ('DOG');

1 row created.

SQL> commit;

Commit complete.

SQL> select ROWID, NAME from TAB1;

ROWID NAME

------------------ --------------------------------

AAANE+AAEAAAAGHAAA CAT

AAANE+AAEAAAAGHAAB DOG

SQL> select DBMS_ROWID.ROWID_BLOCK_NUMBER('AAANE+AAEAAAAGHAAA') "Block" from dual;

Block

---------

391

SQL> select t.name "Tablespace", f.name "Datafile"

from v$tablespace t, v$datafile f

where t.ts#=f.ts# and t.name='USERS';

Tablespace Datafile

------------ --------------------------------------

USERS +DATA/cat/datafile/users.259.783204313

SQL>

以上我们造取了两条数据，并且定位到了数据所在的文件和BLOCK号，切换到ASM环境，注意设置正确的环境变量PERL5LIB，然后运行脚本：

$ export PERL5LIB=$ORACLE_HOME/perl/lib/5.8.3:$ORACLE_HOME/perl/lib/site_perl

$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/cat/datafile/users.259.783204313 391

dd if=/dev/oracleasm/disks/ASMDISK01 bs=8192 count=1 skip=100359 of=block_391.dd

find_block.pl脚本如预期产生了输出，由于这是一个外部冗余的磁盘组，这个脚本只产生了一行dd命令的输出，我们把输出的dd命令复制后执行：

$ dd if=/dev/oracleasm/disks/ASMDISK01 bs=8192 count=1 skip=100359 of=block_391.dd

执行后会将块的内容输出到文本文件中block_3237.dd中，然后使用操作系统的od工具，可以看到插入表中的数据：

$ od -c block_391.dd | tail -3

0017740 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 , 001

0017760 001 003 D O G , 001 001 003 C A T 001 006 u G

0020000

非常好，正式我们插入的数据！

Example with ASM version 12.1.0.1 in Exadata

ASM空间的占用取决于2个因素：文件的实际大小和磁盘组的冗余度。

在external冗余的磁盘组中，空间的占用：文件实际大小+1个AU（文件头）+1个额外的AU（如果文件大于60个AU）。

在一个normal冗余的磁盘组中，空间的占用：两倍的文件实际大小+2个AU（文件头）+3个额外的AU（如果文件大于60个AU）

在一个high冗余的磁盘组中，空间的占用：三倍的文件实际大小+3个AU（文件头）+3个额外的AU（如果文件大于60个AU）

在Exadata中我们不能使用dd命令抽取数据块，因为ASM的磁盘对于数据库的server来说是不可见的，为了获得数据块，我们可以使用kfed工具，因此find_block.pl脚本做了这种自适应，如果是Exadata的环境，会使用kfed工具来从ASM磁盘中抽取块。

我们来看一个ASM 12.1.0.1 版本下的一个例子，是一个Exadata环境下双节点的RAC，数据文件是PDB中的一个数据文件。

和上面的例子一样，我首先创建一张表然后插入一些数据：

$ sqlplus / as sysdba

SQL*Plus: Release 12.1.0.1.0 Production on [date]

SQL> alter pluggable database BR_PDB open;

Pluggable database altered.

SQL> show pdbs

CON_ID CON_NAME OPEN MODE RESTRICTED

------ -------- ----------- ----------

2 PDB$SEED READ ONLY NO

...

5 BR_PDB READ WRITE NO

SQL>

$ sqlplus bane/welcome1@BR_PDB

SQL*Plus: Release 12.1.0.1.0 Production on [date]

SQL> create table TAB1 (n number, name varchar2(16)) tablespace USERS;

Table created.

SQL> insert into TAB1 values (1, 'CAT')

1 row created.

SQL> insert into TAB1 values (2, 'DOG');

1 row created.

SQL> commit;

Commit complete.

SQL> select t.name "Tablespace", f.name "Datafile"

from v$tablespace t, v$datafile f

where t.ts#=f.ts# and t.name='USERS';

Tablespace Datafile

---------- ---------------------------------------------

USERS +DATA/CDB/054.../DATAFILE/users.588.860861901

SQL> select ROWID, NAME from TAB1;

ROWID NAME

------------------ ----

AAAWYEABfAAAACDAAA CAT

AAAWYEABfAAAACDAAB DOG

SQL> select DBMS_ROWID.ROWID_BLOCK_NUMBER('AAAWYEABfAAAACDAAA') "Block number" from dual;

Block number

------------

131

SQL>

同样获得插入数据的文件号和块号，切换到ASM的环境，然后运行perl脚本：

$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/CDB/0548068A10AB14DEE053E273BB0A46D1/DATAFILE/users.588.860861901 131

kfed read dev=o/192.168.1.9/DATA_CD_03_exacelmel05 ausz=4194304 aunum=16212 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt

kfed read dev=o/192.168.1.11/DATA_CD_09_exacelmel07 ausz=4194304 aunum=16267 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt

我们观察到，find_block.pl脚本这次产生了2个命令，因此我们可以知道这是一个normal冗余的磁盘组，我们运行其中一个命令：

$ kfed read dev=o/192.168.1.9/DATA_CD_03_exacelmel05 ausz=4194304 aunum=16212 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt

我们将块的内容输出到了文本文件block_131.txt中，然后看到了我上面插入的数据DOG和CAT：

$ more block_131.txt

...

FD5106080 00000000 00000000 ... [................]

Repeat 501 times

FD5107FE0 00000000 00000000 ... [........,......D]

FD5107FF0 012C474F 02C10202 ... [OG,......CAT..,-]

Find any block

find_block.pl用来从ASM磁盘组中的任何一个文件中抽取块，不仅仅是数据文件，为了一乐，我对控制文件和控制文件上一个随机的块运行这个脚本：

$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/CDB/CONTROLFILE/current.289.843047837 5

kfed read dev=o/192.168.1.9/DATA_CD_10_exacelmel05 ausz=4194304 aunum=73 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt

kfed read dev=o/192.168.1.11/DATA_CD_01_exacelmel07 ausz=4194304 aunum=66 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt

kfed read dev=o/192.168.1.10/DATA_CD_04_exacelmel06 ausz=4194304 aunum=78 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt

我们注意到脚本正确的计算出了控制文件的block size（不同于数据块的大小8K，为16K），并且脚本产生出了3个不同的命令，虽然磁盘组DATA是normal冗余，但是控制文件却做了high冗余，也就是做了三副本，控制文件在这一点上跟ASM的元数据文件一样。

Conclusion

find_block.pl脚本通过dd或者kfed命令来从ASM磁盘组的文件中抽取块，可能大多数情况下，我们想要从数据文件中抽取一个块，但是这个脚本不仅仅适用于数据文件，也可以从控制文件、日志文件、任何的ASM文件中抽取块。

如果文件是external外部冗余的，那么这个脚本将输出一个单一的命令，执行这个命令可以直接从ASM的磁盘中抽取块。

如果文件是normal冗余的，这个脚本将输出2个命令，它用来从不同的磁盘中抽取块，这可能会比较有用，例如后台日志提示数据块损坏，ASM不能修复它，那么就可以通过镜像块来修复。

如果文件是high冗余的，这个脚本将产生3个命令。

最后，使用这个脚本你不用知道文件的冗余度、块的大小，和任何其他属性，你只需要关心文件名和块号。

附脚本

#!$ORACLE_HOME/perl/bin/perl -w

# The find_block.pl constructs the command(s) to extract a block from ASM.

# For a complete info about this script see ASM Support Guy blog post:

# http://asmsupportguy.blogspot.com/2014/10/find-block-in-asm.html

# This program is free software: you can redistribute it and/or modify it under

# the terms of the GNU General Public License as published by the Free Software

# Foundation, either version 3 of the License, or any later version.

# This program is distributed in the hope that it will be useful, but WITHOUT

# ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS

# FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details

# at http://www.gnu.org/licenses/.

# Version 1.00, Oct 2014

# The initial release.

# Version 1.01, Oct 2014

# Minor improvements.

# Version 1.02, Oct 2014

# Added support for AFD disks.

# Version 1.03, Nov 2014

# Added sanity checks, e.g. if the requested block is reasonable,

# if the specified filename is valid, etc.

# Version 1.04, Nov 2014

# Improved the check for Exadata storage cell based disk.

use strict;

use DBI;

use DBD::Oracle qw(:ora_session_modes);

use POSIX;

# Handle the version query

die "find_block.pl version 1.04\n"

if ( $ARGV[0] =~ /^-v/i );

# Check the number of input arguments

die "Usage: \$ORACLE_HOME/perl/bin/perl find_block.pl filename block\n"

unless ( @ARGV == 2 );

# Get the filename from the first input argument

my $filename = shift @ARGV;

# Check if the filename makes sense.

# The 'minimum' filename is +DGNAME/filename,

# i.e. it has to begin with the '+' followed by a disk group name,

# followed by at least one '/', followed by directory or file name...

die "Error: The $filename is not a valid file name.\n"

unless ( $filename =~ /^\+\w/ && $filename =~ /\/\w/ );

# Get the disk group name out of the user specified filename

my $diskgroup_name = substr($filename, 1, index($filename, "/") -1 );

# Get the ASM file name out of the user specified filename

my $asmfile = substr($filename, rindex($filename, "/") +1 );

# Get the block number from the second input argument

my $block_number = shift @ARGV;

# Check if the block number is an integer

die "Usage: \$ORACLE_HOME/perl/bin/perl find_block.pl filename block\n"

unless ( $block_number =~ /^\d+$/ );

# Check if the ASM SID is set

die "Error: ASM SID not set.\n"

unless ( $ENV{ORACLE_SID} =~ /\+ASM/ );

# Connect to the (local) ASM instance

my $dbh = DBI->connect('dbi:Oracle:', "", "", { ora_session_mode => ORA_SYSDBA })

or die "$DBI::errstr\n";

# Check if the disk group exists and if it is mounted

my $group_number = &asm_diskgroup("group_number", $diskgroup_name);

die "Error: Disk group $diskgroup_name not mounted or does not exist.\n"

unless ( $group_number );

# Check if the user specified file exists in the disk group

my $file_number = &asm_alias("file_number", $asmfile, $group_number);

die "Error: File $asmfile does not exist in disk group $diskgroup_name.\n"

unless ( $file_number );

# Get the block size for the file

my $block_size = &asm_file("block_size", $group_number, $file_number);

# Get the number of blocks in the file

my $file_blocks = &asm_file("blocks", $group_number, $file_number);

# Check if the user specified block number makes sense

die "Error: Block range for file $asmfile is: 0 - $file_blocks.\n"

unless ( $block_number >= 0 && $block_number <= $file_blocks );

# Get the disk group AU size

my $au_size = &asm_diskgroup("allocation_unit_size", $diskgroup_name);

# Work out the blocks per AU and the virtual extent number

my $blocks_per_au = $au_size/$block_size;

my $xnum_kffxp = floor($block_number/$blocks_per_au);

# Get the disk and AU numbers into the @disk_au array

my @disk_au = &asm_kffxp($file_number, $group_number, $xnum_kffxp);

die "Could not get any disk and AU numbers for file $asmfile.\n"

unless ( @disk_au );

# Get the disk path(s) and generate the block extract command(s)

while ( @disk_au ) {

# Do not assume anything

my $storage_cell = "FALSE";

# Get the disk number from @disk_au

my $disk_number = shift @disk_au;

# Get the AU number from @disk_au

my $au_number = shift @disk_au;

# Get the path for that disk number

my $path = &asm_disk("path", $group_number, $disk_number);

# If there is no path move to the next disk

if ( ! $path ) {

next;

}

# If ASMLIB is in use, the path will return ORCL:DISKNAME.

# Set the path to /dev/oracleasm/disks/DISKNAME

elsif ( $path =~ /ORCL:(.*)/ ) {

$path = "/dev/oracleasm/disks/".$1;

}

# If ASM Filter Driver (AFD) is in use, the path will return AFD:DISKNAME.

# Get the actual path from /dev/oracleafd/disks/DISKNAME

elsif ( $path =~ /AFD:(.*)/ ) {

if ( ! open AFDDISK, "/dev/oracleafd/disks/".$1 ) { next }

else { chomp($path = <AFDDISK>) }

}

# For Exadata storage cell based disk, the path will start with o/IP address

elsif ( $path =~ /^o\/\d{1,3}\./ ) {

$storage_cell = "TRUE";

}

if ( $storage_cell eq "TRUE" ) {

# Construct the kfed command for Exadata storage cell based disk

# dev=$path ausz=$au_size aunum=$au_number blksz=$block_size blknum=$block_number

# The grep filters out the kfed stuff

print "kfed read dev=$path ausz=$au_size aunum=$au_number blksz=$block_size blknum=$block_number | grep -iv ^kf > block_$block_number.txt\n";

}

else {

# Construct the dd command

# if=$path bs=$block_size count=1 skip=$skip of=block_$block_number.dd

my $skip=$au_number*$blocks_per_au + $block_number%$blocks_per_au;

print "dd if=$path bs=$block_size count=1 skip=$skip of=block_$block_number.dd\n";

}

# We are done. Disconnect from the (local) ASM instance

$dbh->disconnect;

# Subs

# Get a column from v$asm_file for a given group number and file number

sub asm_file {

my $col = shift @_;

my $group_number = shift @_;

my $file_number = shift @_;

my $sql = $dbh->prepare("select $col from v\$asm_file where group_number=$group_number and file_number=$file_number");

$sql->execute;

my $col_value = $sql->fetchrow_array;

$sql->finish;

return $col_value;

}

# Get a column from v$asm_alias for a given (file) name and group number

sub asm_alias {

my $col = shift @_;

my $name = shift @_;

my $group_number = shift @_;

my $sql = $dbh->prepare("select $col from v\$asm_alias where lower(name)=lower('$name') and group_number=$group_number");

$sql->execute;

my $col_value = $sql->fetchrow_array;

$sql->finish;

return $col_value;

}

# Get a column from v$asm_diskgroup for a given disk group name

sub asm_diskgroup {

my $col = shift @_;

my $name = shift @_;

my $sql = $dbh->prepare("select $col from v\$asm_diskgroup where name=upper('$name')");

$sql->execute;

my $col_value = $sql->fetchrow_array;

$sql->finish;

return $col_value;

}

# Get a column from v$asm_disk for a given group number and disk number

sub asm_disk {

my $col = shift @_;

my $group_number = shift @_;

my $disk_number = shift @_;

my $sql = $dbh->prepare("select $col from v\$asm_disk where group_number=$group_number and disk_number=$disk_number");

$sql->execute;

my $col_value = $sql->fetchrow_array;

$sql->finish;

return $col_value;

}

# Get the disk and AU numbers from x$kffxp for a given virtual extent number.

# This will return one row for an external redundancy file,

# two rows for a normal redundancy and three rows for a high redundancy.

# Well, it will return an array with disk and AU pairs, not rows.

sub asm_kffxp {

my $file_number = shift @_;

my $group_number = shift @_;

my $xnum = shift @_;

# The @disk_au array to hold the disk number, AU number rows

my @disk_au;

my $sql = $dbh->prepare("select disk_kffxp, au_kffxp from x\$kffxp where number_kffxp=$file_number and group_kffxp=$group_number and xnum_kffxp=$xnum");

$sql->execute;

# Expecting one disk number and one AU number per row

while ( my @row = $sql->fetchrow_array) {

# Add each (element of the) row to @disk_au array

foreach ( @row ) { push @disk_au, $_ }

}

$sql->finish;

return @disk_au;

}

中美友好合作故事——十万名中国弃婴长大了

不仅要看已抓谁，还须一直抓到没

因为家暴终身挂着粪袋的她，在等一个死刑！

【资源分享】【收集不易多多分享】【2024年12月29日】宝山区区面试流程和结构化真题20道其中5题幼儿（独家请老师给参考答案）

《鱿鱼游戏2》今天下午四点开播，网友无心上班了，导演悄悄剧透