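ASM Translation Series, Part 32: A Homemade Data Extraction Tool

2017-02-23, Qiu Dalong (邱大龙), Woqu Technology (沃趣科技)

Original author: Bane Radulovic

Translator: Qiu Dalong (邱大龙)

Reviewer: Wei Xinghua (魏兴华)

Jointly produced with the DBGeeK community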

Find block in ASM

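In the post "Where is my data" in this series, I showed how to locate and extract an Oracle block from an ASM disk. To make that less complicated, I wrote a Perl script, find_block.pl, that simplifies the whole operation: given a data file name and the block to extract, the script prints the command(s) that extract that block from the ASM disk group.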

find_block.pl

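find_block.pl is a Perl script that constructs the dd or kfed command needed to extract a block from an ASM disk. It works with ASM on Linux and Unix, in both single-instance and RAC environments (but not with Flex ASM).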

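The script must be run as the Grid software owner, using the perl binary from the Oracle home. In a cluster it can be run on any node. Before running it, check the ASM environment and make sure that ORACLE_SID, ORACLE_HOME and LD_LIBRARY_PATH are set correctly. For versions 10g and 11gR1, the PERL5LIB environment variable must also be set: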

export PERL5LIB=$ORACLE_HOME/perl/lib/5.8.3:$ORACLE_HOME/perl/lib/site_perl
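
Before running the script it can help to confirm that the environment matches what is described above. The following is a minimal Python sketch of such a pre-flight check; it is not part of find_block.pl, and the helper name `missing_asm_env` is hypothetical. The `+ASM` test mirrors the check that the script itself performs on ORACLE_SID.

```python
# Variables the article asks you to verify before running find_block.pl.
REQUIRED_VARS = ("ORACLE_SID", "ORACLE_HOME", "LD_LIBRARY_PATH")


def missing_asm_env(env):
    """Return a list of problems found in the given environment mapping.

    `env` is any dict-like object, e.g. os.environ.
    """
    problems = [f"{name} is not set" for name in REQUIRED_VARS if not env.get(name)]
    sid = env.get("ORACLE_SID", "")
    # find_block.pl refuses to run unless the SID contains "+ASM".
    if sid and "+ASM" not in sid:
        problems.append(f"ORACLE_SID={sid} does not look like an ASM SID")
    return problems
```

Calling `missing_asm_env(os.environ)` (after `import os`) returns an empty list when all three variables are present and the SID looks like an ASM SID.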

Run the script as follows:

$ORACLE_HOME/perl/bin/perl find_block.pl filename block

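Where: filename is the name of the file that holds the block to extract. For a data file, the name can be taken from the NAME column of V$DATAFILE. block is the number of the block to extract from ASM; it is the database block number, not the ASM block number.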
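
To illustrate how the filename argument is taken apart, here is a minimal Python sketch of the same split that the script performs with `substr`, `index` and `rindex`: the disk group name is everything between the leading `+` and the first `/`, and the ASM file name is everything after the last `/`. The function name is hypothetical, and the validity check is deliberately simpler than the one in the script.

```python
def split_asm_filename(filename):
    """Split '+DG/dir/.../name' into (disk group name, last path component)."""
    if not filename.startswith("+") or "/" not in filename:
        raise ValueError(f"{filename!r} is not a valid ASM file name")
    # Text between the leading '+' and the first '/'.
    diskgroup = filename[1:filename.index("/")]
    # Text after the last '/'.
    asmfile = filename[filename.rindex("/") + 1:]
    return diskgroup, asmfile
```

For the data file used in the first example below, `split_asm_filename("+DATA/cat/datafile/users.259.783204313")` gives `("DATA", "users.259.783204313")`.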

The output of the script looks like this:

dd if=[ASM disk path] ... of=block_N.dd

On Exadata it looks like this:

kfed read dev=[ASM disk path] ... > block_N.txt

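For a data file, the number of commands depends on the redundancy of the file: external redundancy produces a single command, normal redundancy produces two commands, and high redundancy produces three.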

Example with ASM version 10.2.0.1

The first example uses a single-instance ASM, version 10.2.0.1. First I create a table in the database and insert some data.

[oracle@cat10g ~]$ sqlplus / as sysdba

SQL*Plus: Release 10.2.0.1.0 - Production on [date]

SQL> create table TAB1 (name varchar2(16)) tablespace USERS;

Table created.

SQL> insert into TAB1 values ('CAT');

1 row created.

SQL> insert into TAB1 values ('DOG');

1 row created.

SQL> commit;

Commit complete.

SQL> select ROWID, NAME from TAB1;

ROWID              NAME

------------------ --------------------------------

AAANE+AAEAAAAGHAAA CAT

AAANE+AAEAAAAGHAAB DOG

SQL> select DBMS_ROWID.ROWID_BLOCK_NUMBER('AAANE+AAEAAAAGHAAA') "Block" from dual;

    Block

---------

      391

SQL> select t.name "Tablespace", f.name "Datafile"

from v$tablespace t, v$datafile f

where t.ts#=f.ts# and t.name='USERS';

Tablespace   Datafile

------------ --------------------------------------

USERS        +DATA/cat/datafile/users.259.783204313

SQL>
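
The block number returned by DBMS_ROWID in the session above can also be checked by hand. An extended ROWID is 18 characters of base64-style text: 6 characters for the data object number, 3 for the relative file number, 6 for the block number and 3 for the row number. The following Python sketch decodes those parts; the function names are hypothetical, and it is meant only as a cross-check, not as a replacement for DBMS_ROWID.

```python
import string

# Alphabet used by extended ROWIDs: A-Z, a-z, 0-9, '+', '/'.
ROWID_ALPHABET = string.ascii_uppercase + string.ascii_lowercase + string.digits + "+/"


def _decode(chars):
    """Decode a run of ROWID characters as a base-64 number."""
    value = 0
    for ch in chars:
        value = value * 64 + ROWID_ALPHABET.index(ch)
    return value


def rowid_parts(rowid):
    """Split an 18-character extended ROWID into its four numeric parts."""
    if len(rowid) != 18:
        raise ValueError("expected an 18-character extended ROWID")
    return {
        "object": _decode(rowid[0:6]),
        "relative_file": _decode(rowid[6:9]),
        "block": _decode(rowid[9:15]),
        "row": _decode(rowid[15:18]),
    }
```

For `AAANE+AAEAAAAGHAAA` the block part is `AAAAGH`, i.e. 6 × 64 + 7 = 391, which agrees with the query result.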

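We have now created two rows and found the data file and block number that hold them. Switch to the ASM environment, make sure PERL5LIB is set correctly, and run the script: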

$ export PERL5LIB=$ORACLE_HOME/perl/lib/5.8.3:$ORACLE_HOME/perl/lib/site_perl

$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/cat/datafile/users.259.783204313 391

dd if=/dev/oracleasm/disks/ASMDISK01 bs=8192 count=1 skip=100359 of=block_391.dd

$

find_block.pl produced the expected output. Because this is an external redundancy disk group, the script printed a single dd command. Copy that command and run it:

$ dd if=/dev/oracleasm/disks/ASMDISK01 bs=8192 count=1 skip=100359 of=block_391.dd

$

The command writes the content of the block to the file block_391.dd. The operating system's od utility then shows the data that was inserted into the table:

$ od -c block_391.dd | tail -3

0017740 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 , 001

0017760 001 003 D O G , 001 001 003 C A T 001 006 u   G

0020000

$

Excellent, this is exactly the data we inserted!
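
The skip value in the dd command above can be reproduced by hand. The script computes it as AU number × blocks per AU + (block number mod blocks per AU). The Python sketch below does the same arithmetic. Two of its inputs are assumptions that do not appear in the output above: an AU size of 1 MB (the usual AU size for ASM 10.2) and AU number 784, which is the value that skip=100359 implies under that AU size.

```python
def virtual_extent(au_size, block_size, block_number):
    """Virtual extent that holds the block (the script's xnum_kffxp)."""
    blocks_per_au = au_size // block_size
    return block_number // blocks_per_au


def dd_skip(au_number, au_size, block_size, block_number):
    """Number of block_size blocks that dd has to skip on the ASM disk."""
    blocks_per_au = au_size // block_size
    return au_number * blocks_per_au + block_number % blocks_per_au


# Assumed values: 1 MB AU size, AU number 784 (inferred, not shown in the article).
AU_SIZE = 1024 * 1024
BLOCK_SIZE = 8192

print(virtual_extent(AU_SIZE, BLOCK_SIZE, 391))   # extent looked up in x$kffxp
print(dd_skip(784, AU_SIZE, BLOCK_SIZE, 391))     # value passed to dd as skip=
```

With 128 blocks per AU, block 391 is the 8th block (offset 7) of virtual extent 3, so the skip is 784 × 128 + 7 = 100359.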

Example with ASM version 12.1.0.1 in Exadata

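On Exadata we cannot use dd to extract a data block, because the ASM disks are not visible to the database server. To get the block we can use the kfed tool instead, so find_block.pl adapts: in an Exadata environment it generates kfed commands to extract the block from the ASM disk.

Let's look at an example with ASM version 12.1.0.1 on a two-node RAC on Exadata. The data file belongs to a pluggable database (PDB).

As in the previous example, I first create a table and insert some data: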

$ sqlplus / as sysdba

SQL*Plus: Release 12.1.0.1.0 Production on [date]

SQL> alter pluggable database BR_PDB open;

Pluggable database altered.

SQL> show pdbs

    CON_ID CON_NAME OPEN MODE  RESTRICTED

---------- -------- ---------- ----------

         2 PDB$SEED READ ONLY  NO

...

         5 BR_PDB   READ WRITE NO

SQL>

$ sqlplus bane/welcome1@BR_PDB

SQL*Plus: Release 12.1.0.1.0 Production on [date]

SQL> create table TAB1 (n number, name varchar2(16)) tablespace USERS;

Table created.

SQL> insert into TAB1 values (1, 'CAT');

1 row created.

SQL> insert into TAB1 values (2, 'DOG');

1 row created.

SQL> commit;

Commit complete.

SQL> select t.name "Tablespace", f.name "Datafile"

from v$tablespace t, v$datafile f

where t.ts#=f.ts# and t.name='USERS';

Tablespace Datafile

---------- ---------------------------------------------

USERS      +DATA/CDB/054.../DATAFILE/users.588.860861901

SQL> select ROWID, NAME from TAB1;

ROWID              NAME

------------------ ----

AAAWYEABfAAAACDAAA CAT

AAAWYEABfAAAACDAAB DOG

SQL> select DBMS_ROWID.ROWID_BLOCK_NUMBER('AAAWYEABfAAAACDAAA') "Block number" from dual;

Block number

------------

       131

SQL>

Again we now have the data file name and the block number of the inserted data. Switch to the ASM environment and run the script:

$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/CDB/0548068A10AB14DEE053E273BB0A46D1/DATAFILE/users.588.860861901 131

kfed read dev=o/192.168.1.9/DATA_CD_03_exacelmel05 ausz=4194304 aunum=16212 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt

kfed read dev=o/192.168.1.11/DATA_CD_09_exacelmel07 ausz=4194304 aunum=16267 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt

This time find_block.pl generated two commands, which tells us that the file has two copies, i.e. this is a normal redundancy disk group. Run one of the commands:

$ kfed read dev=o/192.168.1.9/DATA_CD_03_exacelmel05 ausz=4194304 aunum=16212 blksz=8192 blknum=131 | grep -iv ^kf > block_131.txt

$

The content of the block is now in the text file block_131.txt, and it shows the DOG and CAT rows inserted above:

$ more block_131.txt

...

FD5106080 00000000 00000000 ...  [................]

      Repeat 501 times

FD5107FE0 00000000 00000000 ...  [........,......D]

FD5107FF0 012C474F 02C10202 ...  [OG,......CAT..,-]

$
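
The hex columns in the kfed output are 32-bit words, and in this listing they appear to be printed in little-endian order: the word 012C474F lines up with the characters `OG,` in the right-hand column only if its bytes are read back to front. A minimal Python sketch of that conversion follows. The helper name is hypothetical, and only the two words visible in the listing above are used, since the rest of the line is elided.

```python
import struct


def kfed_words_to_bytes(words):
    """Convert kfed hex words (e.g. '012C474F') to the underlying bytes,
    assuming each word is a little-endian 32-bit value."""
    return b"".join(struct.pack("<I", int(word, 16)) for word in words)


# The two words shown for offset FD5107FF0 in the listing above.
raw = kfed_words_to_bytes(["012C474F", "02C10202"])
print(raw)
```

The first three bytes are `O`, `G` and `,`, matching the start of the ASCII column. The bytes that follow (01 02 02 c1 02) are consistent with a row header and a two-byte NUMBER column holding the value 1, i.e. the start of the (1, 'CAT') row.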

Find any block

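find_block.pl can extract a block from any file in an ASM disk group, not only from data files. Just for fun, I ran the script against the control file and a random block in it: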

$ $ORACLE_HOME/perl/bin/perl find_block.pl +DATA/CDB/CONTROLFILE/current.289.843047837 5

kfed read dev=o/192.168.1.9/DATA_CD_10_exacelmel05 ausz=4194304 aunum=73 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt

kfed read dev=o/192.168.1.11/DATA_CD_01_exacelmel07 ausz=4194304 aunum=66 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt

kfed read dev=o/192.168.1.10/DATA_CD_04_exacelmel06 ausz=4194304 aunum=78 blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt

$

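Note that the script correctly worked out the control file block size (16 KB, unlike the 8 KB data block size) and that it generated three commands. Although disk group DATA is normal redundancy, the control file is high redundancy, i.e. it has three copies. In this respect the control file is treated like the ASM metadata files.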
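
The three commands above differ only in the disk and the AU number; the AU size, block size and block number are the same in each. A small Python sketch that pulls the key=value arguments out of the generated commands makes this easy to check. The function name is hypothetical, and the parsing assumes nothing beyond the command shape shown above.

```python
def parse_kfed_command(command):
    """Return the key=value arguments of a generated 'kfed read' command."""
    kfed_part = command.split("|", 1)[0]   # drop the '| grep ... > file' tail
    args = {}
    for token in kfed_part.split():
        if "=" in token:
            key, value = token.split("=", 1)
            args[key] = value
    return args


commands = [
    "kfed read dev=o/192.168.1.9/DATA_CD_10_exacelmel05 ausz=4194304 aunum=73 "
    "blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt",
    "kfed read dev=o/192.168.1.11/DATA_CD_01_exacelmel07 ausz=4194304 aunum=66 "
    "blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt",
    "kfed read dev=o/192.168.1.10/DATA_CD_04_exacelmel06 ausz=4194304 aunum=78 "
    "blksz=16384 blknum=5 | grep -iv ^kf > block_5.txt",
]

copies = [parse_kfed_command(c) for c in commands]
print(len({c["dev"] for c in copies}), "distinct disks")
print({c["blksz"] for c in copies}, "block size(s)")
print([int(c["aunum"]) for c in copies], "AU numbers")
```

Three distinct disks with one common block size of 16384 is what a triple-mirrored control file should look like.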

Conclusion

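The find_block.pl script uses dd or kfed commands to extract a block from a file in an ASM disk group. Most of the time we want to extract a block from a data file, but the script is not limited to data files: it can also extract blocks from control files, redo logs and any other ASM file.

If the file is external redundancy, the script prints a single command, and running that command extracts the block directly from the ASM disk.

If the file is normal redundancy, the script prints two commands, each of which extracts a copy of the block from a different disk. This can be useful, for example, when the logs report a corrupt data block that ASM cannot repair: the mirror copy can then be used to repair it.

If the file is high redundancy, the script prints three commands.

Finally, with this script you do not need to know the redundancy of the file, the block size or any other attribute. You only need the file name and the block number.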
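
When a file is mirrored, a natural follow-up is to compare the extracted copies. The Python sketch below reports the first byte at which two dumps differ. It is an illustration only: the function names and the file names in the usage comment are hypothetical. Note that the generated commands for one block all write to the same output file name, so each copy has to be saved under a different name before comparing.

```python
from typing import Optional


def first_difference(copy_a: bytes, copy_b: bytes) -> Optional[int]:
    """Return the offset of the first differing byte, or None if the copies match."""
    for offset, (a, b) in enumerate(zip(copy_a, copy_b)):
        if a != b:
            return offset
    if len(copy_a) != len(copy_b):
        # One dump is a strict prefix of the other.
        return min(len(copy_a), len(copy_b))
    return None


def compare_files(path_a: str, path_b: str) -> Optional[int]:
    """Compare two block dumps on disk byte by byte."""
    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        return first_difference(fa.read(), fb.read())


# Hypothetical usage, after saving each copy under its own name:
# compare_files("block_391_copy1.dd", "block_391_copy2.dd")
```

A result of `None` means the two copies are byte-for-byte identical; any other value is the offset to inspect first.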

Appendix: the script

#!$ORACLE_HOME/perl/bin/perl -w

#

# The find_block.pl constructs the command(s) to extract a block from ASM.

# For a complete info about this script see ASM Support Guy blog post:

# http://asmsupportguy.blogspot.com/2014/10/find-block-in-asm.html

#

# Copyright (C) 2014 Bane Radulovic

#

# This program is free software: you can redistribute it and/or modify it under

# the terms of the GNU General Public License as published by the Free Software

# Foundation, either version 3 of the License, or any later version.

# This program is distributed in the hope that it will be useful, but WITHOUT

# ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS

# FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details

# at http://www.gnu.org/licenses/.

#

# Version 1.00, Oct 2014

# The initial release.

#

# Version 1.01, Oct 2014

# Minor improvements.

#

# Version 1.02, Oct 2014

# Added support for AFD disks.

#

# Version 1.03, Nov 2014

# Added sanity checks, e.g. if the requested block is reasonable,

# if the specified filename is valid, etc.

#

# Version 1.04, Nov 2014

# Improved the check for Exadata storage cell based disk.

#

use strict;

use DBI;

use DBD::Oracle qw(:ora_session_modes);

use POSIX;

# Handle the version query

die "find_block.pl version 1.04\n"

 if ( @ARGV && $ARGV[0] =~ /^-v/i );

# Check the number of input arguments

die "Usage: \$ORACLE_HOME/perl/bin/perl find_block.pl filename block\n"

 unless ( @ARGV == 2 );

# Get the filename from the first input argument

my $filename = shift @ARGV;

# Check if the filename makes sense.

# The 'minimum' filename is +DGNAME/filename,

# i.e. it has to begin with the '+' followed by a disk group name,

# followed by at least one '/', followed by directory or file name...

die "Error: The $filename is not a valid file name.\n"

 unless ( $filename =~ /^\+\w/ && $filename =~ /\/\w/ );

# Get the disk group name out of the user specified filename

my $diskgroup_name = substr($filename, 1, index($filename, "/") -1 );

# Get the ASM file name out of the user specified filename

my $asmfile = substr($filename, rindex($filename, "/") +1 );

# Get the block number from the second input argument

my $block_number = shift @ARGV;

# Check if the block number is an integer

die "Usage: \$ORACLE_HOME/perl/bin/perl find_block.pl filename block\n"

 unless ( $block_number =~ /^\d+$/ );

# Check if the ASM SID is set

die "Error: ASM SID not set.\n"

 unless ( defined $ENV{ORACLE_SID} && $ENV{ORACLE_SID} =~ /\+ASM/ );

# Connect to the (local) ASM instance

my $dbh = DBI->connect('dbi:Oracle:', "", "", { ora_session_mode => ORA_SYSDBA })

 or die "$DBI::errstr\n";

# Check if the disk group exists and if it is mounted

my $group_number = &asm_diskgroup("group_number", $diskgroup_name);

die "Error: Disk group $diskgroup_name not mounted or does not exist.\n"

 unless ( $group_number );

# Check if the user specified file exists in the disk group

my $file_number = &asm_alias("file_number", $asmfile, $group_number);

die "Error: File $asmfile does not exist in disk group $diskgroup_name.\n"

 unless ( $file_number );

# Get the block size for the file

my $block_size = &asm_file("block_size", $group_number, $file_number);

# Get the number of blocks in the file

my $file_blocks = &asm_file("blocks", $group_number, $file_number);

# Check if the user specified block number makes sense

die "Error: Block range for file $asmfile is: 0 - $file_blocks.\n"

 unless ( $block_number >= 0 && $block_number <= $file_blocks );

# Get the disk group AU size

my $au_size = &asm_diskgroup("allocation_unit_size", $diskgroup_name);

# Work out the blocks per AU and the virtual extent number

my $blocks_per_au = $au_size/$block_size;

my $xnum_kffxp = floor($block_number/$blocks_per_au);

# Get the disk and AU numbers into the @disk_au array

my @disk_au = &asm_kffxp($file_number, $group_number, $xnum_kffxp);

die "Could not get any disk and AU numbers for file $asmfile.\n"

 unless ( @disk_au );

# Get the disk path(s) and generate the block extract command(s)

while ( @disk_au ) {

 # Do not assume anything

 my $storage_cell = "FALSE";

 # Get the disk number from @disk_au

 my $disk_number = shift @disk_au;

 # Get the AU number from @disk_au

 my $au_number = shift @disk_au;

 # Get the path for that disk number

 my $path = &asm_disk("path", $group_number, $disk_number);

 # If there is no path move to the next disk

 if ( ! $path ) {

  next;

  }

 # If ASMLIB is in use, the path will return ORCL:DISKNAME.

 # Set the path to /dev/oracleasm/disks/DISKNAME

 elsif ( $path =~ /ORCL:(.*)/ ) {

  $path = "/dev/oracleasm/disks/".$1;

  }

 # If ASM Filter Driver (AFD) is in use, the path will return AFD:DISKNAME.

 # Get the actual path from /dev/oracleafd/disks/DISKNAME

 elsif ( $path =~ /AFD:(.*)/ ) {

  if ( ! open AFDDISK, "/dev/oracleafd/disks/".$1 ) { next }

  else { chomp($path = <AFDDISK>) }

  }

 # For Exadata storage cell based disk, the path will start with o/IP address

 elsif ( $path =~ /^o\/\d{1,3}\./ ) {

  $storage_cell = "TRUE";

  }

 if ( $storage_cell eq "TRUE" ) {

  # Construct the kfed command for Exadata storage cell based disk

  # dev=$path ausz=$au_size aunum=$au_number blksz=$block_size blknum=$block_number

  # The grep filters out the kfed stuff

  print "kfed read dev=$path ausz=$au_size aunum=$au_number blksz=$block_size blknum=$block_number | grep -iv ^kf > block_$block_number.txt\n";

  }

 else {

  # Construct the dd command

  # if=$path bs=$block_size count=1 skip=$skip of=block_$block_number.dd

  my $skip=$au_number*$blocks_per_au + $block_number%$blocks_per_au;

  print "dd if=$path bs=$block_size count=1 skip=$skip of=block_$block_number.dd\n";

  }

 }

# We are done. Disconnect from the (local) ASM instance

$dbh->disconnect;

# Subs

# Get a column from v$asm_file for a given group number and file number

sub asm_file {

 my $col = shift @_;

 my $group_number = shift @_;

 my $file_number = shift @_;

 my $sql = $dbh->prepare("select $col from v\$asm_file where group_number=$group_number and file_number=$file_number");

 $sql->execute;

 my $col_value = $sql->fetchrow_array;

 $sql->finish;

 return $col_value;

 }

# Get a column from v$asm_alias for a given (file) name and group number

sub asm_alias {

 my $col = shift @_;

 my $name = shift @_;

 my $group_number = shift @_;

 my $sql = $dbh->prepare("select $col from v\$asm_alias where lower(name)=lower('$name') and group_number=$group_number");

 $sql->execute;

 my $col_value = $sql->fetchrow_array;

 $sql->finish;

 return $col_value;

 }

# Get a column from v$asm_diskgroup for a given disk group name

sub asm_diskgroup {

 my $col = shift @_;

 my $name = shift @_;

 my $sql = $dbh->prepare("select $col from v\$asm_diskgroup where name=upper('$name')");

 $sql->execute;

 my $col_value = $sql->fetchrow_array;

 $sql->finish;

 return $col_value;

 }

# Get a column from v$asm_disk for a given group number and disk number

sub asm_disk {

 my $col = shift @_;

 my $group_number = shift @_;

 my $disk_number = shift @_;

 my $sql = $dbh->prepare("select $col from v\$asm_disk where group_number=$group_number and disk_number=$disk_number");

 $sql->execute;

 my $col_value = $sql->fetchrow_array;

 $sql->finish;

 return $col_value;

 }

# Get the disk and AU numbers from x$kffxp for a given virtual extent number.

# This will return one row for an external redundancy file,

# two rows for a normal redundancy and three rows for a high redundancy.

# Well, it will return an array with disk and AU pairs, not rows.

sub asm_kffxp {

 my $file_number = shift @_;

 my $group_number = shift @_;

 my $xnum = shift @_;

 # The @disk_au array to hold the disk number, AU number rows

 my @disk_au;

 my $sql = $dbh->prepare("select disk_kffxp, au_kffxp from x\$kffxp where number_kffxp=$file_number and group_kffxp=$group_number and xnum_kffxp=$xnum");

 $sql->execute;

 # Expecting one disk number and one AU number per row

 while ( my @row = $sql->fetchrow_array) {

  # Add each (element of the) row to @disk_au array

  foreach ( @row ) { push @disk_au, $_ }

  }

 $sql->finish;

 return @disk_au;

 }
