我有一个这样的字符串存储((数组定义))的
PHP代码
- $code=' array(
- 0 => "a","a" => $GlobalScopeVar,"b" => array("nested"=>array(1,2,3)),"c" => function() use (&$VAR) { return isset($VAR) ? "defined" : "undefined" ; },); ';
有一个正则表达式来提取这个数组??,我的意思是我想要的东西
- $array=(
- 0 => '"a"','a' => '$GlobalScopeVar','b' => 'array("nested"=>array(1,3))','c' => 'function() use (&$VAR) { return isset($VAR) ? "defined" : "undefined" ; }',);
pD ::我做研究试图找到一个正则表达式,但没有发现.
pD2 :: godover of stackoverflow,让我现在赏金,我会提供400:3
pD3 ::这将在一个内部应用程序中使用,我需要提取一些PHP文件的数组被处理的部分,我试着用这个codepad.org/td6LVVme解释
即使您要求正则表达式,也可以使用纯PHP.
token_get_all
这里是关键功能.对于正则表达式检查
@HamZa’s answer出来.
这里的优点是它比正则表达式更有活力.正则表达式具有静态模式,而使用token_get_all,您可以在每个单个令牌之后决定要做什么.它甚至可以在必要时转义单引号和反斜杠,正则表达式不会做什么.
另外,在正则表达式中,即使有评论,你也有想象它应该做什么的问题;当您查看PHP代码时,什么代码更容易理解.
- $code = ' array(
- 0 => "a","string_literal",12345
- ); ';
- $token = token_get_all("<?PHP ".$code);
- $newcode = "";
- $i = 0;
- while (++$i < count($token)) { // enter into array; then start.
- if (is_array($token[$i]))
- $newcode .= $token[$i][1];
- else
- $newcode .= $token[$i];
- if ($token[$i] == "(") {
- $ending = ")";
- break;
- }
- if ($token[$i] == "[") {
- $ending = "]";
- break;
- }
- }
- // init variables
- $escape = 0;
- $wait_for_non_whitespace = 0;
- $parenthesis_count = 0;
- $entry = "";
- // main loop
- while (++$i < count($token)) {
- // don't match commas in func($a,$b)
- if ($token[$i] == "(" || $token[$i] == "{") // ( -> normal parenthesis; { -> closures
- $parenthesis_count++;
- if ($token[$i] == ")" || $token[$i] == "}")
- $parenthesis_count--;
- // begin new string after T_DOUBLE_ARROW
- if (!$escape && $wait_for_non_whitespace && (!is_array($token[$i]) || $token[$i][0] != T_WHITESPACE)) {
- $escape = 1;
- $wait_for_non_whitespace = 0;
- $entry .= "'";
- }
- // here is a T_DOUBLE_ARROW,there will be a string after this
- if (is_array($token[$i]) && $token[$i][0] == T_DOUBLE_ARROW && !$escape) {
- $wait_for_non_whitespace = 1;
- }
- // entry ended: comma reached
- if (!$parenthesis_count && $token[$i] == "," || ($parenthesis_count == -1 && $token[$i] == ")" && $ending == ")") || ($ending == "]" && $token[$i] == "]")) {
- // go back to the first non-whitespace
- $whitespaces = "";
- if ($parenthesis_count == -1 || ($ending == "]" && $token[$i] == "]")) {
- $cut_at = strlen($entry);
- while ($cut_at && ord($entry[--$cut_at]) <= 0x20); // 0x20 == " "
- $whitespaces = substr($entry,$cut_at + 1,strlen($entry));
- $entry = substr($entry,$cut_at + 1);
- }
- // $escape == true means: there was somewhere a T_DOUBLE_ARROW
- if ($escape) {
- $escape = 0;
- $newcode .= $entry."'";
- } else {
- $newcode .= "'".addcslashes($entry,"'\\")."'";
- }
- $newcode .= $whitespaces.($parenthesis_count?")":(($ending == "]" && $token[$i] == "]")?"]":","));
- // reset
- $entry = "";
- } else {
- // add actual token to $entry
- if (is_array($token[$i])) {
- $addChar = $token[$i][1];
- } else {
- $addChar = $token[$i];
- }
- if ($entry == "" && $token[$i][0] == T_WHITESPACE) {
- $newcode .= $addChar;
- } else {
- $entry .= $escape?str_replace(array("'","\\"),array("\\'","\\\\"),$addChar):$addChar;
- }
- }
- }
- //append remaining chars like whitespaces or ;
- $newcode .= $entry;
- print $newcode;
应输出:
- array(
- 0 => '"a"',"a" => '$GlobalScopeVar',"b" => 'array("nested"=>array(1,"c" => 'function() use (&$VAR) { return isset($VAR) ? "defined" : "undefined" ; }','"string_literal"','12345'
- )
你可以得到数组的数据,print_r(eval(“return $newcode;”));获取数组的条目:
- Array
- (
- [0] => "a"
- [a] => $GlobalScopeVar
- [b] => array("nested"=>array(1,3))
- [c] => function() use (&$VAR) { return isset($VAR) ? "defined" : "undefined" ; }
- [1] => "string_literal"
- [2] => 12345
- )