(PHP 4 >= 4.2.0, PHP 5)
mb_regex_encoding — Set/Get character encoding for multibyte regex
Set/Get character encoding for a multibyte regex.
encoding
encoding
参数为字符编码。如果省略,则使用内部字符编码。
If encoding
is set, then
成功时返回 TRUE
, 或者在失败时返回 FALSE
。
In this case, the internal character encoding is NOT changed.
If encoding
is omitted, then
the current character encoding name for a multibyte regex is returned.
php dot net at phor dot net (2010-08-31 13:40:25)
Beware, mb_regex_encoding does not support the same set of encodings as listed in mb_list_encodings.php
Example:
<?php
mb_internal_encoding('CP936');
mb_regex_encoding('CP936'); # this line produces an error
?>
Anonymous (2009-06-28 10:07:05)
To change algo the regex_encodign
<?php
echo "current mb_internal_encoding: ".mb_internal_encoding()."<br />";
echo "changing mb_internal_encoding to UTF-8<br />";
mb_internal_encoding("UTF-8");
echo "new mb_internal_encoding: ".mb_internal_encoding()."<br />";
echo "current mb_regex_encoding: ".mb_regex_encoding()."<br />";
echo "changing mb_regex_encoding to UTF-8<br />";
mb_regex_encoding('UTF-8');
echo "new mb_regex_encoding: ".mb_regex_encoding()."<br />";
?>
zl at zl dot hu (2009-05-29 06:05:51)
Return values vary in setting and getting:
<?php
echo mb_regex_encoding();
// returns encoding name as a string
?>
<?php
echo mb_regex_encoding("UTF-8");
// returns true (success) of false as a boolean
?>
/me (2008-02-02 06:30:13)
Note, that the regex encoding is only set to the initial internal character encoding. If you change the internal encoding in your script with mb_internal_encoding() the regex encoding does not change. For example:
<?php
echo "current mb_internal_encoding: ".mb_internal_encoding()."<br />";
echo "current mb_regex_encoding: ".mb_regex_encoding()."<br />";
echo "changing mb_internal_encoding to UTF-8<br />";
mb_internal_encoding("UTF-8");
echo "new mb_internal_encoding: ".mb_internal_encoding()."<br />";
echo "new mb_regex_encoding: ".mb_regex_encoding()."<br />";
?>
This code might output (depending on your initial internal character encoding):
current mb_internal_encoding: ISO-8859-1
current mb_regex_encoding: ISO-8859-1
changing mb_internal_encoding to UTF-8
new mb_internal_encoding: UTF-8
new mb_regex_encoding: ISO-8859-1
Furthermore the sentence "The default value is the internal character encoding." in the documentation might be misleading because the initial default value of the regex encoding is meant, and not the default value for the optional encoding parameter.