php - 为什么 PHP 会忽略任何字符串开头的换行符
问题描述
我期待来自我的 Laravel 微服务的请求有一个用户输入文本。
我的情况是用户在他们写的段落的开头输入了多个换行符。
代码应根据换行符“拆分”该文本并自行处理每个段落。
例如:我在请求中有这个字符串:
JSON:
{
"text": "\n\n\n\n\n\nHere you can find activities to practise your reading skills. Reading will help you to improve your understanding of the language and build your vocabulary.\n\nThe self-study lessons in this section are written and organised according to the levels of the Common European Framework of Reference for languages (CEFR).\n\nThere are different types of texts and interactive exercises that practise the reading skills you need to do well in your studies, to get ahead at work and to communicate in English in your free time."
}
我期待有这个数组:
Array
(
[0] => Array
(
[0] =>
)
[1] => Array
(
[0] =>
)
[2] => Array
(
[0] =>
)
[3] => Array
(
[0] =>
)
[4] => Array
(
[0] =>
)
[5] => Array
(
[0] =>
)
[6] => Array
(
[0] => Here you can find activities to practise your reading skills. Reading will help you to improve your understanding of the language and build your vocabulary.
)
[7] => Array
(
[0] =>
)
[8] => Array
(
[0] => The self-study lessons in this section are written and organised according to the levels of the Common European Framework of Reference for languages (CEFR).
)
[9] => Array
(
[0] =>
)
[10] => Array
(
[0] => There are different types of texts and interactive exercises that practise the reading skills you need to do well in your studies, to get ahead at work and to communicate in English in your free time.
)
)
但不幸的是,我有这个数组:
Array
(
[0] => Array
(
[0] => Here you can find activities to practise your reading skills. Reading will help you to improve your understanding of the language and build your vocabulary.
)
[1] => Array
(
[0] =>
)
[2] => Array
(
[0] => The self-study lessons in this section are written and organised according to the levels of the Common European Framework of Reference for languages (CEFR).
)
[3] => Array
(
[0] =>
)
[4] => Array
(
[0] => There are different types of texts and interactive exercises that practise the reading skills you need to do well in your studies, to get ahead at work and to communicate in English in your free time.
)
)
为了测试上述理论,我运行了这几行 PHP:
$stop = false;
$offset = 0;
while( !$stop ) {
$stop = (mb_substr($requestText, $offset, 1)!="\n");
$offset++;
}
print_r($offset);exit();
结果表明偏移变量为“1”;这意味着循环只运行一次并且没有在字符串的开头找到换行符。
问题是:如何根据换行符(包括字符串开头的换行符)(检测和计数)或(分解字符串)?
注意:我正在使用“mb_”系列函数(mb_substr、mb_strlen、...等),因为我期望在从右到左的语言中使用 UTF-8 编码的字符串。
** 添加 #1 ** 这是我的控制器:
class MyController extends BaseController
{
public function index(Request $request) {
$input = $request->all();
if(!isset($input) || empty($input)) {
return $this->returnJsonResponse($this->badRequestErrorStatus, "Bad Request: Please check the API documentation for its parameters.");
}
if(!isset($input["text"]) || empty($input["text"])) {
return $this->returnJsonResponse($this->badRequestErrorStatus, "Bad Requess: Please provide the text parameter.");
}
\Log::info("### New Request Measurements [Chunk Base: " .env("AI_MICROSERVICES_SPELLCHECKER_MAX_REQUEST_TEXT_CHARACTERS_LENGTH"). " Char] ###");
//--- Capture The Following Block Process Time
$startMilliseconds = microtime(true)*1000;
$data['text'] = $this->_chunkText($input["text"]);
$endMilliseconds = microtime(true)*1000;
\Log::info(" Chunking Process Time: (( " . ($endMilliseconds - $startMilliseconds) . " )) Milliseconds");
//---
}
/**
* Chunk the passed text according to Business rules.
*
* @param String $requestText
*
* @return array
*/
private function _chunkText($requestText) {
\Log::info("Chunking Process Starts:");
$stop = false;
$offset = 0;
while( !$stop ) {
$stop = (mb_substr($requestText, $offset, 1)!="\n");
$offset++;
}
// print_r($offset);exit();
}
解决方案
非常感谢 user3532758 我已禁用 App\Http\Kernel 类中的“TrimString”中间件,将其注释掉。
推荐阅读
- r - 生成重复并将矩阵转换为df
- java - 如何停止强制子类实现父构造函数
- c# - IIS 上托管的 Asp.Net Core 中间件响应重定向 URL 不完整
- amazon-web-services - Kubectl 命令给出连续的连接错误
- c# - 在(隔离的)Azure 函数中运行用户制作的 C# 代码是否安全?
- powerbi - DAX 有条件地从不同的列求和?
- typescript - 使用 Prisma ORM 将日期(仅)列作为字符串检索或映射到字符串而无需时间
- verilog - 我使用verilog 实现了一个RV32。但是输出总是0,即y=0
- ruby-on-rails-5 - Rails 5 资产预编译在生产中失败
- r - 漱口水非交互式服务帐户令牌身份验证不起作用