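sql - Parsing an address: T-SQL code that mimics these 8 lines of Perl?
Problem description
I have been handed an unparsed address from another program, and I need to store it as its components in the receiving system. I need some help! I'll wash your cat. Anything.
The good news is that I can count on the newlines. I can count on the comma + space after the city, and I can count on a two-character abbreviation for the state or province, followed by a space. So (without golfing it) I quickly wrote it in Perl to provide some working code.
The key point is that if we split the input on \n, I only want the second line/element (Address 1), the last line/element (Country), and the second-to-last element (City, ST zip). I then need to split that element into its components. My Perl code below works, but how can I recreate it in T-SQL?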
$_ = "Company\n".
"Address 1\n".
"Address 2 (opt)\n".
"Address 3 (opt)\n".
"City, ST zip\n".
"Country";
# also works for "City, PV zip zip\n"
@add = split('\n');
$address = $add[1]; # who cares about addy2 and addy3
$country = pop(@add);
$ctz = pop(@add);
if ($ctz =~ /(.*), (..) (.*)/) {
# Yes a $ctz line like "City of Angels, II, MO 65423" would break it
$city = $1;
$state = $2;
$zip = $3;
} else {
$city = $state = $zip = '';
}
print "Address: $address\n".
"City: $city\n".
"State Code: $state\n".
"Zip: $zip\n".
"Country: $country\n";
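Solution
Start with a string splitter borrowed heavily from Jeff Moden's, modified so that it can handle multi-character delimiters. It returns the delimited items in order, with an index column: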
CREATE FUNCTION [dbo].[DelimitedSplit8K]
--===== Define I/O parameters
(@pString VARCHAR(8000), @pDelimiter VARCHAR(16))
--WARNING!!! DO NOT USE MAX DATA-TYPES HERE! IT WILL KILL PERFORMANCE!
RETURNS TABLE WITH SCHEMABINDING AS
RETURN
--===== "Inline" CTE Driven "Tally Table" produces values from 1 up to 10,000...
-- enough to cover VARCHAR(8000)
WITH E1(N) AS (
SELECT 1 UNION ALL SELECT 1 UNION ALL SELECT 1 UNION ALL
SELECT 1 UNION ALL SELECT 1 UNION ALL SELECT 1 UNION ALL
SELECT 1 UNION ALL SELECT 1 UNION ALL SELECT 1 UNION ALL SELECT 1
), --10E+1 or 10 rows
E2(N) AS (SELECT 1 FROM E1 a, E1 b), --10E+2 or 100 rows
E4(N) AS (SELECT 1 FROM E2 a, E2 b), --10E+4 or 10,000 rows max
cteTally(N) AS (--==== This provides the "base" CTE and limits the number of rows right up front
-- for both a performance gain and prevention of accidental "overruns"
SELECT TOP (ISNULL(DATALENGTH(@pString),0)) ROW_NUMBER() OVER (ORDER BY (SELECT NULL)) FROM E4
),
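cteStart(N1) AS (--==== This returns the starting position of each "element": position 1, plus the position just past each delimiter
SELECT 1 UNION ALL
SELECT t.N + DATALENGTH(@pDelimiter) FROM cteTally t WHERE SUBSTRING(@pString, t.N, DATALENGTH(@pDelimiter)) = @pDelimiter -- DATALENGTH rather than LEN, because LEN ignores trailing spaces in the delimiter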
),
cteLen(N1,L1) AS(--==== Return start and length (for use in substring)
SELECT s.N1,
ISNULL(NULLIF(CHARINDEX(@pDelimiter,@pString,s.N1),0)-s.N1 ,8000)
FROM cteStart s
)
--===== Do the actual split. The ISNULL/NULLIF combo handles the length for the final element when no delimiter is found.
SELECT ItemNumber = ROW_NUMBER() OVER(ORDER BY l.N1),
Item = SUBSTRING(@pString, l.N1, l.L1)
FROM cteLen l;
Then turn it loose on your data:
declare @Newline as Char(2) = Char(13) + Char(10); -- This may need work to match your newlines.
declare @Sample as VarChar(1024) =
'Company' + @Newline +
'Address 1' + @Newline +
'Address 2 (opt)' + @Newline +
'Address 3 (opt)' + @Newline +
'City, ST zip' + @Newline +
'Country';
select *
from dbo.DelimitedSplit8K( @Sample, @Newline );
The remaining exercise is to work out how you want to handle the optional items.
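One possible way to finish the job, as a rough sketch rather than a definitive answer: pick item 2, the last item, and the second-to-last item by ItemNumber, then cut the "City, ST zip" line apart with PATINDEX. It assumes the dbo.DelimitedSplit8K function above has been created and that the lines are separated by CR+LF. The CTE names (Items, Picked), the column names, and the sample with only one optional line are made up for illustration. Note one difference from the Perl: the regex is greedy and uses the last ", xx " it can find, while PATINDEX returns the first.

```sql
-- Sketch only. Assumes dbo.DelimitedSplit8K exists and lines end in CR+LF.
-- Items, Picked, CTZ and Pos are illustrative names, not part of the answer above.
DECLARE @Newline CHAR(2) = CHAR(13) + CHAR(10);
DECLARE @Sample VARCHAR(1024) =
    'Company' + @Newline +
    'Address 1' + @Newline +
    'Address 2 (opt)' + @Newline +
    'City, ST zip' + @Newline +
    'Country';

WITH Items AS (
    -- Every line, its position, and the total number of lines
    SELECT ItemNumber,
           Item,
           ItemCount = COUNT(*) OVER ()
    FROM dbo.DelimitedSplit8K(@Sample, @Newline)
),
Picked AS (
    -- Pivot the three lines of interest into a single row
    SELECT Address = MAX(CASE WHEN ItemNumber = 2 THEN Item END),
           CTZ     = MAX(CASE WHEN ItemNumber = ItemCount - 1 THEN Item END),
           Country = MAX(CASE WHEN ItemNumber = ItemCount THEN Item END)
    FROM Items
)
SELECT p.Address,
       -- Pos is the position of the comma in ", xx "; 0 (or NULL) means no match,
       -- in which case all three parts come back empty, as in the Perl else branch
       City    = CASE WHEN m.Pos > 0 THEN LEFT(p.CTZ, m.Pos - 1) ELSE '' END,
       State   = CASE WHEN m.Pos > 0 THEN SUBSTRING(p.CTZ, m.Pos + 2, 2) ELSE '' END,
       Zip     = CASE WHEN m.Pos > 0 THEN SUBSTRING(p.CTZ, m.Pos + 5, 8000) ELSE '' END,
       p.Country
FROM Picked p
CROSS APPLY (SELECT Pos = PATINDEX('%, __ %', p.CTZ)) m;
```

Because the last two lines are located relative to ItemCount instead of by a fixed index, the query does not care how many optional address lines are present. If the input really can contain something like "City of Angels, II, MO 65423", the first-match behaviour of PATINDEX would need replacing, for example by searching the reversed string.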