首页 > 解决方案 > 如何解析 Python 的三引号 f 字符串?

问题描述

我有这段代码可以解析和处理普通的“f-string”模板字符串(请参阅下面的用法部分以获取示例):

from string import Formatter
import sys


_conversions = {'a': ascii, 'r': repr, 's': str}

def z(template, locals_=None):
    if locals_ is None:
        previous_frame = sys._getframe(1)
        previous_frame_locals = previous_frame.f_locals
        locals_ = previous_frame_locals
        # locals_ = globals()
    result = []
    parts = Formatter().parse(template)
    for part in parts:
        literal_text, field_name, format_spec, conversion = part
        if literal_text:
            result.append(literal_text)
        if not field_name:
            continue
        value = eval(field_name, locals_) #.__format__()
        if conversion:
            value = _conversions[conversion](value)
        if format_spec:
            value = format(value, format_spec)
        else:
            value = str(value)
        result.append(value)
    res = ''.join(result)
    return res

用法:

a = 'World'
b = 10
z('Hello {a} --- {a:^30} --- {67+b} --- {a!r}')
# "Hello World ---             World              --- 77 --- 'World'"

但如果模板字符串是这样的,它就不起作用:

z('''
echo monkey {z("curl -s https://www.poemist.com/api/v1/randompoems | jq --raw-output '.[0].content'")} end | sed -e 's/monkey/start/'
echo --------------
''')

它给出了这个错误:

  File "<string>", line 1
    z("curl -s https
                   ^
SyntaxError: EOL while scanning string literal

如果无法正常运行,我什至愿意从 Python 的源代码中复制代码以使其正常工作。

标签: pythonstringparsingquotingf-string

解决方案


感谢@ForceBru 的提示,我完成了这个。以下代码解析和处理源 tripe-quote f-strings:(忽略处理部分)

_conversions = {'a': ascii, 'r': repr, 's': str}

def zstring(self, template, locals_=None, getframe=1):
    if locals_ is None:
        previous_frame = sys._getframe(getframe)
        previous_frame_locals = previous_frame.f_locals
        locals_ = previous_frame_locals

    def asteval(astNode):
        if astNode is not None:
            return eval(compile(ast.Expression(astNode), filename='<string>', mode='eval'), locals_)
        else:
            return None

    def eatFormat(format_spec, code):
        res = False
        if format_spec:
            flags = format_spec.split(':')
            res = code in flags
            format_spec = list(filter(lambda a: a != code,flags))
        return ':'.join(format_spec), res


    p = ast.parse(f"f'''{template}'''")
    result = []
    parts = p.body[0].value.values
    for part in parts:
        typ = type(part)
        if typ is ast.Str:
            result.append(part.s)
        elif typ is ast.FormattedValue:
            # print(part.__dict__)

            value = asteval(part.value)
            conversion = part.conversion
            if conversion >= 0:
                # parser doesn't support custom conversions
                conversion = chr(conversion)
                value = self._conversions[conversion](value)

            format_spec = asteval(part.format_spec) or ''
            # print(f"orig format: {format_spec}")
            format_spec, fmt_eval = eatFormat(format_spec, 'e')
            format_spec, fmt_bool = eatFormat(format_spec, 'bool')
            # print(f"format: {format_spec}")
            if format_spec:
                value = format(value, format_spec)
            if fmt_bool:
                value = boolsh(value)

            value = str(value)
            if not fmt_eval:
                value = self.zsh_quote(value)
            result.append(value)
    cmd = ''.join(result)
    return cmd

推荐阅读