Understanding LCEL in Depth: The "Soul" of Composable LangChain Components

This article was last updated 465 days ago, and some of its information may be out of date. If you spot an error, please email lvlvko233@qq.com

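GPT TL;DR

Key points recap

Simpler component composition:
- Traditional approach: explicitly create several LLMChain instances and chain them together.
- LCEL approach: the pipe operator | simplifies composition and works with many LangChain components (PromptTemplate, ChatModel, Retriever, OutputParser, Tool and so on), and even with user-defined Python functions.
The central role of Runnable:
- Definition: Runnable is the core interface for composable components. It provides methods such as invoke, stream and batch, which lay the groundwork for composing components freely.
- Subclass: RunnableSerializable inherits from Runnable and adds serialization, making it the common base interface for LangChain components.
Several kinds of Runnable:
- RunnableSequence: chains multiple Runnable instances into a pipeline that executes in order.
- RunnableParallel: runs multiple Runnable instances in parallel to improve throughput.
- RunnableLambda: wraps a Python function as a Runnable so that functions can take part in LCEL composition.
- RunnableBranch: implements conditional branching, similar to RouterChain, to pick an execution path based on a condition.

Advantages of LCEL

Efficient development: concise syntax and a uniform interface speed up AI application development.
Flexibility: many kinds of components and custom functions can be composed and combined freely.
Extensibility: clearly defined input/output interfaces and abstract methods make it easy to extend the system and add new components later.

Closing remarks

LCEL (LangChain Expression Language) follows the design idea that "every component is composable", which makes AI application development more convenient and efficient. I hope this article helps you understand LangChain's internals; more articles on LangChain components will follow, so stay tuned.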

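Preface

LangChain is by now the de facto standard framework for building applications on large language models. As early as its first stable release, v0.1.0, LangChain had already put forward a new paradigm: the LangChain Expression Language, or LCEL for short. This article walks through the design thinking behind LCEL and how it is actually implemented.

Design idea: every component is composable

Back in September of last year I was working at Tencent on the backend of an AI application, which was also built on LangChain. One feature required us to initialize three different LLMChain instances and chain them together. This is how we did it at the time: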

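from langchain.chains import LLMChain, SequentialChain

# prompt1..prompt3 and llm are assumed to be defined elsewhere
chain1 = LLMChain(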
    prompt=prompt1,
    llm=llm,
)
chain2 = LLMChain(
    prompt=prompt2,
    llm=llm,
)
chain3 = LLMChain(
    prompt=prompt3,
    llm=llm,
)

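# Note: a real SequentialChain also needs input_variables and matching
# input/output keys between the chains; they are omitted in this excerpt.
overall_chain = SequentialChain(
    chains=[chain1, chain2, chain3],
)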

That chains the three chains together. It looks simple enough, but once we started using LCEL we found it could be simpler still:

chain1 = prompt1 | llm
chain2 = prompt2 | llm
chain3 = prompt3 | llm

overall_chain = chain1 | chain2 | chain3

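We simply join prompt and llm with the pipe-like operator "|" to get the equivalent of an LLMChain, and join chain to chain to get the equivalent of a SequentialChain! (In real code, each chain's output usually has to be adapted to the next prompt's input, for example with an output parser and a small mapping step; the snippet above only shows the shape of the composition.)

What is exciting is that this is not limited to PromptTemplate and ChatModel. Practically every LangChain component you have come across can be joined with the pipe operator: Retriever, OutputParser, Tool and so on. Even a Python function you wrote yourself.

This already gives a glimpse of the thinking behind LCEL. Most LangChain components have well-defined inputs and outputs. For the common ones: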

Component	Input	Output
LLM (ChatModel)	List[Message]	AIMessage
PromptTemplate	Dict[str, Any]	PromptValue (wraps List[Message])
OutputParser	Any	Any
Retriever	str	List[Document]
Tool	Dict[str, Any]	Any

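Think it through: if we give these LangChain components one uniform contract and implement a few magic methods on top of it, wouldn't that be enough to compose components freely?

That is indeed how it works. Next, let's look at how LCEL lays down that uniform contract under the hood.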
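
To make the idea concrete before reading the real source, here is a toy sketch of how a shared invoke method plus the __or__ magic method is enough to get pipe-style composition. This is not LangChain's code: the Step class and the three stand-in components are made up for illustration.

```python
from typing import Any, Callable


class Step:
    """Hypothetical minimal component: one input in, one output out."""

    def __init__(self, func: Callable[[Any], Any]) -> None:
        self.func = func

    def invoke(self, value: Any) -> Any:
        return self.func(value)

    def __or__(self, other: "Step") -> "Step":
        # `a | b` calls a.__or__(b). The result is a new Step that runs
        # `a` first and feeds its output into `b`.
        return Step(lambda value: other.invoke(self.invoke(value)))


# Stand-ins for a prompt template, a model and an output parser.
prompt = Step(lambda d: f"Tell me a joke about {d['topic']}")
fake_llm = Step(lambda text: {"content": text.upper()})
parser = Step(lambda msg: msg["content"])

chain = prompt | fake_llm | parser
result = chain.invoke({"topic": "cats"})
print(result)
```

Because every piece exposes the same invoke method, the composed chain is itself a Step and can be piped further.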

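Runnable: the engine behind LCEL

A popular idea in AI applications right now is pipelining: knowledge bases, prompts, tools and everything else should be freely composable. If you build a composable AI application with LangChain, you will have run into Runnable at some point, because Runnable is the core of LangChain's composability. Enough talk, let's read the source.

Runnable: the lowest-level abstraction

Start with the source. The definition of Runnable is easy to find in the module langchain_core.runnables.base: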

class Runnable(Generic[Input, Output], ABC):
    """A unit of work that can be invoked, batched, streamed, transformed and composed."""

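As you can see, Runnable inherits from Generic, parameterized with the type variables Input and Output, and from the abstract base class ABC. That gives it the following traits:

- Generic is somewhat reminiscent of C++ templates, but in dynamically typed Python it mainly serves as a type annotation: every Runnable and every subclass declares its own Input and Output types.
- ABC marks Runnable as the lowest-level abstraction, which defines abstract methods that subclasses must implement.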
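
The two traits can be tried out in isolation. The sketch below uses a made-up MiniRunnable class (not the real Runnable) to show that the Generic parameters only document the types, while ABC actually blocks instantiation of a subclass that forgets to override invoke.

```python
from abc import ABC, abstractmethod
from typing import Generic, Optional, TypeVar

Input = TypeVar("Input")
Output = TypeVar("Output")


class MiniRunnable(Generic[Input, Output], ABC):
    """Hypothetical stand-in for Runnable, reduced to one abstract method."""

    @abstractmethod
    def invoke(self, input: Input) -> Output:
        """Transform a single input into an output."""


class StrLength(MiniRunnable[str, int]):
    """Declares str -> int; the annotation is for readers and type checkers."""

    def invoke(self, input: str) -> int:
        return len(input)


class Incomplete(MiniRunnable[str, int]):
    """Forgets to override invoke, so it stays abstract."""


length = StrLength().invoke("hello")

instantiation_error: Optional[TypeError] = None
try:
    Incomplete()
except TypeError as exc:
    # ABC refuses to instantiate a class with unimplemented abstract methods.
    instantiation_error = exc

print(length, type(instantiation_error).__name__)
```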

Now let's see which interfaces Runnable provides:

    @abstractmethod
    def invoke(self, input: Input, config: Optional[RunnableConfig] = None) -> Output:
        """Transform a single input into an output. Override to implement.
        """

    def stream(
        self,
        input: Input,
        config: Optional[RunnableConfig] = None,
        **kwargs: Optional[Any],
    ) -> Iterator[Output]:

    def batch(
        self,
        inputs: List[Input],
        config: Optional[Union[RunnableConfig, List[RunnableConfig]]] = None,
        *,
        return_exceptions: bool = False,
        **kwargs: Optional[Any],
    ) -> List[Output]:

    def assign(
        self,
        **kwargs: Union[
            Runnable[Dict[str, Any], Any],
            Callable[[Dict[str, Any]], Any],
            Mapping[
                str,
                Union[Runnable[Dict[str, Any], Any], Callable[[Dict[str, Any]], Any]],
            ],
        ],
    ) -> RunnableSerializable[Any, Any]:

    def pick(self, keys: Union[str, List[str]]) -> RunnableSerializable[Any, Any]:

    @beta_decorator.beta(message="This API is in beta and may change in the future.")
    async def astream_events(
        self,
        input: Any,
        config: Optional[RunnableConfig] = None,
        *,
        version: Literal["v1"],
        include_names: Optional[Sequence[str]] = None,
        include_types: Optional[Sequence[str]] = None,
        include_tags: Optional[Sequence[str]] = None,
        exclude_names: Optional[Sequence[str]] = None,
        exclude_types: Optional[Sequence[str]] = None,
        exclude_tags: Optional[Sequence[str]] = None,
        **kwargs: Any,
    ) -> AsyncIterator[StreamEvent]:

    def __or__(
        self,
        other: Union[
            Runnable[Any, Other],
            Callable[[Any], Other],
            Callable[[Iterator[Any]], Iterator[Other]],
            Mapping[str, Union[Runnable[Any, Other], Callable[[Any], Other], Any]],
        ],
    ) -> RunnableSerializable[Input, Other]:
        """Compose this runnable with another object to create a RunnableSequence."""
        return RunnableSequence(self, coerce_to_runnable(other))

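These are essentially the most commonly used Runnable methods. Of these, invoke is the abstract one: any class that inherits from Runnable must override it, and the default implementations of stream, batch and the like are built on invoke. You can also see that Runnable implements __or__, which composes Runnables through RunnableSequence. The RunnableSerializable class in the return type annotation is covered below.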
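
The relationship between invoke and the other methods can be sketched in a few lines. This is a simplified illustration under my own assumptions, not the library's implementation: MiniRunnable and Double are hypothetical, and the real methods also handle config, callbacks and exceptions.

```python
from abc import ABC, abstractmethod
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Iterator, List


class MiniRunnable(ABC):
    """Hypothetical base class: subclasses only have to write invoke."""

    @abstractmethod
    def invoke(self, input: Any) -> Any:
        ...

    def stream(self, input: Any) -> Iterator[Any]:
        # Fallback streaming: a single chunk holding the whole result.
        yield self.invoke(input)

    def batch(self, inputs: List[Any]) -> List[Any]:
        # Fallback batching: run invoke for every input on a thread pool.
        if not inputs:
            return []
        with ThreadPoolExecutor() as executor:
            # executor.map returns results in the order of the inputs.
            return list(executor.map(self.invoke, inputs))


class Double(MiniRunnable):
    def invoke(self, input: int) -> int:
        return input * 2


doubler = Double()
single = doubler.invoke(21)
chunks = list(doubler.stream(21))
batched = doubler.batch([1, 2, 3])
print(single, chunks, batched)
```

A subclass that can do better, for example a model that streams tokens, would override stream or batch itself instead of relying on these fallbacks.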

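Next, its subclasses. Reading through the source, you will find that most of them do not inherit from Runnable directly but from a class called RunnableSerializable (a few, such as RunnableLambda shown later, subclass Runnable itself). RunnableSerializable additionally inherits from Serializable, a LangChain utility base class that makes components easy to serialize.

RunnableSerializable: the common interface for components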

class RunnableSerializable(Serializable, Runnable[Input, Output]):
    """Runnable that can be serialized to JSON."""

class Serializable(BaseModel, ABC):
    """Serializable base class."""

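From this we can tell that RunnableSerializable is the interface that LangChain components uniformly inherit from, while Runnable just supplies the basic abstraction underneath.

RunnableSequence: the subclass that chains Runnables

RunnableSequence feels a lot like SequentialChain. Here is the key source, which is quite readable: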

class RunnableSequence(RunnableSerializable[Input, Output]):
    """Sequence of Runnables, where the output of each is the input of the next."""
    first: Runnable[Input, Any]
    """The first runnable in the sequence."""
    middle: List[Runnable[Any, Any]] = Field(default_factory=list)
    """The middle runnables in the sequence."""
    last: Runnable[Any, Output]
    """The last runnable in the sequence."""

    @property
    def steps(self) -> List[Runnable[Any, Any]]:
        """All the runnables that make up the sequence in order."""
        return [self.first] + self.middle + [self.last]

    def __or__(
        self,
        other: Union[
            Runnable[Any, Other],
            Callable[[Any], Other],
            Callable[[Iterator[Any]], Iterator[Other]],
            Mapping[str, Union[Runnable[Any, Other], Callable[[Any], Other], Any]],
        ],
    ) -> RunnableSerializable[Input, Other]:
        if isinstance(other, RunnableSequence):
            return RunnableSequence(
                self.first,
                *self.middle,
                self.last,
                other.first,
                *other.middle,
                other.last,
                name=self.name or other.name,
            )
        else:
            return RunnableSequence(
                self.first,
                *self.middle,
                self.last,
                coerce_to_runnable(other),
                name=self.name,
            )


    def invoke(self, input: Input, config: Optional[RunnableConfig] = None) -> Output:
        from langchain_core.beta.runnables.context import config_with_context

        # setup callbacks and context
        config = config_with_context(ensure_config(config), self.steps)
        callback_manager = get_callback_manager_for_config(config)
        # start the root run
        run_manager = callback_manager.on_chain_start(
            dumpd(self),
            input,
            name=config.get("run_name") or self.get_name(),
            run_id=config.pop("run_id", None),
        )

        # invoke all steps in sequence
        try:
            for i, step in enumerate(self.steps):
                input = step.invoke(
                    input,
                    # mark each step as a child run
                    patch_config(
                        config, callbacks=run_manager.get_child(f"seq:step:{i+1}")
                    ),
                )
        # finish the root run
        except BaseException as e:
            run_manager.on_chain_error(e)
            raise
        else:
            run_manager.on_chain_end(input)
            return cast(Output, input)

By now it should be clicking: RunnableSequence exists to chain Runnables together. Remember the earlier example? After overall_chain = chain1 | chain2 | chain3 runs, overall_chain is a RunnableSequence object.
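
One detail of the __or__ excerpt is easy to miss: when both sides are sequences, their steps are spliced into one flat sequence rather than nested. The sketch below reproduces just that behavior with a hypothetical Seq class over plain callables; it leaves out callbacks, config and type coercion.

```python
from typing import Any, Callable, List


class Seq:
    """Hypothetical mini RunnableSequence over plain callables."""

    def __init__(self, *steps: Callable[[Any], Any]) -> None:
        if len(steps) < 2:
            raise ValueError("a sequence needs at least two steps")
        self.first = steps[0]
        self.middle: List[Callable[[Any], Any]] = list(steps[1:-1])
        self.last = steps[-1]

    @property
    def steps(self) -> List[Callable[[Any], Any]]:
        return [self.first, *self.middle, self.last]

    def __or__(self, other: Any) -> "Seq":
        if isinstance(other, Seq):
            # Merge both step lists instead of nesting one Seq in the other.
            return Seq(*self.steps, *other.steps)
        return Seq(*self.steps, other)

    def invoke(self, value: Any) -> Any:
        # The output of each step becomes the input of the next.
        for step in self.steps:
            value = step(value)
        return value


def add_one(x: int) -> int:
    return x + 1


def double(x: int) -> int:
    return x * 2


left = Seq(add_one, double)
right = Seq(double, add_one)
combined = left | right | str
print(len(combined.steps), combined.invoke(1))
```

Keeping the sequence flat means invoke is a single loop over steps, however the chain was assembled.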

RunnableParallel: the subclass that runs Runnables in parallel

I rarely use RunnableParallel myself, so again here is the key source; a rough grasp of how it works is enough.

class RunnableParallel(RunnableSerializable[Input, Dict[str, Any]]):
    steps__: Mapping[str, Runnable[Input, Any]]

    def invoke(
        self, input: Input, config: Optional[RunnableConfig] = None
    ) -> Dict[str, Any]:
        from langchain_core.callbacks.manager import CallbackManager

        # setup callbacks
        config = ensure_config(config)
        callback_manager = CallbackManager.configure(
            inheritable_callbacks=config.get("callbacks"),
            local_callbacks=None,
            verbose=False,
            inheritable_tags=config.get("tags"),
            local_tags=None,
            inheritable_metadata=config.get("metadata"),
            local_metadata=None,
        )
        # start the root run
        run_manager = callback_manager.on_chain_start(
            dumpd(self),
            input,
            name=config.get("run_name") or self.get_name(),
            run_id=config.pop("run_id", None),
        )

        # gather results from all steps
        try:
            # copy to avoid issues from the caller mutating the steps during invoke()
            steps = dict(self.steps__)
            with get_executor_for_config(config) as executor:
                futures = [
                    executor.submit(
                        step.invoke,
                        input,
                        # mark each step as a child run
                        patch_config(
                            config,
                            callbacks=run_manager.get_child(f"map:key:{key}"),
                        ),
                    )
                    for key, step in steps.items()
                ]
                output = {key: future.result() for key, future in zip(steps, futures)}
        # finish the root run
        except BaseException as e:
            run_manager.on_chain_error(e)
            raise
        else:
            run_manager.on_chain_end(output)
            return output

As it turns out, RunnableParallel just spins up a thread pool to invoke every runnable in the map. Nothing fancy.
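
Stripped of callbacks and config, the fan-out logic fits in one small function. The helper below, run_parallel, is a made-up name and works on plain callables; it only mirrors the submit-then-collect shape of the excerpt.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, Mapping


def run_parallel(
    steps: Mapping[str, Callable[[Any], Any]], value: Any
) -> Dict[str, Any]:
    """Hypothetical helper: call every step with the same input, keyed by name."""
    steps = dict(steps)  # copy so the caller cannot mutate it mid-run
    with ThreadPoolExecutor() as executor:
        futures = [executor.submit(step, value) for step in steps.values()]
        # zip pairs each key with its future in insertion order; result()
        # blocks until that step is done and re-raises any exception it hit.
        return {key: future.result() for key, future in zip(steps, futures)}


output = run_parallel(
    {
        "upper": str.upper,
        "length": len,
        "reversed": lambda s: s[::-1],
    },
    "lcel",
)
print(output)
```

Since this relies on threads, the gain comes mainly when the steps wait on I/O, such as model or retriever calls, rather than on CPU-bound work.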

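RunnableLambda: wrapping a function as a Runnable

LCEL has one more feature: the pipe "|" can connect a function directly to a Runnable. Under the hood the function is converted into a RunnableLambda. Again, here is the key source: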

class RunnableLambda(Runnable[Input, Output]):
    """RunnableLambda converts a python callable into a Runnable."""
    def invoke(
        self,
        input: Input,
        config: Optional[RunnableConfig] = None,
        **kwargs: Optional[Any],
    ) -> Output:
        """Invoke this runnable synchronously."""
        if hasattr(self, "func"):
            return self._call_with_config(
                self._invoke,
                input,
                self._config(config, self.func),
                **kwargs,
            )
        else:
            raise TypeError(
                "Cannot invoke a coroutine function synchronously."
                "Use `ainvoke` instead."
            )

    def _invoke(
        self,
        input: Input,
        run_manager: CallbackManagerForChainRun,
        config: RunnableConfig,
        **kwargs: Any,
    ) -> Output:
        if inspect.isgeneratorfunction(self.func):
            output: Optional[Output] = None
            for chunk in call_func_with_variable_args(
                cast(Callable[[Input], Iterator[Output]], self.func),
                input,
                config,
                run_manager,
                **kwargs,
            ):
                if output is None:
                    output = chunk
                else:
                    try:
                        output = output + chunk  # type: ignore[operator]
                    except TypeError:
                        output = chunk
        else:
            output = call_func_with_variable_args(
                self.func, input, config, run_manager, **kwargs
            )
        # If the output is a runnable, invoke it
        if isinstance(output, Runnable):
            recursion_limit = config["recursion_limit"]
            if recursion_limit <= 0:
                raise RecursionError(
                    f"Recursion limit reached when invoking {self} with input {input}."
                )
            output = output.invoke(
                input,
                patch_config(
                    config,
                    callbacks=run_manager.get_child(),
                    recursion_limit=recursion_limit - 1,
                ),
            )
        return cast(Output, output)

As I understand it, RunnableLambda is just a thin wrapper around func whose only purpose is to turn a function into a Runnable.
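
Two branches of _invoke are worth seeing in action: chunks from a generator function are folded together with +, and a function that returns another runnable has that runnable invoked on the same input. The sketch below imitates both with a hypothetical MiniLambda class; it omits config, callbacks and the recursion limit.

```python
import inspect
from typing import Any, Callable, Iterator


class MiniLambda:
    """Hypothetical wrapper giving a plain function an invoke method."""

    def __init__(self, func: Callable[..., Any]) -> None:
        self.func = func

    def invoke(self, value: Any) -> Any:
        if inspect.isgeneratorfunction(self.func):
            # Generator functions yield chunks; fold them together with `+`.
            output = None
            for chunk in self.func(value):
                if output is None:
                    output = chunk
                else:
                    try:
                        output = output + chunk
                    except TypeError:
                        output = chunk  # chunks cannot be added: keep the latest
        else:
            output = self.func(value)
        # A function may hand back another wrapped step; run it on the same input.
        if isinstance(output, MiniLambda):
            output = output.invoke(value)
        return output


def shout(text: str) -> str:
    return text.upper() + "!"


def spell(text: str) -> Iterator[str]:
    for char in text:
        yield char + "-"


def route(text: str) -> MiniLambda:
    # Choose a step at call time and return it instead of a value.
    return MiniLambda(shout) if len(text) < 5 else MiniLambda(spell)


plain = MiniLambda(shout).invoke("hi")
joined = MiniLambda(spell).invoke("abc")
routed = MiniLambda(route).invoke("hey")
print(plain, joined, routed)
```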

RunnableBranch

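RunnableBranch plays a role similar to the old RouterChain: it implements branch selection. Below are the two most important pieces, the constructor and invoke; reading them is enough to understand how it works.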

class RunnableBranch(RunnableSerializable[Input, Output]):
    """Runnable that selects which branch to run based on a condition.

    The Runnable is initialized with a list of (condition, Runnable) pairs and
    a default branch.

    When operating on an input, the first condition that evaluates to True is
    selected, and the corresponding Runnable is run on the input.

    If no condition evaluates to True, the default branch is run on the input.

    Examples:

        .. code-block:: python

            from langchain_core.runnables import RunnableBranch

            branch = RunnableBranch(
                (lambda x: isinstance(x, str), lambda x: x.upper()),
                (lambda x: isinstance(x, int), lambda x: x + 1),
                (lambda x: isinstance(x, float), lambda x: x * 2),
                lambda x: "goodbye",
            )

            branch.invoke("hello") # "HELLO"
            branch.invoke(None) # "goodbye"
    """

    branches: Sequence[Tuple[Runnable[Input, bool], Runnable[Input, Output]]]
    default: Runnable[Input, Output]

    def __init__(
        self,
        *branches: Union[
            Tuple[
                Union[
                    Runnable[Input, bool],
                    Callable[[Input], bool],
                    Callable[[Input], Awaitable[bool]],
                ],
                RunnableLike,
            ],
            RunnableLike,  # To accommodate the default branch
        ],
    ) -> None:
        """A Runnable that runs one of two branches based on a condition."""
        if len(branches) < 2:
            raise ValueError("RunnableBranch requires at least two branches")

        default = branches[-1]

        if not isinstance(
            default,
            (Runnable, Callable, Mapping),  # type: ignore[arg-type]
        ):
            raise TypeError(
                "RunnableBranch default must be runnable, callable or mapping."
            )

        default_ = cast(
            Runnable[Input, Output], coerce_to_runnable(cast(RunnableLike, default))
        )

        _branches = []

        for branch in branches[:-1]:
            if not isinstance(branch, (tuple, list)):  # type: ignore[arg-type]
                raise TypeError(
                    f"RunnableBranch branches must be "
                    f"tuples or lists, not {type(branch)}"
                )

            if not len(branch) == 2:
                raise ValueError(
                    f"RunnableBranch branches must be "
                    f"tuples or lists of length 2, not {len(branch)}"
                )
            condition, runnable = branch
            condition = cast(Runnable[Input, bool], coerce_to_runnable(condition))
            runnable = coerce_to_runnable(runnable)
            _branches.append((condition, runnable))

        super().__init__(branches=_branches, default=default_)  # type: ignore[call-arg]

    def invoke(
        self, input: Input, config: Optional[RunnableConfig] = None, **kwargs: Any
    ) -> Output:
        """First evaluates the condition, then delegate to true or false branch."""
        config = ensure_config(config)
        callback_manager = get_callback_manager_for_config(config)
        run_manager = callback_manager.on_chain_start(
            dumpd(self),
            input,
            name=config.get("run_name"),
            run_id=config.pop("run_id", None),
        )

        try:
            for idx, branch in enumerate(self.branches):
                condition, runnable = branch

                expression_value = condition.invoke(
                    input,
                    config=patch_config(
                        config,
                        callbacks=run_manager.get_child(tag=f"condition:{idx + 1}"),
                    ),
                )

                if expression_value:
                    output = runnable.invoke(
                        input,
                        config=patch_config(
                            config,
                            callbacks=run_manager.get_child(tag=f"branch:{idx + 1}"),
                        ),
                        **kwargs,
                    )
                    break
            else:
                output = self.default.invoke(
                    input,
                    config=patch_config(
                        config, callbacks=run_manager.get_child(tag="branch:default")
                    ),
                    **kwargs,
                )
        except BaseException as e:
            run_manager.on_chain_error(e)
            raise
        run_manager.on_chain_end(dumpd(output))
        return output

Looking back over all of this, Runnable subclasses seem to follow a common pattern: __init__ sets up the member variables, and invoke implements the actual logic.
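
That pattern is small enough to reproduce by hand. As a rough sketch, here is a hypothetical MiniBranch that follows it: __init__ splits the arguments into (condition, handler) pairs plus a default, and invoke uses the same for/else first-match logic as the RunnableBranch excerpt. It works on plain callables and skips coercion and callbacks.

```python
from typing import Any, Callable, List, Tuple

Condition = Callable[[Any], bool]
Handler = Callable[[Any], Any]


class MiniBranch:
    """Hypothetical mini RunnableBranch over plain callables."""

    def __init__(self, *branches: Any) -> None:
        # Same calling convention as the excerpt: (condition, handler) pairs
        # followed by a bare default handler.
        if len(branches) < 2:
            raise ValueError("MiniBranch requires at least two branches")
        *pairs, default = branches
        self.branches: List[Tuple[Condition, Handler]] = [
            (condition, handler) for condition, handler in pairs
        ]
        self.default: Handler = default

    def invoke(self, value: Any) -> Any:
        for condition, handler in self.branches:
            if condition(value):
                output = handler(value)
                break
        else:
            # The loop ended without `break`, so no condition matched.
            output = self.default(value)
        return output


branch = MiniBranch(
    (lambda x: isinstance(x, str), lambda x: x.upper()),
    (lambda x: isinstance(x, int), lambda x: x + 1),
    (lambda x: isinstance(x, float), lambda x: x * 2),
    lambda x: "goodbye",
)

results = [branch.invoke(v) for v in ("hello", 41, 1.5, None)]
print(results)
```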

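Summary

Having gone through the source, we can see how important and how novel LCEL (LangChain Expression Language) is within the LangChain framework. Its design idea, "every component is composable", makes AI application development more convenient and efficient.

Key points recap

Simpler component composition:
- The traditional approach requires explicitly creating several LLMChain instances and chaining them together, whereas LCEL reduces this to the pipe operator |.
- This style of composition is not limited to LLMChain. It also applies to other LangChain components such as PromptTemplate, ChatModel, Retriever, OutputParser and Tool, and extends to user-defined Python functions.
The central role of Runnable:
- LCEL introduces Runnable as the core interface for composable components. Runnable defines a set of methods such as invoke, stream and batch, which provide the foundation for composing components freely.
- RunnableSerializable inherits from Runnable and adds serialization, making it the common base interface for LangChain components.
Several kinds of Runnable:
- RunnableSequence: chains multiple Runnable instances into a pipeline that executes in order.
- RunnableParallel: runs multiple Runnable instances in parallel to improve throughput.
- RunnableLambda: wraps a Python function as a Runnable so that functions can take part in LCEL composition.
- RunnableBranch: implements conditional branching, similar to RouterChain, to pick an execution path based on a condition.

Advantages of LCEL

Efficient development: with concise syntax and a uniform interface, LCEL speeds up AI application development considerably.
Flexibility: many kinds of components and custom functions can be composed, so developers can combine them to fit their specific needs.
Extensibility: clearly defined input/output interfaces and abstract methods give LCEL solid support for future extensions and new components.

I hope this helps you understand LangChain's internals more deeply. I plan to follow up with an article introducing LangChain's components, so stay tuned.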
