12.7 - lambda表达式简介（匿名函数）

"Key Takeaway"

请考虑下面这段代码，我们在 11.19 — 标准库算法简介中介绍过它：

#include <algorithm>
#include <array>
#include <iostream>
#include <string_view>
 
// Our function will return true if the element matches
bool containsNut(std::string_view str)
{
    // std::string_view::find returns std::string_view::npos if it doesn't find
    // the substring. Otherwise it returns the index where the substring occurs
    // in str.
    return (str.find("nut") != std::string_view::npos);
}
 
int main()
{
    std::array<std::string_view, 4> arr{ "apple", "banana", "walnut", "lemon" };
 
    // Scan our array to see if any elements contain the "nut" substring
    auto found{ std::find_if(arr.begin(), arr.end(), containsNut) };
 
    if (found == arr.end())
    {
        std::cout << "No nuts\n";
    }
    else
    {
        std::cout << "Found " << *found << '\n';
    }
 
    return 0;
}

这段代码会在一个数组中搜索字符串，找到第一个包含 “nut” 子串的元素。因此输出结果为：

Found walnut

虽然代码可以正确执行，但是还有可改进的地方。

这里问题的核心在于 std::find_if 要求使用者传递给它一个函数指针。因此，我们必须定义一个只使用一次的函数，还必须给它起个名，并将其放置在全局作用域中(因为函数不能嵌套!)。这个函数很短，从一行代码中几乎比从名称和注释中更容易辨别它的功能。

lambda

使用 lambda 表达式(有时候也称lambda或闭包或匿名函数) 可以在其他函数中定义匿名函数。能够嵌套定义是很重要的特性，它可以避免命名空间中的名称污染，而且可以将函数的定义尽可能靠近使用它的地方（避免额外的上下文）。

lambda 的语法是C++中最奇怪的东西之一，需要一点时间来适应。匿名函数形式是:

[ captureClause ] ( parameters ) -> returnType
{
    statements;
}

闭包语句可以为空，如果不需要捕获变量的话；
形参列表也可以为空，如果不需要形参的话；
返回类型是可选的，如果省略的话，会假定为 auto (使用返回值类型推断)。尽管我们之前说过，应该避免使用函数返回值的类型推断，但是在匿名表达式中是可以用的（因为这些函数通常都非常简单）。

因为lambda 没有函数名，所以不必为其起名。

一个基本的匿名函数定义看上去像是这样：

#include <iostream>
 
int main()
{
  [] {}; // 省略返回值类型, 无闭包，无参数
  
  return 0;
}

让我们用匿名函数重写上面的例子：

#include <algorithm>
#include <array>
#include <iostream>
#include <string_view>
 
int main()
{
  constexpr std::array<std::string_view, 4> arr{ "apple", "banana", "walnut", "lemon" };
 
  // Define the function right where we use it.
  const auto found{ std::find_if(arr.begin(), arr.end(),
                           [](std::string_view str) // 匿名函数，无闭包
                           {
                             return (str.find("nut") != std::string_view::npos);
                           }) };
 
  if (found == arr.end())
  {
    std::cout << "No nuts\n";
  }
  else
  {
    std::cout << "Found " << *found << '\n';
  }
 
  return 0;
}

和使用函数指针的情况一样，产生相同的结果:

Found walnut

可以看到，lambda 和之前的 containsNut 函数非常类型。它们的参数和函数体是完全一样的。这个lambda没有闭包（下节课介绍），因为没必要。不仅如此我们还省略了返回值类型，但是由于 operator!= 返回 bool，所以该lambda也返回布尔类型。

lambda 函数的类型

在上面的例子中，我们在需要的地方定义了匿名函数，它的这种用法有时被称为函数字面量。

然而，在同一行中编写lambda有时会降低代码的可读性。就像我们可以用字面值(或函数指针)初始化一个变量以供以后使用一样，我们也可以用lambda定义初始化一个变量，然后在以后使用它。一个有好名字的lambda函数可以使代码更容易阅读。

例如，在下面的代码片段中，我们使用std::all_of 来检查数组中的所有元素是否为偶数：

// Bad: We have to read the lambda to understand what's happening.
return std::all_of(array.begin(), array.end(), [](int i){ return ((i % 2) == 0); });

可以通过下面的方式提升可读性。

// Good: Instead, we can store the lambda in a named variable and pass it to the function.
auto isEven{
  [](int i)
  {
    return ((i % 2) == 0);
  }
};
 
return std::all_of(array.begin(), array.end(), isEven);

注意，现在最后一行代码可以很自然地被理解为：判断数组的所有元素是否都是偶数

不过，isEven 的类型是什么呢？

事实证明，lambdas没有可以显式使用的类型。当我们编写lambda时，编译器会为其生成一个唯一的类型，但对我们并不可见。

"扩展阅读"

实际上，lambda并不是函数(这是它们可以避免C++不支持嵌套函数的限制的部分原因)。它们是一种叫做functor的特殊对象。函子是包含重载的operator()的对象，使它们像函数一样可调用。

尽管我们并不知道lambda的类型，但是我们仍然有几种方法可以将其存放到某个变量中。如果一个lambda没有闭包，则可以使用一个普通的函数指针存放它。使用 std::function 或者通过 auto 进行类型推断也是可以的(即便此时lambda有闭包语句)。

#include <functional>
 
int main()
{
  // A regular function pointer. Only works with an empty capture clause (empty []).
  double (*addNumbers1)(double, double){
    [](double a, double b) {
      return (a + b);
    }
  };
 
  addNumbers1(1, 2);
 
  // Using std::function. The lambda could have a non-empty capture clause (discussed next lesson).
  std::function addNumbers2{ // note: pre-C++17, use std::function<double(double, double)> instead
    [](double a, double b) {
      return (a + b);
    }
  };
 
  addNumbers2(3, 4);
 
  // Using auto. Stores the lambda with its real type.
  auto addNumbers3{
    [](double a, double b) {
      return (a + b);
    }
  };
 
  addNumbers3(5, 6);
 
  return 0;
}

使用lambda的实际类型的唯一方法是通过 auto 。与 std::function 相比，auto 还有一个好处，那就是没有开销。

不幸的是，在C++ 20之前，我们不能总是使用auto 。在实际lambda未知的情况下(例如，因为我们将lambda作为参数传递给函数，由调用者决定将传入什么lambda)，我们不能在这种情况下使用 auto 。在这种情况下，可以使用std::function 来代替。

#include <functional>
#include <iostream>
 
// We don't know what fn will be. std::function works with regular functions and lambdas.
void repeat(int repetitions, const std::function<void(int)>& fn)
{
  for (int i{ 0 }; i < repetitions; ++i)
  {
    fn(i);
  }
}
 
int main()
{
  repeat(3, [](int i) {
    std::cout << i << '\n';
  });
 
  return 0;
}

输出

0
1
2

在这个例子中我们不能使用auto 关键字，这么做的话函数的调用者就无法知道fn应该有什么样的参数和返回值类型。这个限制在C++20引入缩写函数模板之后就不存在了。

此外，因为它们实际上是模板，具有 auto 参数的函数不能被分离到头文件和源文件中。

"法则"

需要用 lambda 初始化变量时，使用auto关键字。罪域不能用 lambda 初始化变量的场合，使用std::function。

泛型lambda函数

在大多数情况下，lambda形参的工作规则与常规函数形参相同。

一个值得注意的例外是，从C++ 14开始，我们允许使用auto 作为形参类型(注意:在c++ 20中，常规函数也可以使用 auto 作为形参类型)。当lambda有一个或多个 auto 形参时，编译器将从对lambda的调用中推断需要什么形参类型。

因为带有一个或多个 auto 参数的 lambda可以潜在地与各种类型一起工作，所以它们被称为泛型lambda。

"扩展阅读"

当在lambda的上下文中使用时，auto只是模板形参的缩写。

看一个泛型lambda的例子：

#include <algorithm>
#include <array>
#include <iostream>
#include <string_view>
 
int main()
{
  constexpr std::array months{ // pre-C++17 use std::array<const char*, 12>
    "January",
    "February",
    "March",
    "April",
    "May",
    "June",
    "July",
    "August",
    "September",
    "October",
    "November",
    "December"
  };
 
  // Search for two consecutive months that start with the same letter.
  const auto sameLetter{ std::adjacent_find(months.begin(), months.end(),
                                      [](const auto& a, const auto& b) {
                                        return (a[0] == b[0]);
                                      }) };
 
  // Make sure that two months were found.
  if (sameLetter != months.end())
  {
    // std::next returns the next iterator after sameLetter
    std::cout << *sameLetter << " and " << *std::next(sameLetter)
              << " start with the same letter\n";
  }
 
  return 0;
}

输出：

June and July start with the same letter

在上面的例子中，我们使用了 auto 形参来获取字符串的 const引用。由于所有的字符串类型都可以通过 operator[] 访问单个字符。所以我们无需关心传入字符串是std::string、C风格字符串还是其他字符串。这样一来，我们就可以编写介绍这些类型的lambda表达式，如果将来需要修改 months 的类型，也不必修改lambda。

不过，使用 auto 并不总是最佳的选择，考虑下面代码：

#include <algorithm>
#include <array>
#include <iostream>
#include <string_view>
 
int main()
{
  constexpr std::array months{ // pre-C++17 use std::array<const char*, 12>
    "January",
    "February",
    "March",
    "April",
    "May",
    "June",
    "July",
    "August",
    "September",
    "October",
    "November",
    "December"
  };
 
  // Count how many months consist of 5 letters
  const auto fiveLetterMonths{ std::count_if(months.begin(), months.end(),
                                       [](std::string_view str) {
                                         return (str.length() == 5);
                                       }) };
 
  std::cout << "There are " << fiveLetterMonths << " months with 5 letters\n";
 
  return 0;
}

输出：

There are 2 months with 5 letters

在这个例子中，如果使用 auto，则会推断出 const char*。由于 C 风格字符串并不易用（除了operator[]）。所以在这个例子中我们显式地定义参数为 std::string_view类型，这使得我们在处理数据时可以更容易。(例如，我们可以查询string view 的长度，即便传入的是一个C风格字符串数组)。

泛型lambda和静态变量

需要注意的一件事是，编译器将为 auto 解析出的每个不同类型生成唯一的lambda表达式。下面的例子展示了一个泛型lambda如何变成两个不同的lambda的：

#include <algorithm>
#include <array>
#include <iostream>
#include <string_view>
 
int main()
{
  // Print a value and count how many times @print has been called.
  auto print{
    [](auto value) {
      static int callCount{ 0 };
      std::cout << callCount++ << ": " << value << '\n';
    }
  };
 
  print("hello"); // 0: hello
  print("world"); // 1: world
 
  print(1); // 0: 1
  print(2); // 1: 2
 
  print("ding dong"); // 2: ding dong
 
  return 0;
}

输出：

0: hello
1: world
0: 1
1: 2
2: ding dong

在上面的例子中，我们定义了一个lambda，然后使用两个不同类型的参数对其进行了调用（一个是字符串字面量，另一个是整型参数）。这会生成两个不同版本的lambda(一个是字符串字面量形参，另一个带有整数形参)。

大多数时候，这是无关紧要的。但是，请注意，==如果泛型lambda使用静态变量，则生成的lambda之间不会共享这些变量。==

从上面的例子中可以看出这一点，其中每种类型(字符串字面量和整数)都有自己唯一的计数！虽然我们只写了一次lambda，但是生成了两个lambda——并且每个lambda都有自己的 callCount 。要在两个生成的lambda之间有一个共享计数器，我们必须在lambda外部定义一个全局变量或一个“静态”局部变量。正如在前面的课程中了学到的，全局和静态局部变量容易带来问题，而且会降低代码的可读性。在下一课中讨论lambda捕获之后，我们将能够避免这些变量。

返回值类型推断和尾随返回值类型

如果使用了返回值类型推断，则 lambda 的返回值类型需要从 return 语句进行推断，而且所有的返回类型必须相同（否则编译器将难以判断）：

例如：

#include <iostream>
 
int main()
{
  auto divide{ [](int x, int y, bool intDivision) { // note: no specified return type
    if (intDivision)
      return x / y; // return type is int
    else
      return static_cast<double>(x) / y; // ERROR: return type doesn't match previous return type
  } };
 
  std::cout << divide(3, 2, true) << '\n';
  std::cout << divide(3, 2, false) << '\n';
 
  return 0;
}

这将产生一个编译错误，因为第一个返回语句的返回类型(int)与第二个返回语句的返回类型(double)不匹配。

在返回不同类型的情况下，我们有两个选项：

是否显式强制转换以使所有返回类型匹配，还是
显式指定lambda的返回类型，并让编译器执行隐式转换。

第二种情况通常是更好的选择:

#include <iostream>
 
int main()
{
  // note: explicitly specifying this returns a double
  auto divide{ [](int x, int y, bool intDivision) -> double {
    if (intDivision)
      return x / y; // will do an implicit conversion of result to double
    else
      return static_cast<double>(x) / y;
  } };
 
  std::cout << divide(3, 2, true) << '\n';
  std::cout << divide(3, 2, false) << '\n';
 
  return 0;
}

通过这种方式，如果今后你决定修改返回值的类型，那么你通常只需要修改lambda的返回值类型即可，而不必修改其函数内容。

标准库中的函数对象

对于很多基本操作(例如：addition, negation 或 comparison) 来说，其实我们不需要自己编写lambda，因为标准库中提供很多可以使用的可调用对象。它们都定义在 <functional> 头文件中。

考虑下面的例子：

#include <algorithm>
#include <array>
#include <iostream>
 
bool greater(int a, int b)
{
  // Order @a before @b if @a is greater than @b.
  return (a > b);
}
 
int main()
{
  std::array arr{ 13, 90, 99, 5, 40, 80 };
 
  // Pass greater to std::sort
  std::sort(arr.begin(), arr.end(), greater);
 
  for (int i : arr)
  {
    std::cout << i << ' ';
  }
 
  std::cout << '\n';
 
  return 0;
}

输出：

99 90 80 40 13 5

在这个例子中，我们不需要将 greater 函数改写为lambda，而是可以直接使用 std::greater：

#include <algorithm>
#include <array>
#include <iostream>
#include <functional> // for std::greater
 
int main()
{
  std::array arr{ 13, 90, 99, 5, 40, 80 };
 
  // Pass std::greater to std::sort
  std::sort(arr.begin(), arr.end(), std::greater{}); // note: need curly braces to instantiate object
 
  for (int i : arr)
  {
    std::cout << i << ' ';
  }
 
  std::cout << '\n';
 
  return 0;
}

输出：

99 90 80 40 13 5

小结

与使用循环的解决方案相比，匿名函数和算法库可能稍显得不必要地复杂。然而，这种组合可以在短短几行代码中实现一些非常强大的操作，并且比自己编写循环更具有可读性。最重要的是，算法库具有强大且易于使用的并行性，这是循环所无法获得的。更新使用库函数的源代码比更新使用循环的代码更容易。

匿名函数很好，但它们不能在所有情况下取代常规函数。对于功能复杂且需要可重用的情形，首选常规函数。

LearnCPP CN

Chapters

12.7 - lambda表达式简介（匿名函数）

lambda

lambda 函数的类型

泛型lambda函数

泛型lambda和静态变量

返回值类型推断和尾随返回值类型

标准库中的函数对象

小结

Table of Contents

Backlinks