C++builder中的人工智能（18）：神经网络中的SoftMax函数

🕗 发布于 2024-11-10 05:57 人工智能 c++ 神经网络

在这篇文章中，我们将探讨SoftMax函数在神经网络中的作用，如何在人工神经网络（ANN）中使用SoftMax函数，以及在AI技术中SoftMax的应用场景。让我们来详细解释这些概念。

SoftMax函数是什么？

SoftMax函数是逻辑函数在多维情况下的推广，也被称为软argmax或归一化指数函数。它在多项式逻辑回归中使用，并且常作为神经网络最后一个激活函数，用于将网络的输出归一化为概率分布。换句话说，SoftMax用于将预测输出向量转换为概率分布。SoftMax函数不作为激活函数使用，而是在所有输出从激活函数获得后，用于归一化这个向量（或数组）。换言之，SoftMax从给定的输出向量或数组中给出重要值。

SoftMax对激活输出的每个元素使用标准指数函数，每个值的输出在0和1之间。它通过将所有这些指数的和除以这些值来归一化，确保输出向量的分量之和为1。

SoftMax函数的作用是什么？

在神经网络中，SoftMax函数通常用于基于神经网络的分类器的最后一层。这类网络通常在对数损失或交叉熵方法下进行训练，这些方法是多项式逻辑回归的非线性变体。

对于一个有n个成员的x向量（或数组），每个成员的SoftMax可以写成如下，

这个函数可能会因为无限结果而溢出。为了避免这种情况，我们可以通过减去最大值m来调节x值。

如何在C++中编写SoftMax函数？

SoftMax函数可以在C++中如下编写：

static void softmax(double* input, double* output, unsigned int n) {
    double sum = 0;
    double m = -INFINITY;
    for (long int i = 0; i < n; i++) {
        m = std::max(m, input[i]);
    }
    for (unsigned int j = 0; j < n; j++) {
        sum += std::exp(input[j] - m);
    }
    for (unsigned int i = 0; i < n; i++) {
        output[i] = std::exp(input[i] - m) / sum;
    }
}

我们还可以使用偏移量来计算softmax，如下：

static void softmax2(double* input, double* output, size_t input_len) {
    assert(input);
    double m = -INFINITY;
    for (long int i = 0; i < input_len; i++) {
        if (input[i] > m) {
            m = input[i];
        }
    }
    double sum = 0.0;
    for (size_t i = 0; i < input_len; i++) {
        sum += expf(input[i] - m);
    }
    double offset = m + logf(sum);
    for (size_t i = 0; i < input_len; i++) {
        output[i] = expf(input[i] - offset);
    }
}

有没有一个简单的C++ SoftMax示例？

以下示例中使用了两个softmax()函数：

#include <iostream>
#include <assert.h>
#include <algorithm>
#include <math.h>

static void softmax(double* input, double* output, unsigned int n) {
    double sum = 0;
    double m = -INFINITY;
    for (long int i = 0; i < n; i++) {
        m = std::max(m, input[i]);
    }
    for (unsigned int j = 0; j < n; j++) {
        sum += std::exp(input[j] - m);
    }
    for (unsigned int i = 0; i < n; i++) {
        output[i] = std::exp(input[i] - m) / sum;
    }
}

static void softmax2(double* input, double* output, size_t input_len) {
    assert(input);
    double m = -INFINITY;
    for (long int i = 0; i < input_len; i++) {
        if (input[i] > m) {
            m = input[i];
        }
    }
    double sum = 0.0;
    for (size_t i = 0; i < input_len; i++) {
        sum += expf(input[i] - m);
    }
    double offset = m + logf(sum);
    for (size_t i = 0; i < input_len; i++) {
        output[i] = expf(input[i] - offset);
    }
}

#define N 7

int main() {
    double inp[] = {1.0, 2.0, 400.0, 4000.0, 1.0, 2.0, 3.0};
    double out[N];
    std::cout << "Inputs:\n";
    for (int i = 0; i < N; i++) {
        std::cout << inp[i] << ',';
    }
    std::cout << '\n';

    softmax(inp, out, N);
    double tot = 0;
    std::cout << "Softmax Output:\n";
    for (int i = 0; i < N; i++) {
        std::cout << out[i] << ',';
        tot += out[i];
    }
    std::cout << '\n';
    std::cout << "total of softmax output:" << tot << '\n';

    softmax2(inp, out, N);
    tot = 0;
    std::cout << "Softmax Output:\n";
    for (int i = 0; i < N; i++) {
        std::cout << out[i] << ',';
        tot += out[i];
    }
    std::cout << '\n';
    std::cout << "total of softmax output:" << tot << '\n';

    getchar();
    return 0;
}

这个示例展示了如何在C++中使用SoftMax函数来处理一个简单的输入数组，并输出归一化后的概率分布。

原文地址：https://blog.csdn.net/caridle/article/details/143650152

免责声明：本站文章内容转载自网络资源，如本站内容侵犯了原著者的合法权益，可联系本站删除。更多内容请关注自学内容网（zxcms.com）！

上一篇：如何建设一个呼叫中心外呼部门？
下一篇：Zabbix5 通过 Rsyslog 实现设备日志收集分析syslog及监控告警

损失函数选择
答：回归问题是线性问题，比如房价预测，股市预测，温度预测等。分类问题是非线问题，比如猫狗分类，服装分类等。
阅读更多2024-11-29
数据结构（16）特殊矩阵的压缩存储
数据结构——特殊矩阵的压缩存储篇
阅读更多2024-11-29
Rockchip-linux驱动 --- IIC
IIC
阅读更多2024-11-29
【CSS】clip-path 属性（剪裁显示区域）
使用裁剪方式创建元素的可显示区域。区域内的部分显示，区域外的隐藏。
阅读更多2024-11-29
【Java 基础】-- 将 List＜String[]＞转为 List＜String＞
【代码】【Java 基础】-- 将 List 转为 List
阅读更多2024-11-29
[保姆式教程]使用labelimg2软件标注定向目标检测数据和格式转换
定向目标检测是一种在图像或视频中识别和定位对象的同时，还估计它们方向的技术。这种技术特别适用于处理有一定旋转或方向变化的对象，例如汽车、飞机或文本。定向目标检测器的输出是一组旋转的边界框，这些框精确地
阅读更多2024-11-29
YOLOv1 (You Only Look Once)
YOLOv1 通过将目标检测问题转化为回归问题，提供了一种高效、快速的检测方式。尽管它在小物体检测和密集目标的场景中存在一些局限，但它的创新性为后续目标检测方法的发展奠定了基础。如果你有兴趣实现 YO
阅读更多2024-11-29
华为仓颉编程环境搭建
摘自华为官方：仓颉编程语言作为一款面向全场景应用开发的现代编程语言，通过现代语言特性的集成、全方位的编译优化和运行时实现、以及开箱即用的 IDE 工具链支持，为开发者打造友好开发体验和卓越程序性能。其
阅读更多2024-11-29
架构第十五章：Ansible自动化运维工具
linux镜像源（组包）：wget -O /etc/yum.repos.d/CentOS-Base.repo http://mirrors.aliyun.com/repo/Centos-7.repo。
阅读更多2024-11-29
质数——acwing
之前做的笔记👆👆👆。
阅读更多2024-11-29

C++builder中的人工智能（18）：神经网络中的SoftMax函数

SoftMax函数是什么？

SoftMax函数的作用是什么？

如何在C++中编写SoftMax函数？

有没有一个简单的C++ SoftMax示例？

相关文章