Time Series 时序数据预测几个重要的参数说明

🕗 发布于 2024-02-22 21:09 Time Series 时序数据预测

Time Series 时序数据预测几个重要的参数说明

flyfish

‘–model’, type=str, default=‘Autoformer’, 选项: [Autoformer, Informer, Transformer]

模型的名字，可以是模型例如Autoformer, Informer, Transformer等中的任何一个。

‘–root_path’

数据集的根目录例如 ‘./data/ETT/’

‘–data_path’

根目录下可以有多个文件，具体是哪个文件，文件名例如 ETTh1.csv

‘–features’, type=str, default=‘M’, 可选项 :[M, S, MS]

M:multivariate predict multivariate, 输入多变量->输出多变量
S:univariate predict univariate, 输入单变量->输出单变量
MS:multivariate predict univariate’输入多变量->输出单变量

‘–freq’, type=str, default=‘h’,

时间特征编码的频率，可选项[ s,t,h,d,b,w,m]

s:secondly,
t:minutely
h:hourly
d:daily
b:business days
w:weekly
m:monthly
也可以设置15min 或者 3h

‘–checkpoints’, type=str, default=‘./checkpoints/’

训练模型的存储目录

Transformer的参数

简单列几个

'--d_model', type=int, default=512, help='dimension of model'
'--n_heads', type=int, default=8, help='num of heads')
'--e_layers', type=int, default=2, help='num of encoder layers'
'--d_layers', type=int, default=1, help='num of decoder layers'
'--d_ff', type=int, default=2048, help='dimension of fcn'

完整的参数

d_model (int) – the number of expected features in the encoder/decoder inputs (default=512).

nhead (int) – the number of heads in the multiheadattention models (default=8).

num_encoder_layers (int) – the number of sub-encoder-layers in the encoder (default=6).

num_decoder_layers (int) – the number of sub-decoder-layers in the decoder (default=6).

dim_feedforward (int) – the dimension of the feedforward network model (default=2048).

dropout (float) – the dropout value (default=0.1).

activation (Union[str, Callable[[Tensor], Tensor]]) – the activation function of encoder/decoder intermediate layer, can be a string (“relu” or “gelu”) or a unary callable. Default: relu

custom_encoder (Optional[Any]) – custom encoder (default=None).

custom_decoder (Optional[Any]) – custom decoder (default=None).

layer_norm_eps (float) – the eps value in layer normalization components (default=1e-5).

batch_first (bool) – If True, then the input and output tensors are provided as (batch, seq, feature). Default: False (seq, batch, feature).

norm_first (bool) – if True, encoder and decoder layers will perform LayerNorms before other attention and feedforward operations, otherwise after. Default: False (after).

bias (bool) – If set to False, Linear and LayerNorm layers will not learn an additive bias. Default: True.

使用上述参数写一个例子

import torch
import torch.nn as nn

transformer_model = nn.Transformer(d_model=512,nhead=8, 
num_encoder_layers=2,num_decoder_layers=1,dim_feedforward=2048)
print(transformer_model)

输出结果


Transformer(
  (encoder): TransformerEncoder(
    (layers): ModuleList(
      (0): TransformerEncoderLayer(
        (self_attn): MultiheadAttention(
          (out_proj): NonDynamicallyQuantizableLinear(in_features=512, out_features=512, bias=True)
        )
        (linear1): Linear(in_features=512, out_features=2048, bias=True)
        (dropout): Dropout(p=0.1, inplace=False)
        (linear2): Linear(in_features=2048, out_features=512, bias=True)
        (norm1): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
        (norm2): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
        (dropout1): Dropout(p=0.1, inplace=False)
        (dropout2): Dropout(p=0.1, inplace=False)
      )
      (1): TransformerEncoderLayer(
        (self_attn): MultiheadAttention(
          (out_proj): NonDynamicallyQuantizableLinear(in_features=512, out_features=512, bias=True)
        )
        (linear1): Linear(in_features=512, out_features=2048, bias=True)
        (dropout): Dropout(p=0.1, inplace=False)
        (linear2): Linear(in_features=2048, out_features=512, bias=True)
        (norm1): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
        (norm2): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
        (dropout1): Dropout(p=0.1, inplace=False)
        (dropout2): Dropout(p=0.1, inplace=False)
      )
    )
    (norm): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
  )
  (decoder): TransformerDecoder(
    (layers): ModuleList(
      (0): TransformerDecoderLayer(
        (self_attn): MultiheadAttention(
          (out_proj): NonDynamicallyQuantizableLinear(in_features=512, out_features=512, bias=True)
        )
        (multihead_attn): MultiheadAttention(
          (out_proj): NonDynamicallyQuantizableLinear(in_features=512, out_features=512, bias=True)
        )
        (linear1): Linear(in_features=512, out_features=2048, bias=True)
        (dropout): Dropout(p=0.1, inplace=False)
        (linear2): Linear(in_features=2048, out_features=512, bias=True)
        (norm1): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
        (norm2): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
        (norm3): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
        (dropout1): Dropout(p=0.1, inplace=False)
        (dropout2): Dropout(p=0.1, inplace=False)
        (dropout3): Dropout(p=0.1, inplace=False)
      )
    )
    (norm): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
  )
)

完整的


Model

(
  (decomp): series_decomp(
    (moving_avg): moving_avg(
      (avg): AvgPool1d(kernel_size=(25,), stride=(1,), padding=(0,))
    )
  )
  (enc_embedding): DataEmbedding_wo_pos(
    (value_embedding): TokenEmbedding(
      (tokenConv): Conv1d(7, 512, kernel_size=(3,), stride=(1,), padding=(1,), bias=False, padding_mode=circular)
    )
    (position_embedding): PositionalEmbedding()
    (temporal_embedding): TimeFeatureEmbedding(
      (embed): Linear(in_features=5, out_features=512, bias=False)
    )
    (dropout): Dropout(p=0.05, inplace=False)
  )
  (dec_embedding): DataEmbedding_wo_pos(
    (value_embedding): TokenEmbedding(
      (tokenConv): Conv1d(7, 512, kernel_size=(3,), stride=(1,), padding=(1,), bias=False, padding_mode=circular)
    )
    (position_embedding): PositionalEmbedding()
    (temporal_embedding): TimeFeatureEmbedding(
      (embed): Linear(in_features=5, out_features=512, bias=False)
    )
    (dropout): Dropout(p=0.05, inplace=False)
  )
  (encoder): Encoder(
    (attn_layers): ModuleList(
      (0): EncoderLayer(
        (attention): AutoCorrelationLayer(
          (inner_correlation): AutoCorrelation(
            (dropout): Dropout(p=0.05, inplace=False)
          )
          (query_projection): Linear(in_features=512, out_features=512, bias=True)
          (key_projection): Linear(in_features=512, out_features=512, bias=True)
          (value_projection): Linear(in_features=512, out_features=512, bias=True)
          (out_projection): Linear(in_features=512, out_features=512, bias=True)
        )
        (conv1): Conv1d(512, 2048, kernel_size=(1,), stride=(1,), bias=False)
        (conv2): Conv1d(2048, 512, kernel_size=(1,), stride=(1,), bias=False)
        (decomp1): series_decomp(
          (moving_avg): moving_avg(
            (avg): AvgPool1d(kernel_size=(25,), stride=(1,), padding=(0,))
          )
        )
        (decomp2): series_decomp(
          (moving_avg): moving_avg(
            (avg): AvgPool1d(kernel_size=(25,), stride=(1,), padding=(0,))
          )
        )
        (dropout): Dropout(p=0.05, inplace=False)
      )
      (1): EncoderLayer(
        (attention): AutoCorrelationLayer(
          (inner_correlation): AutoCorrelation(
            (dropout): Dropout(p=0.05, inplace=False)
          )
          (query_projection): Linear(in_features=512, out_features=512, bias=True)
          (key_projection): Linear(in_features=512, out_features=512, bias=True)
          (value_projection): Linear(in_features=512, out_features=512, bias=True)
          (out_projection): Linear(in_features=512, out_features=512, bias=True)
        )
        (conv1): Conv1d(512, 2048, kernel_size=(1,), stride=(1,), bias=False)
        (conv2): Conv1d(2048, 512, kernel_size=(1,), stride=(1,), bias=False)
        (decomp1): series_decomp(
          (moving_avg): moving_avg(
            (avg): AvgPool1d(kernel_size=(25,), stride=(1,), padding=(0,))
          )
        )
        (decomp2): series_decomp(
          (moving_avg): moving_avg(
            (avg): AvgPool1d(kernel_size=(25,), stride=(1,), padding=(0,))
          )
        )
        (dropout): Dropout(p=0.05, inplace=False)
      )
    )
    (norm): my_Layernorm(
      (layernorm): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
    )
  )
  (decoder): Decoder(
    (layers): ModuleList(
      (0): DecoderLayer(
        (self_attention): AutoCorrelationLayer(
          (inner_correlation): AutoCorrelation(
            (dropout): Dropout(p=0.05, inplace=False)
          )
          (query_projection): Linear(in_features=512, out_features=512, bias=True)
          (key_projection): Linear(in_features=512, out_features=512, bias=True)
          (value_projection): Linear(in_features=512, out_features=512, bias=True)
          (out_projection): Linear(in_features=512, out_features=512, bias=True)
        )
        (cross_attention): AutoCorrelationLayer(
          (inner_correlation): AutoCorrelation(
            (dropout): Dropout(p=0.05, inplace=False)
          )
          (query_projection): Linear(in_features=512, out_features=512, bias=True)
          (key_projection): Linear(in_features=512, out_features=512, bias=True)
          (value_projection): Linear(in_features=512, out_features=512, bias=True)
          (out_projection): Linear(in_features=512, out_features=512, bias=True)
        )
        (conv1): Conv1d(512, 2048, kernel_size=(1,), stride=(1,), bias=False)
        (conv2): Conv1d(2048, 512, kernel_size=(1,), stride=(1,), bias=False)
        (decomp1): series_decomp(
          (moving_avg): moving_avg(
            (avg): AvgPool1d(kernel_size=(25,), stride=(1,), padding=(0,))
          )
        )
        (decomp2): series_decomp(
          (moving_avg): moving_avg(
            (avg): AvgPool1d(kernel_size=(25,), stride=(1,), padding=(0,))
          )
        )
        (decomp3): series_decomp(
          (moving_avg): moving_avg(
            (avg): AvgPool1d(kernel_size=(25,), stride=(1,), padding=(0,))
          )
        )
        (dropout): Dropout(p=0.05, inplace=False)
        (projection): Conv1d(512, 7, kernel_size=(3,), stride=(1,), padding=(1,), bias=False, padding_mode=circular)
      )
    )
    (norm): my_Layernorm(
      (layernorm): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
    )
    (projection): Linear(in_features=512, out_features=7, bias=True)
  )
)

‘–embed’, type=str, default=‘timeF’

时间特征编码，三种不同的编码方式，三个可选项[timeF, fixed, learned]
timeF编码：将时间戳拆解为月、日、周、小时、分钟等特征，然后将值缩放为[-0.5,0.5]的小数。
fixed编码：同样是拆解为各类特征，将其对应的整数用positional encoding的方式进行编码。
learned编码：同样是拆解为各类特征，将其对应的整数使用Embedding进行学习，在训练过程中动态调整。

原文地址：https://blog.csdn.net/flyfish1986/article/details/136228625

免责声明：本站文章内容转载自网络资源，如本站内容侵犯了原著者的合法权益，可联系本站删除。更多内容请关注自学内容网（zxcms.com）！

上一篇：K8S部署Java项目 pod的logs报错为：Error: Unable to access jarfile app.jar
下一篇：MATLAB环境下一维时间序列信号的欠定盲源分离方法

为什么 Allow 配合 meta noindex 比使用Disallow好？
为什么 Allow 配合 meta noindex。
阅读更多2024-11-06
全面解析：虚拟化技术及其应用
虚拟化技术是指通过软件模拟硬件功能，将物理资源抽象成逻辑资源的技术。虚拟化可以应用于计算、存储和网络等多个方面。虚拟化技术作为一项革命性的技术，正在深刻改变我们的世界。它不仅为企业带来了前所未有的商业
阅读更多2024-11-06
机器学习与大数据处理有何关系
机器学习（Machine Learning, ML）是人工智能的一个分支领域，它专注于让计算机系统通过自动地从数据中学习并改进其性能，以执行特定任务，而无需进行显式的编程。机器学习的核心思想是使用数据
阅读更多2024-11-06
Garbage instead of arguments “bitrate
初步确认，出现上面打印信息是因为iproute2不支持CAN配置，但是OPENWRT里我找了很入久，也没找到iproute2的配置信息，iproute2不需要配置，只需要待批ip-full ip-ti
阅读更多2024-11-06
CICD学习笔记1
黑猫是代码托管平台如github，老头jinkens：自动构建：意思就是自动执行shell脚本（脚本是部署项目该有的流程：自动环境更新、代码下载、重启项目）、shell脚本再自动部署-->构建成
阅读更多2024-11-06
【google play】使用Java接入谷歌支付流程
使用Java接入谷歌支付的完整流程，包括准备工作以及具体的Java实现。
阅读更多2024-11-06
MFC的HTTP客户端
/读取服务器上数据。另外别忘了异常处理！
阅读更多2024-11-06
G2 基于生成对抗网络（GAN）人脸图像生成
生成器（G）：输入随机噪声，通过学习数据的分布模式生成类似真实图像的输出。判别器（D）：用来判断输入的图像是真实的还是生成器生成的。训练过程中，生成器尝试欺骗判别器，生成逼真的图像，而判别器则不断优化
阅读更多2024-11-06
OCR、语音识别与信息抽取：免费开源的AI平台在医疗领域的创新应用
思通数科的AI平台通过OCR技术自动识别手写病历中的患者信息、诊断结果、医生签字等要素，并将这些信息转换为结构化数据，直接上传至医院的电子病历系统。通过这些自动化流程，平台帮助医院构建了标准化的影像数
阅读更多2024-11-06
2024年三个月自学手册网络安全（黑客技术）
网络安全可以基于攻击和防御视角来分类，我们经常听到的 “红队”、“渗透测试” 等就是研究攻击技术，而“蓝队”、“安全运营”、“安全运维”则研究防御技术。走安全行业的工程方向的，技术上面其实有很大的重叠
阅读更多2024-11-06

Time Series 时序数据预测几个重要的参数说明