Skip to content

关系训练rl模型 #18

@whu20122015

Description

@whu20122015

您好我在运行rlmodel.py的时候,一直选择的都是整个训练集的所有句子,并没有进行句子筛选,请问是怎么回事呢?
chosen sentence size: 235962
total_reward: -1.0407107
best_reward -1.0406367
chosen sentence size: 235962
total_reward: -1.0407107
best_reward -1.0406367
chosen sentence size: 235962
total_reward: -1.0407109
best_reward -1.0406367
chosen sentence size: 235962
total_reward: -1.040711
best_reward -1.0406367
chosen sentence size: 235962
total_reward: -1.0407109
best_reward -1.0406367

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions