为啥 m1max cpu 跑深度强化学习比 3900x + 2080s 的台式机快好多 - V2EX
wsinglewell

为啥 m1max cpu 跑深度强化学习比 3900x + 2080s 的台式机快好多

  •  1
     
  •   wsinglewell Mar 15, 2024 1350 views
    This topic created in 794 days ago, the information mentioned may be changed or developed.
    叠甲:没系统学过深度学习和强化学习,所以可能不是很了解一些基础知识。

    最近想训练一个二维迷宫寻路智能体,从 linyiYi 这里抄了一个他贪吃蛇 ai 的板子拿过来改
    https://github.com/linyiLYi/snake-ai

    但是发现训练的时候,训练速度 m1max cpu >>> 2080s > m1max mps > 3900x cpu
    代码在这里: https://github.com/abmcar/NavigateAi/tree/main/main
    模型基本上都是用的默认的参数


    对这个训练速度的差异很奇怪,是因为我台式机没有配置好环境还是因为我训练的内容恰好适合 m1max cpu?



    另外想问一下,像这种简单问题,是不是调一个好的 reward 策略之后训练起来很快啊?之前跑没有加障碍物的 env ,跑了 5e7 步才能做到走最短路不撞墙一直吃到果子,感觉是 reward 没设置好。如果训练效果跟 reward 策略强相关的话,我需要学习一些理论知识来制定 reward 策略吗?或者还需要自己根据实际 env 手写模型并且调参?
    下面是之前无障碍的 reward 策略,https://github.com/abmcar/NavigateAi/blob/4d8cd82323305d69b65cfd21fb7a58808b992232/main/navigate_game_custom_wrapper_mlp.py
    wsinglewell
        1
    wsinglewell  
    OP
       Mar 15, 2024
    说错了速度应该是
    m1max cpu >> 3900x >> 2080s cuda >> m1max mps
    m1max cpu 2 iterations 用时 8.1s 1e5 step 15s
    m1max mps 2 iterations 用时 135s 1e5 step 240s
    2080s cuda 2 iterations 用时 65s 1e5 step 120s
    3900x cpu 2 iterations 用时 26s 1e5 step 48S


    wsinglewell
        2
    wsinglewell  
    OP
       Mar 15, 2024
    @wsinglewell 上一张图是 2080s 和 3900x 下面这个是 m1max
    wsinglewell
        3
    wsinglewell  
    OP
       Mar 15, 2024
    wsinglewell
        4
    wsinglewell  
    OP
       Mar 15, 2024
    @wsinglewell #2
    @wsinglewell #3
    这图片上传功能真离谱...总是上传错...
    About     Help     Advertise     Blog     API     FAQ     Solana     4402 Online   Highest 6679       Select Language
    创意工作者们的社区
    World is powered by solitude
    VERSION: 3.9.8.5 44ms UTC 05:35 PVG 13:35 LAX 22:35 JFK 01:35
    Do have faith in what you're doing.
    ubao msn snddm index pchome yahoo rakuten mypaper meadowduck bidyahoo youbao zxmzxm asda bnvcg cvbfg dfscv mmhjk xxddc yybgb zznbn ccubao uaitu acv GXCV ET GDG YH FG BCVB FJFH CBRE CBC GDG ET54 WRWR RWER WREW WRWER RWER SDG EW SF DSFSF fbbs ubao fhd dfg ewr dg df ewwr ewwr et ruyut utut dfg fgd gdfgt etg dfgt dfgd ert4 gd fgg wr 235 wer3 we vsdf sdf gdf ert xcv sdf rwer hfd dfg cvb rwf afb dfh jgh bmn lgh rty gfds cxv xcv xcs vdas fdf fgd cv sdf tert sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf sdf shasha9178 shasha9178 shasha9178 shasha9178 shasha9178 liflif2 liflif2 liflif2 liflif2 liflif2 liblib3 liblib3 liblib3 liblib3 liblib3 zhazha444 zhazha444 zhazha444 zhazha444 zhazha444 dende5 dende denden denden2 denden21 fenfen9 fenf619 fen619 fenfe9 fe619 sdf sdf sdf sdf sdf zhazh90 zhazh0 zhaa50 zha90 zh590 zho zhoz zhozh zhozho zhozho2 lislis lls95 lili95 lils5 liss9 sdf0ty987 sdft876 sdft9876 sdf09876 sd0t9876 sdf0ty98 sdf0976 sdf0ty986 sdf0ty96 sdf0t76 sdf0876 df0ty98 sf0t876 sd0ty76 sdy76 sdf76 sdf0t76 sdf0ty9 sdf0ty98 sdf0ty987 sdf0ty98 sdf6676 sdf876 sd876 sd876 sdf6 sdf6 sdf9876 sdf0t sdf06 sdf0ty9776 sdf0ty9776 sdf0ty76 sdf8876 sdf0t sd6 sdf06 s688876 sd688 sdf86