
´óС£¡£¡£¡£¡£¡£¡£º88.46MÓïÑÔ£º¼òÌåÖÐÎÄ
ÖÖ±ð£ºÈü³µÓÎϷϵͳ£ºAndroid/IOS
?¼ÓÄôóÕ¹Íûpc2.8ÉñͯչÍûÆÊÎö?ΪÄãÌṩ¼ÓÄôóÕ¹Íûpc2.8ÉñͯչÍûÆÊÎöAPP°²×¿°æÏÂÔØ£¬£¬£¬£¬£¬£¬ÀúÊ·°æ±¾¡¢¾É°æÏÂÔØ£¬£¬£¬£¬£¬£¬Éó²é×îмÓÄôóÕ¹Íûpc2.8ÉñͯչÍûÆÊÎöÊÖ»ú°æÏÈÈÝ¡¢Ó¦ÓýØÍ¼¡¢ÍøÓÑ̸ÂÛ£¬£¬£¬£¬£¬£¬Àû±ã¿ì½ÝµÄ½«°²×¿°æ¼ÓÄôóÕ¹Íûpc2.8ÉñͯչÍûÆÊÎöÓ¦ÓÃÃâ·ÑÏÂÔØµ½ÊÖ»ú¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿î¶þ´ÎԪðÏÕÓÎÏ·£¬£¬£¬£¬£¬£¬ÔÚÓÎÏ·ÖÐÍæ¼Ò¿ÉÒÔÊÎÑÝÒ»Ãû¸ßÖÐÅ®ÉúȥУ԰¾ÙÐÐ̽ÏÕ¡£¡£¡£¡£¡£¡£¡£Ð£Ô°Àï»á±¬·¢Ðí¶àµÄÏ·¾çÐԵĹÊÊ£¬£¬£¬£¬£¬£¬Íæ¼Ò¿ÉÒÔÍê³ÉÖÖÖÖÓÎϷʹÃüÀ´»ñµÃ½±Àø£¬£¬£¬£¬£¬£¬ÓÎÏ·µÄÍæ·¨Ò²ÊǸ»ºñÓÐȤ¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿î¾ªÏմ̼¤µÄðÏÕÌÓ×ßÓÎÏ·¡£¡£¡£¡£¡£¡£¡£Íæ¼Ò±»À§ÔÚ³äÂú¿ÆÑ§ÊµÑéµÄÃÔ¹¬ÖУ¬£¬£¬£¬£¬£¬ÐèҪѰÕÒÏßË÷¡¢½â¿ªÃÕÌ⣬£¬£¬£¬£¬£¬²Å»ªÌÓÀëÏÕ¾³¡£¡£¡£¡£¡£¡£¡£ÓÎÏ·»ÃæÏ¸Ä壬£¬£¬£¬£¬£¬ÒôЧ±ÆÕæ¡£¡£¡£¡£¡£¡£¡£Íæ¼ÒÐèÒªÔËÓÃÖǻۺÍÓÂÆø£¬£¬£¬£¬£¬£¬Ñ°ÕÒÒþ²ØµÄÏßË÷£¬£¬£¬£¬£¬£¬½â¿ªÃÕÌ⣬£¬£¬£¬£¬£¬×îÖÕÀÖ³ÉÌÓ×ß¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿îÊ®·ÖºÃÍæµÄÐÝÏд´ÒµÄ£ÄâÓÎÏ·£¬£¬£¬£¬£¬£¬ÄãÐèÒªÕÐÆ¸Ô±¹¤¡¢Ñ°ÕÒºÏÊʵİ칫԰µØ¡¢¼ÓÈëÖÖÖÖÐÅÏ¢»¯ÏîÄ¿ÕÐͶ±ê¡¢È·±£¹¤³ÌµÄ½ø¶È¡¢ÖÎÀíÍŶӳÉÔ±¡¢½»¸¶ÏîÄ¿²¢×¬È¡×ã¹»µÄÀûÈ󡣡£¡£¡£¡£¡£¡£
ÊÇÒ»¿îÊ®·Ö´Ì¼¤µÄÄ©ÈÕÉúÑÄÓÎÏ·£¬£¬£¬£¬£¬£¬Í¨¹ýÎäÆ÷ìî³ýËùÓеĽ©Ê¬£¬£¬£¬£¬£¬£¬²¢ÔÚ¶¼»áÖÐÕö¿ªÒ»³¡Ç¿ÁÒµÄðÏÕ£¬£¬£¬£¬£¬£¬ÎÞаµØÓ¦¶Ô½©Ê¬£¬£¬£¬£¬£¬£¬×èÖ¹±»Ï®»÷²¢¼á³ÖÉúÑĵ½×îºó¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿îºÜÊǺÃÍæµÄÐÝÏÐÒæÖÇÀàÓÎÏ·£¬£¬£¬£¬£¬£¬Õâ¿îÓÎÏ·ÍæÆðÀ´ºÜÊǵÄÓÐȤ£¬£¬£¬£¬£¬£¬ÄÚÀïÓÐÐí¶à¹Å°åÒâÒåÉϵÄÃÍÊÞ¡£¡£¡£¡£¡£¡£¡£Õâ¿îÓÎÏ·µÄ»·çºÜÊǵĹîÒ죬£¬£¬£¬£¬£¬µØÍ¼ÄÚÀïÓÐÐí¶àÒ°ÊÞ¿ÉÒÔ¾ÙÐÐѱ·þ£¬£¬£¬£¬£¬£¬×ÊÖúÍæ¼ÒÀ´¾ÙÐÐÕ½¶·¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿î³Ô¼¦ÓÎÏ·£¬£¬£¬£¬£¬£¬ÓÎÏ·ÓµÓÐϸÄåµÄÓÎÏ·»ÖÊ£¬£¬£¬£¬£¬£¬³¬´óʵ¾°µØÍ¼£¬£¬£¬£¬£¬£¬¿ÉÒÔ°ÙÈËͬ³¡¾º¼¼£¬£¬£¬£¬£¬£¬²Ù×÷Á÷ͨ£¬£¬£¬£¬£¬£¬ÍêÉÆÊָУ¬£¬£¬£¬£¬£¬»¹¿ÉÒÔÓïÒô¿ªºÚ£¬£¬£¬£¬£¬£¬ËæÊ±ËæµØ³©ÏíÉä»÷¾º¼¼ÐËȤ¡£¡£¡£¡£¡£¡£¡£
ÔÚ´óÓïÑÔÄ£×Ó¿ìËÙÂõÏò¸üÇ¿ÍÆÀí¼ÓÄôóÕ¹Íûpc2.8ÉñͯչÍûÆÊÎöÄÜÁ¦Óë¸üÖØ´óÓ¦Óó¡¾°µÄÀú³ÌÖУ¬£¬£¬£¬£¬£¬¡°ÉÏÏÂÎij¤¶È¡±ÒѾ´ÓÒ»¸öÄ£×ÓÉèÖòÎÊý£¬£¬£¬£¬£¬£¬ÑݱäÎªÖÆÔ¼ÏµÍ³ÄÜÁ¦ÉÏÏÞµÄÒªº¦Æ¿¾±¡£¡£¡£¡£¡£¡£¡£
Ò»·½Ã棬£¬£¬£¬£¬£¬³¤ÎĵµÃ÷È·¡¢¿çÂÖ¶Ô»°Ó°Ïó¡¢ÖØ´óÍýÏëÓ볤Á´Ê½ÍÆÀíµÈʹÃü£¬£¬£¬£¬£¬£¬¶ÔÄ£×ÓÌá³öÁËÔ¶³¬¹Å°å 4k »ò 8k ÐòÁг¤¶ÈµÄÐèÇ󣻣»£»£»£»ÁíÒ»·½Ã棬£¬£¬£¬£¬£¬Ö÷Á÷ Transformer ¼Ü¹¹ÖлùÓÚÈ«×¢ÖØÁ¦»úÖÆµÄÅÌËãģʽ£¬£¬£¬£¬£¬£¬ÔÚÐòÁ㤶ÈÔöÌíʱ²»¿É×èÖ¹µØ´øÀ´Æ½·½¼¶µÄʱ¼äÓëÏԴ濪Ïú£¬£¬£¬£¬£¬£¬Ê¹µÃ¡°Ö§³Ö¸ü³¤ÉÏÏÂÎÄ¡±ÔÚÏÖʵ¹¤³ÌÖÐѸËÙת»¯ÎªÄÑÒÔÔâÊܵı¾Ç®ÎÊÌâ¡£¡£¡£¡£¡£¡£¡£
Î§ÈÆÕâһì¶Ü£¬£¬£¬£¬£¬£¬Ï£º±×¢ÖØÁ¦ÏÕЩ³ÉΪѧÊõ½çÓ빤ҵ½çµÄ¹²Ê¶Æ«Ïò£¬£¬£¬£¬£¬£¬µ«ËæÖ®¶øÀ´µÄ£¬£¬£¬£¬£¬£¬²¢²»ÊÇÎÊÌâµÄ³¹µ×½â¾ö£¬£¬£¬£¬£¬£¬¶øÊÇһϵÁÐеĽṹÐÔÕÅÁ¦¡£¡£¡£¡£¡£¡£¡£
ÒÑÍùÊýÄêÖУ¬£¬£¬£¬£¬£¬´ó×ÚÊÂÇéʵÑéͨ¹ýÒýÈëеÄ×¢ÖØÁ¦½á¹¹¡¢Â·ÓÉ»úÖÆ»ò¿ÉѵÁ·Ï£º±Ä£¿£¿£¿£¿£¿éÀ´»º½âÅÌËãѹÁ¦¡£¡£¡£¡£¡£¡£¡£ÕâЩҪÁìÔÚÀíÂÛÖØÆ¯ºó»òÌØ¶¨ÆÀ²âÉÏÍùÍùÌåÏÖ¾«²Ê£¬£¬£¬£¬£¬£¬µ«ÔÚÕæÊµÄ£×ÓѵÁ·Óë°²ÅÅÁ÷³ÌÖУ¬£¬£¬£¬£¬£¬È´Öð½¥Ì»Â¶³öÒ»¸ö±»ºã¾ÃµÍ¹ÀµÄÎÊÌ⣺Ŀ½ñ´óÓïÑÔÄ£×ÓÏÕЩÎÞÒ»ÆÆÀý×ñÕÕ¡°¶ÌÐòÁÐԤѵÁ·¡¢³¤ÐòÁÐ΢µ÷¡±µÄѵÁ··¶Ê½£¬£¬£¬£¬£¬£¬¶øÒ»Ð©ÐÞ¸ÄÄ£×Ӽܹ¹µÄÏ£º±×¢ÖØÁ¦¼Æ»®ÀýÈçNSA£¬£¬£¬£¬£¬£¬Ôڽṹ¡¢²ÎÊý»òÊä³öÐÎʽÉÏÓë±ê×¼ dense attention ±£´æÏÔÖø²î³ØÆë¡£¡£¡£¡£¡£¡£¡£
ÕýÊÇÔÚÕâÒ»Åä¾°Ï£¬£¬£¬£¬£¬£¬Ç廪´óѧÁõÖªÔ¶ÍŶÓÌá³öÁË¡¶InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation¡·¡£¡£¡£¡£¡£¡£¡£ÓëÒÔÍùÇ¿µ÷¡°ÒýÈëнṹ¡±»ò¡°ÔöÌí¿ÉѵÁ·Ä£¿£¿£¿£¿£¿é¡±µÄ·¾¶²î±ð£¬£¬£¬£¬£¬£¬ÕâÏîÑо¿½«¹Ø×¢µãÇ°ÒÆÖÁÒ»¸ö¸ü»ù´¡µÄÎÊÌ⣺ϣº±×¢ÖØÁ¦ÊÇ·ñ±ØÐèÒԸıäÄ£×ӽṹΪ¼ÛÇ®£¬£¬£¬£¬£¬£¬²Å»ª»ñµÃ³¤ÉÏÏÂÎÄЧÂÊ£¿£¿£¿£¿£¿
Ϊ´ËÑо¿ÍŶÓÌá³öÁËÒ»ÖÖ dense¨Csparse ¿ÉÇл»µÄ×¢ÖØÁ¦¿ò¼Ü£¬£¬£¬£¬£¬£¬ÊÔͼÔÚÒÔÔÓÐ dense attention ²ÎÊý×÷ΪÆðʼµã£¬£¬£¬£¬£¬£¬¼á³ÖÊä³öÐÎʽÎȹ̣¬£¬£¬£¬£¬£¬×öµ½ÊÇ·ÇÎı¾¿ÉͬʱѵÁ·£¬£¬£¬£¬£¬£¬ÇÒÄܸßЧµØÊµÏÖ´Ó¶ÌÉÏÏÂÎĵ½³¤ÉÏÏÂÎĵį½»¬¹ý¶É¡£¡£¡£¡£¡£¡£¡£
ÖµµÃÒ»ÌáµÄÊÇ£¬£¬£¬£¬£¬£¬ÕâÏîÊÂÇ鲢佫Öصã·ÅÔÚ¼òµ¥Ö¸±êµÄÌáÉýÉÏ£¬£¬£¬£¬£¬£¬¶øÊÇϵͳÐԵشÓÐÔÄܼá³Ö¡¢ÑµÁ·ÎȹÌÐÔÒÔ¼°¶Ëµ½¶ËÍÆÀíЧÂÊÈý¸ö²ãÃæ£¬£¬£¬£¬£¬£¬¶ÔÕâÒ»Éè¼ÆË¼Ð÷¾ÙÐÐÁËÑéÖ¤£¬£¬£¬£¬£¬£¬´Ó¶øÎª³¤ÉÏÏÂÎÄ´óÓïÑÔÄ£×ÓµÄÑо¿Ó빤³Ìʵ¼ùÌṩÁËÒ»Ìõ²î±ðÓÚÒÔÍùµÄÊÖÒÕõè¾¶¡£¡£¡£¡£¡£¡£¡£
![]()
ÂÛÎĵص㣺https://arxiv.org/pdf/2509.24663
Ò»´Î¡¸ÊÇ·ñÕæ¿ÉÓá¹µÄʵÑ黨¸²
ÕûÌåÀ´¿´£¬£¬£¬£¬£¬£¬Ñо¿µÄʵÑéÉè¼Æ²¢·Ç¼òÆÓµØÑéÖ¤¡°InfLLM-V2 ÊÇ·ñÓÐÓá±£¬£¬£¬£¬£¬£¬¶øÊÇÎ§ÈÆÈý¸öÖð²ãµÝ½øµÄ½¹µãÎÊÌâÕö¿ª£ºµÚÒ»£¬£¬£¬£¬£¬£¬ÔÚ³¤ÉÏÏÂÎÄʹÃüÖУ¬£¬£¬£¬£¬£¬¸ÃÒªÁìµÄÐÔÄÜÊÇ·ñÄܹ»ÆÈ½üÉõÖÁÆ¥ÅäÈ«×¢ÖØÁ¦»úÖÆ£»£»£»£»£»µÚ¶þ£¬£¬£¬£¬£¬£¬ÔÚ¡°¶ÌÐòÁÐԤѵÁ· ¡ú ³¤ÐòÁÐ΢µ÷¡±µÄÕæÊµÑµÁ··¶Ê½Ï£¬£¬£¬£¬£¬£¬¸ÃÒªÁìÊÇ·ñ»áÆÆËðÄ£×ÓÔÓÐÄÜÁ¦£»£»£»£»£»µÚÈý£¬£¬£¬£¬£¬£¬ÔÚÍêÕûÍÆÀíÁ÷³ÌÖУ¬£¬£¬£¬£¬£¬Ï£º±×¢ÖØÁ¦´øÀ´µÄÅÌËã¼ÓËÙÊÇ·ñÄܹ»×ª»¯Îª¶Ëµ½¶ËµÄÏÖʵÊÕÒæ¡£¡£¡£¡£¡£¡£¡£
Î§ÈÆµÚÒ»¸öÎÊÌ⣬£¬£¬£¬£¬£¬Ñо¿ÍŶÓÖØµãÆÀ²âÁ˶àÖÖ³¤ÊäÈëÃ÷ȷʹÃü¡£¡£¡£¡£¡£¡£¡£ÔÚ 32k ³¤¶ÈµÄ RULER »ù×¼ÉÏ£¬£¬£¬£¬£¬£¬InfLLM-V2£¨Sparse£©ÔÚ¾ø´ó´ó¶¼×ÓʹÃüÖеÄÌåÏÖÏÕЩÓë Full Attention ÖØºÏ£¬£¬£¬£¬£¬£¬¶øÑµÁ·ºóÏ£º±ÒªÁ죨Èç InfLLM¡¢MInference£©ÔÚ²¿·ÖʹÃüÉÏ·ºÆðÏÔ×ÅÐÔÄܶÏÑ£¬£¬£¬£¬£¬£¬¿ÉѵÁ·Ï£º±×¢ÖØÁ¦ÒªÁì NSA ÔÚ¶ÌÐòÁе½³¤ÐòÁÐǨáãµÄÉ趨ÏÂÒ²ÏÔÖøÂäÎé¡£¡£¡£¡£¡£¡£¡£
ÕâһЧ¹ûÅú×¢£¬£¬£¬£¬£¬£¬InfLLM-V2 µÄÏ£º±Õ½ÂÔ²¢Î´ÆÆËð¿ç¿éµÄ³¤¾àÀëÒÀÀµ½¨Ä£ÄÜÁ¦£¬£¬£¬£¬£¬£¬¶øÆäËûÒªÁìҪôÔÚ block Ñ¡Ôñ½×¶ÎʧЧ£¬£¬£¬£¬£¬£¬ÒªÃ´¶ÔÔÓÐ×¢ÖØÁ¦ÂþÑÜÔì³ÉÁËÏÔÖøÈŶ¯¡£¡£¡£¡£¡£¡£¡£
![]()
ÔÚ¸üÌù½üÕæÊµÓ¦Óó¡¾°µÄ LongBench »ù×¼ÉÏ£¬£¬£¬£¬£¬£¬ÕâÒ»Ç÷ÊÆÌåÏÖµÃÔ½·¢ÏÔ×Å¡£¡£¡£¡£¡£¡£¡£ÓÉÓÚ LongBench ÁýÕÖÎÊ´ð¡¢ÕªÒª¡¢ÍÆÀíÒÔ¼°¶àÓïÑԵȶàÖÖÕæÊµÊ¹Ãü£¬£¬£¬£¬£¬£¬ÆäÕûÌåÄѶȸßÓںϳÉÊý¾Ý¼¯£¬£¬£¬£¬£¬£¬µ« InfLLM-V2£¨Sparse£©µÄÕûÌåµÃ·ÖÒÀÈ»µÖ´ïÉõÖÁÂÔ΢Áè¼Ý Full Attention¡£¡£¡£¡£¡£¡£¡£À×·åÍø
Ïà±È֮ϣ¬£¬£¬£¬£¬£¬NSA µÄÐÔÄÜÏÔ×ŵÍÓÚÈ«×¢ÖØÁ¦£¬£¬£¬£¬£¬£¬¶ø½öÒÀÀµ³¤¶ÈÍâÍÆµÄ SHORT+YaRN ·½¹æÔò·ºÆðÁË´ó·ùÐÔÄÜÍË»¯¡£¡£¡£¡£¡£¡£¡£Ñо¿Ö°Ô±½øÒ»²½ÊӲ쵽£¬£¬£¬£¬£¬£¬InfLLM-V2 µÄ dense / sparse ¿ÉÇл»»úÖÆÔÚ²¿·ÖʹÃüÖз´¶ø½µµÍÁË×¢ÖØÁ¦ÔëÉù£¬£¬£¬£¬£¬£¬´Ó¶øÊ¹Ä£×ÓÊä³öÔ½·¢Îȹ̡£¡£¡£¡£¡£¡£¡£
![]()
ÔÚ LongPPL ÕâÒ»ÓÃÓÚȨºâ³¤ÐòÁÐÓïÑÔ½¨Ä£ÄÜÁ¦µÄÒÉÐÄ¶ÈÆÀ²âÖУ¬£¬£¬£¬£¬£¬InfLLM-V2 µÄÌåÏÖÓë Full Attention »ù±¾Ò»Ö£¬£¬£¬£¬£¬£¬¶ø NSA µÄÒÉÐĶÈÏÔÖø¸ü¸ß¡£¡£¡£¡£¡£¡£¡£ÕâһЧ¹û˵Ã÷£¬£¬£¬£¬£¬£¬NSA Ôڶ̵½³¤Ç¨áãѵÁ·ºó²¢Î´ÕæÕýѧ»á½¨Ä£³¤³ÌÓïÑÔÂþÑÜ£¬£¬£¬£¬£¬£¬Æä½ÏµÍµÄѵÁ· loss ²¢Î´×ª»¯ÎªÓÐÓõij¤ÐòÁн¨Ä£ÄÜÁ¦¡£¡£¡£¡£¡£¡£¡£
![]()
Î§ÈÆµÚ¶þ¸öÎÊÌ⣬£¬£¬£¬£¬£¬Ñо¿ÍŶӻ¹ÏµÍ³ÆÀ¹ÀÁ˳¤Á´Ê½ÍÆÀíʹÃü£¬£¬£¬£¬£¬£¬°üÀ¨ MATH-500¡¢AIME ÒÔ¼° LiveCodeBench¡£¡£¡£¡£¡£¡£¡£ÕâÀàʹÃüµÄÅäºÏÌØµãÔÚÓÚÊä³öÐòÁнϳ¤£¬£¬£¬£¬£¬£¬ÇÒÖÐÐÄÍÆÀí°ì·¨¸ß¶ÈÒÀÀµÔçÆÚÉÏÏÂÎÄÐÅÏ¢¡£¡£¡£¡£¡£¡£¡£
ʵÑéЧ¹ûÏÔʾ£¬£¬£¬£¬£¬£¬InfLLM-V2£¨Sparse£©ÔÚÕâЩʹÃüÉϵÄÌåÏÖÓë Full Attention ÏÕЩ³Öƽ£¬£¬£¬£¬£¬£¬¶ø NSA ÔÚËùÓÐÏà¹ØÊ¹ÃüÖоù·ºÆðÁËÏÔ×ŵÄÐÔÄÜϽµ¡£¡£¡£¡£¡£¡£¡£ÕâÖ±½ÓÅú×¢£¬£¬£¬£¬£¬£¬InfLLM-V2 Ëù½ÓÄɵÄÏ£º±×¢ÖØÁ¦»úÖÆ²»»áÆÆËðÁ´Ê½Í·ÄÔÍÆÀíÀú³ÌÖÐËùÐèµÄ¡°Í·ÄÔÒ»Á¬ÐÔ¡±¡£¡£¡£¡£¡£¡£¡£
![]()
±ðµÄ£¬£¬£¬£¬£¬£¬Ñо¿Ö°Ô±»¹ÑéÖ¤ÁËÒ»¸öÔÚ¹¤³Ìʵ¼ùÖÐÓÈΪҪº¦µ«³£±»ºöÊÓµÄÎÊÌ⣺ÔÚÍêÉú³¤ÉÏÏÂÎÄ΢µ÷Ö®ºó£¬£¬£¬£¬£¬£¬Ä£×ÓÊÇ·ñÈÔÄܹ»Ê¤ÈÎͨÀý¶ÌÐòÁÐʹÃü¡£¡£¡£¡£¡£¡£¡£ÔÚ MMLU¡¢CEval¡¢HumanEval µÈÆÀ²âÖУ¬£¬£¬£¬£¬£¬InfLLM-V2 ÇÐ»Ø dense ģʽºóÒÀÈ»¼á³ÖÁËÓë Full Attention Ï൱µÄÐÔÄÜ£¬£¬£¬£¬£¬£¬¶ø NSA Ôò·ºÆðÁËÏÔ×ÅÍË»¯¡£¡£¡£¡£¡£¡£¡£ÕâһЧ¹û´Ó¹¤³Ì½Ç¶ÈÅú×¢£¬£¬£¬£¬£¬£¬InfLLM-V2 ²»»áÔÚÊÊÅ䳤ÉÏÏÂÎÄÄÜÁ¦µÄÀú³ÌÖÐÆÆËðÄ£×ÓÔÓеÄͨÓÃÄÜÁ¦¡£¡£¡£¡£¡£¡£¡£
![]()
×îºó£¬£¬£¬£¬£¬£¬Õë¶ÔµÚÈý¸öÎÊÌ⣬£¬£¬£¬£¬£¬Ñо¿ÍŶӲ»µ«ÆÀ¹ÀÁË attention kernel ²ãÃæµÄÀíÂÛ¼ÓËÙЧ¹û£¬£¬£¬£¬£¬£¬»¹ÔÚÍêÕûÍÆÀíÁ÷³ÌÖÐÕÉÁ¿ÁË prefilling£¨TTFT£©ºÍ decoding£¨TPOT£©µÄ¶Ëµ½¶ËЧÂÊ¡£¡£¡£¡£¡£¡£¡£
Ôڿɼû token ÊýΪ 6k£¨|I|=96£©µÄÉèÖÃÏ£¬£¬£¬£¬£¬£¬InfLLM-V2 ʵÏÖÁËÔ¼ 2.1¡Á µÄ prefilling ¼ÓËÙºÍ 2.3¡Á µÄ decoding ¼ÓËÙ£¬£¬£¬£¬£¬£¬²¢ÇÒÕâһЧ¹ûÊÇÔÚǰÀ¡ÍøÂ磨FFN£©²¿·ÖÍêȫδ¾ÙÐÐÓÅ»¯µÄÌõ¼þÏ»ñµÃµÄ£¬£¬£¬£¬£¬£¬½øÒ»²½ËµÃ÷¸ÃÏ£º±×¢ÖØÁ¦Éè¼ÆÔÚÕæÊµÍÆÀí³¡¾°ÖоßÓÐÇÐʵ¿ÉÂ䵨µÄ¼ÓËÙ¼ÛÖµ¡£¡£¡£¡£¡£¡£¡£
´Ó½á¹¹Ñ¡Ôñµ½ÏµÍ³ÓÅ»¯
Ч¹ûÖ®Í⣬£¬£¬£¬£¬£¬ÕâÏîÑо¿µÄʵÑéÏÖʵÉϻظ²ÁËÒ»¸ö¸ü»ù´¡µÄÎÊÌ⣺Ϊʲô InfLLM-V2 µÄʵÑéЧ¹û²¢·Ç¡°ÎÞÒâÅܳöÀ´µÄ¡±£¬£¬£¬£¬£¬£¬¶øÊÇÆäÉè¼ÆÂß¼ÔÚÍêÕûѵÁ·Á÷³ÌÖб»ÏµÍ³ÐÔÑéÖ¤µÄÒ»¶¨Ð§¹û¡£¡£¡£¡£¡£¡£¡£
Ñо¿ÍŶÓÊ×ÏÈÖ¸³ö£¬£¬£¬£¬£¬£¬ÏÖʵÌìÏÂÖÐÏÕЩËùÓдóÓïÑÔÄ£×Ó¶¼×ñÕÕ¡°¶ÌÐòÁÐԤѵÁ·¡¢³¤ÐòÁÐ΢µ÷¡±µÄͨÐз¶Ê½£¬£¬£¬£¬£¬£¬Òò´Ë£¬£¬£¬£¬£¬£¬ÈκÎÏ£º±×¢ÖØÁ¦¼Æ»®ÈôÊÇÔÚÕâÒ»Àú³ÌÖдó·ù¸Ä±ä²ÎÊý½á¹¹¡¢µ÷½â attention µÄÊä³öÐÎʽ£¬£¬£¬£¬£¬£¬¶¼»áÖ±½ÓËðÉËÄ£×ÓÔÚ¶ÌÐòÁн׶ÎÒѾѧµ½µÄÌåÏÖÄÜÁ¦¡£¡£¡£¡£¡£¡£¡£
»ùÓÚÕâÒ»ÏÖÊµÔ¼Êø£¬£¬£¬£¬£¬£¬Ñо¿Ö°Ô±Ã÷È·É趨ÁË InfLLM-V2 µÄ½¹µãʵÑéÌõ¼þ£ºÔÚ´Ó dense attention ¹ý¶Éµ½ sparse attention µÄÀú³ÌÖУ¬£¬£¬£¬£¬£¬±ØÐè°ü¹ÜÒÑÓÐ dense attention µÄ±í´ïÄÜÁ¦²»±»ÆÆË𡣡£¡£¡£¡£¡£¡£
ÔÚÏêϸѵÁ·Á÷³ÌÉÏ£¬£¬£¬£¬£¬£¬Ñо¿ÍŶÓÊ×ÏȽÓÄÉÍêÈ«±ê×¼µÄ Transformer ¼Ü¹¹¶ÔÄ£×Ó¾ÙÐжÌÐòÁÐԤѵÁ·£¬£¬£¬£¬£¬£¬Ä£×Ó¹æÄ£Îª 8B ²ÎÊý£¬£¬£¬£¬£¬£¬Ê¹Óà GQA ½á¹¹£¬£¬£¬£¬£¬£¬ÐòÁ㤶ÈΪ 4k¡£¡£¡£¡£¡£¡£¡£ÕâÒ»½×¶ÎδÒýÈëÈκΠInfLLM-V2 Ïà¹ØµÄÏ£º±»úÖÆ£¬£¬£¬£¬£¬£¬È·±£Ä£×ÓÄÜÁ¦ÍêÈ«½¨ÉèÔڹŰåÈ«×¢ÖØÁ¦µÄ»ù´¡Ö®ÉÏ¡£¡£¡£¡£¡£¡£¡£À×·åÍø
Ëæºó£¬£¬£¬£¬£¬£¬ÔÚ½øÈ볤ÉÏÏÂÎÄѵÁ·½×¶Îʱ£¬£¬£¬£¬£¬£¬Ä£×ÓÄÚ²¿½ö±¬·¢ÁËÈýÏîÒªº¦×ª±ä£ºµ±ÐòÁ㤶ÈÁè¼ÝÔ¤ÉèãÐֵʱ£¬£¬£¬£¬£¬£¬attention mask ÓÉŨÃÜÐÎʽÇл»ÎªÏ£º±ÐÎʽ£»£»£»£»£»Key Óë Value µÄͶӰ²ÎÊý±»ÍêÕû¸´Ó㬣¬£¬£¬£¬£¬²»ÒýÈëеIJÎÊý·ÖÖ§£»£»£»£»£»attention µÄÊä³öÐÎʽʼÖÕ¼á³ÖΪ single-output ½á¹¹£¬£¬£¬£¬£¬£¬²»Ê¹Óà gating£¬£¬£¬£¬£¬£¬Ò²²»±£´æ¶à· attention Êä³öµÄ¾ÛºÏ¡£¡£¡£¡£¡£¡£¡£
ÕýÊÇÕâÖÖ¡°×îС½á¹¹ÈŶ¯¡±µÄÇл»·½·¨£¬£¬£¬£¬£¬£¬Ê¹ InfLLM-V2 Äܹ»ÔÚÊÊÅ䳤ÉÏÏÂÎĵÄͬʱ£¬£¬£¬£¬£¬£¬×î´óÏ޶ȱ£´æÔÓÐÄ£×ÓÄÜÁ¦£¬£¬£¬£¬£¬£¬ÕâÒ²×é³ÉÁËÆäÓë NSA µÈ¿ÉѵÁ·Ï£º±×¢ÖØÁ¦ÒªÁìµÄʵÖʲî±ð¡£¡£¡£¡£¡£¡£¡£
Ïà¹ØÊµÑé½øÒ»²½ÑéÖ¤ÁËÒ»¸ö¾ßÓз´Ö±¾õÒâζµÄ½áÂÛ£º¿ÉѵÁ·µÄÏ£º± attention ²¢·×Æç¶¨¸üÊʺ϶̵½³¤µÄǨáãѵÁ·¡£¡£¡£¡£¡£¡£¡£Ñо¿Ö°Ô±µÄÆÊÎöÅú×¢£¬£¬£¬£¬£¬£¬NSA ÔÚ¸ÃÉ趨ϵÄÐÔÄÜÎÊÌâ²¢·ÇÔ´×ÔÏ£º±»úÖÆ×Ô¼º£¬£¬£¬£¬£¬£¬¶øÊÇÓÉÓÚÆäÒýÈëÁËÈýÌ× Key¨CValue ͶӰ¡¢¶à· attention Êä³öÒÔ¼°»ùÓÚ gating µÄЧ¹û¾ÛºÏ½á¹¹¡£¡£¡£¡£¡£¡£¡£
![]()
ÕâÐ©ÌØÊâÄ£¿£¿£¿£¿£¿éÔÚ¶ÌÐòÁн׶β»µ«´øÀ´ÈßÓàÅÌË㿪Ïú£¬£¬£¬£¬£¬£¬»¹»áÏÔÖø¸Ä±ä×¢ÖØÁ¦ÂþÑÜÐÎ̬£¬£¬£¬£¬£¬£¬´Ó¶ø¶ÔÄ£×ÓÒÑѧµ½µÄÌåÏÖÔì³É×ÌÈÅ¡£¡£¡£¡£¡£¡£¡£ÔÚʵÑéЧ¹ûÖУ¬£¬£¬£¬£¬£¬ÕâÒ»ÎÊÌâÏêϸÌåÏÖΪѵÁ· loss ÇúÏß·ºÆðÏÔ×ÅÕðµ´¡¢³¤ÐòÁÐÒÉÐĶȣ¨LongPPL£©ÏÔÖøÉý¸ß£¬£¬£¬£¬£¬£¬ÒÔ¼°³¤Á´Ê½ÍÆÀíʹÃüÐÔÄܵÄϵͳÐÔϽµ¡£¡£¡£¡£¡£¡£¡£
ÔÚ¹¤³ÌʵÏÖ²ãÃæ£¬£¬£¬£¬£¬£¬Ñо¿ÍŶӻ¹Í¨¹ý½øÒ»²½µÄÏûÈÚÆÊÎö¶¨Î»ÁË InfLLM-V2 µÄÖ÷ÒªÐÔÄÜÆ¿¾±£¬£¬£¬£¬£¬£¬·¢Ã÷Æä¼¯ÖÐÔÚ block selection ½×¶Î£¬£¬£¬£¬£¬£¬ÓÈÆäÊÇ compression attention µÄÅÌËãÒÔ¼° attention score µÄÏÔʽÎﻯÀú³Ì¡£¡£¡£¡£¡£¡£¡£Õë¶ÔÕâÒ»ÎÊÌ⣬£¬£¬£¬£¬£¬Ñо¿Ö°Ô±ÔÚʵÑéÖÐÒýÈëÁË head-group fusion ºÍ LSE Approximation µÈÓÅ»¯Õ½ÂÔ¡£¡£¡£¡£¡£¡£¡£
ʵÑéЧ¹ûÅú×¢£¬£¬£¬£¬£¬£¬ÕâЩˢÐÂÔÚÏÕЩ²»Ó°ÏìÄ£×ÓÐÔÄܵÄÌõ¼þÏ£¬£¬£¬£¬£¬£¬¿ÉÒÔ½« block selection µÄÅÌËãʱ¼ä½µµÍÔ¼ 20¨C30%£¬£¬£¬£¬£¬£¬´Ó¶øÎªºóÐø¶Ëµ½¶ËÍÆÀí¼ÓËÙʵÑéÖÐÊӲ쵽µÄÏÔÖøÐÔÄÜÌáÉýµÓÚ¨ÁËÒªº¦»ù´¡¡£¡£¡£¡£¡£¡£¡£
![]()
¿É¡¸ÈÈÉý¼¶¡¹µÄ³¤ÉÏÏÂÎļƻ®
´ÓÑо¿ÒâÒåµÄ½Ç¶ÈÀ´¿´£¬£¬£¬£¬£¬£¬ÕâÏîÑо¿¶Ô¡°³¤ÉÏÏÂÎÄ´óÓïÑÔÄ£×Ó¡±ÕâһƫÏò¸ø³öÁ˾ßÓÐÒªÁìÂÛ¼ÛÖµµÄÆôʾ¡£¡£¡£¡£¡£¡£¡£
Ñо¿ÍŶÓÃ÷È·Ö¸³ö£¬£¬£¬£¬£¬£¬Ï£º±×¢ÖØÁ¦»úÖÆÎ´À´µÄÉú³¤Öص㲢²»ÔÚÓÚÉè¼ÆÈ«ÐµÄ×¢ÖØÁ¦½á¹¹£¬£¬£¬£¬£¬£¬¶øÔÚÓÚÔõÑùÔÚ²»ÆÆËð¼ÈÓÐ dense attention ½á¹¹µÄÌõ¼þÏÂʵÏÖ¸ßЧµÄÏ£º±»¯£¬£¬£¬£¬£¬£¬ÕâÒ»¿´·¨ÔÚÒ»¶¨Ë®Æ½ÉϸıäÁË´ËǰÒÔ¡°½á¹¹Á¢Ò족ΪÖ÷µ¼µÄÑо¿·¶Ê½¡£¡£¡£¡£¡£¡£¡£
ÔÚ¹¤³Ìʵ¼ù²ãÃæ£¬£¬£¬£¬£¬£¬InfLLM-V2 Ëù¾ß±¸µÄһϵÁÐÌØÕ÷ǡǡÆõºÏÕæÊµ¹¤Òµ°²ÅŵĽ¹µãÐèÇ󣬣¬£¬£¬£¬£¬°üÀ¨ÎÞÐèµ÷½âÄ£×Ó²ÎÊý¹æÄ£¡¢ÎÞÐèά»¤¶àÌ×Ä£×Ó°æ±¾¡¢²»»áÎþÉü¶ÌÐòÁÐʹÃüÐÔÄÜ£¬£¬£¬£¬£¬£¬ÇÒ²»ÒÀÀµÖØÐ¾ÙÐдó¹æÄ£Ô¤ÑµÁ·¡£¡£¡£¡£¡£¡£¡£ÕâÒâζ×Å£¬£¬£¬£¬£¬£¬Ò»¸öÒѾ°²ÅÅ»òѵÁ·Íê³ÉµÄÏÖÓдóÓïÑÔÄ£×Ó£¬£¬£¬£¬£¬£¬¿ÉÒÔÔÚ×îС¼ÛǮϱ»¡°ÈÈÉý¼¶¡±Îª¾ß±¸³¤ÉÏÏÂÎÄ´¦Öóͷ£ÄÜÁ¦µÄÄ£×Ó¡£¡£¡£¡£¡£¡£¡£
ÔÚ´Ë»ù´¡ÉÏ£¬£¬£¬£¬£¬£¬Ñо¿Ö°Ô±Ò²ÎªºóÐøÊÂÇéÒþº¬µØ»®¶¨ÁËÈô¸ÉÖ÷ÒªÔ¼Êø£ºÊ×ÏÈ£¬£¬£¬£¬£¬£¬Ó¦×èÖ¹ÒýÈëÌØÁíÍâ attention ·ÖÖ§£¬£¬£¬£¬£¬£¬ÒÔÃâÆÆËðÔÓнṹµÄÒ»ÖÂÐÔ£»£»£»£»£»Æä´Î£¬£¬£¬£¬£¬£¬²»Ó¦½ÓÄÉÓë dense attention Êä³öÐÎʽ²»¼æÈݵÄÉè¼Æ£¬£¬£¬£¬£¬£¬²»È»½«µ¼Ö¶̵½³¤Ç¨áãÀú³ÌÖеÄÄÜÁ¦Ëðʧ£»£»£»£»£»×îºó£¬£¬£¬£¬£¬£¬Ï£º±×¢ÖØÁ¦µÄÉè¼Æ±ØÐè³ä·Ö˼Á¿µ×²ãÅÌËãʵÏÖÓë kernel ÌØÕ÷£¬£¬£¬£¬£¬£¬¶ø²»µ«Í£ÁôÔÚ¿´·¨²ãÃæµÄ½á¹¹ÓÅÑÅÐÔ¡£¡£¡£¡£¡£¡£¡£
ÕýÊÇÓÉÓÚ¸ÃÑо¿½«ÑµÁ··¶Ê½¡¢Ä£×ӽṹÉè¼ÆÒÔ¼° CUDA ¼¶ÊµÏÖϸ½Ú¾ÙÐÐÁËͳһ¿¼Á¿£¬£¬£¬£¬£¬£¬²¢ÏµÍ³ÐÔµØÚ¹ÊÍÁËÒÔÍùÏ£º±×¢ÖØÁ¦ÒªÁìÔÚÕæÊµÑµÁ·ÓëÍÆÀíÁ÷³ÌÖÐʧ°ÜµÄÔµ¹ÊÔÓÉ£¬£¬£¬£¬£¬£¬²ÅʹÆä²»µ«Í£ÁôÔÚÒªÁì²ãÃæµÄÌá³ö£¬£¬£¬£¬£¬£¬¶øÄܹ»½øÒ»²½Ö§³ÖÏÖʵģ×ÓµÄѵÁ·ÓëÂ䵨ӦÓ㬣¬£¬£¬£¬£¬ÕâÒ²ÊÇÑо¿ÍŶÓÄܹ»»ùÓڸÿò¼ÜÖ±½Ó²ú³ö MiniCPM-4.1 µÈÄ£×ÓµÄÖ÷ÒªÔµ¹ÊÔÓÉ¡£¡£¡£¡£¡£¡£¡£
InfLLM-V2 Ö÷Òª×÷Õß
ÕÔÍþÁØ£¬£¬£¬£¬£¬£¬ËûÊÇÇ廪´óѧÅÌËã»ú¿ÆÑ§ÓëÊÖÒÕϵ×ÔÈ»ÓïÑÔ´¦Öóͷ£ÊµÑéÊÒ£¨THUNLP£©µÄ²©Ê¿Ñо¿Éú£¬£¬£¬£¬£¬£¬Ñо¿Æ«Ïò¾Û½¹ÓÚ¸ßЧ´óÓïÑÔÄ£×Ó¡£¡£¡£¡£¡£¡£¡£
ËûµÄÑо¿Ö÷ÒªÎ§ÈÆÄ£×ÓÍÆÀíÓëѵÁ·¼ÓËÙÕö¿ª£¬£¬£¬£¬£¬£¬¹Ø×¢µã²¢·Ç´¿´âÒýÈëеÄÄ£×ӽṹ£¬£¬£¬£¬£¬£¬¶øÊÇÔõÑùÔÚ²»ÆÆËð±ê×¼ Transformer ±í´ïÄÜÁ¦Óë¼ÈÓÐÄ£×ÓÐÔÄܵÄÌõ¼þÏ£¬£¬£¬£¬£¬£¬ÊµÏÖ¶ÔÖÖÖÖ³¡¾°µÄÓÐÓÃÊÊÅäÓ빤³Ì¼¶¼ÓËÙ¡£¡£¡£¡£¡£¡£¡£
³ýѧÊõÑо¿Í⣬£¬£¬£¬£¬£¬Ëû»¹ºã¾Ã¼ÓÈë OpenBMB¡¢MiniCPM µÈ¿ªÔ´ÏîÄ¿£¬£¬£¬£¬£¬£¬ÔÚ¸ßÐÔÄÜ attention kernel¡¢ÍÆÀíÓÅ»¯ÓëϵͳʵÏÖ·½Ãæ¼ç¸ºÒªº¦¹¤³ÌÊÂÇ飬£¬£¬£¬£¬£¬ÆäÑо¿Ð§¹û½ÒÏþÓÚ ICLR¡¢ACL¡¢EMNLP µÈ¹ú¼ÊÖ÷Á÷¾Û»á¡£¡£¡£¡£¡£¡£¡£
![]()
²Î¿¼Á´½Ó£ºhttps://weilin-zhao.com
ÁõÖªÔ¶£¬£¬£¬£¬£¬£¬ËûÊÇÇ廪´óѧÅÌËã»ú¿ÆÑ§ÓëÊÖÒÕϵ½ÌÊÚ¡¢²©Ê¿Éúµ¼Ê¦£¬£¬£¬£¬£¬£¬¼æÈÎÖйúÖÐÎÄÐÅϢѧ»áÀíÊ¡¢Éç»áýÌå´¦Öóͷ£×¨Î¯»á¸±Ö÷ÈεÈѧÊõÖ°Îñ¡£¡£¡£¡£¡£¡£¡£
ÁõÖªÔ¶»®·ÖÓÚ 2006 Äê¡¢ 2011 ÄêÓÚÇ廪´óѧÅÌËã»ú¿ÆÑ§ÓëÊÖÒÕϵ»ñµÃѧʿ¡¢²©Ê¿Ñ§Î»£¬£¬£¬£¬£¬£¬²¢ÔÚÇ廪´óѧ¿ªÕ¹²©Ê¿ºóÑо¿£¬£¬£¬£¬£¬£¬ºóÁôУÈν̡£¡£¡£¡£¡£¡£¡£ÆäÖ÷ÒªÑо¿Æ«Ïò°üÀ¨´óÄ£×ÓÊÖÒÕ¡¢×ÔÈ»ÓïÑÔ´¦Öóͷ£¡¢ÖªÊ¶Í¼Æ×ÓëÓïÒåÅÌËãÒÔ¼°Éç»áÅÌËãµÈ½¹µãÁìÓò¡£¡£¡£¡£¡£¡£¡£
ÁõÖªÔ¶ÔÚ¹ú¼ÊÖ÷Á÷ѧÊõ¾Û»áºÍÆÚ¿¯£¨ÈçNature Machine Intelligence¡¢ACL¡¢EMNLP¡¢IJCAI ºÍ AAAI£©ÉϽÒÏþÁË 200 ÓàÆªÂÛÎÄ£¬£¬£¬£¬£¬£¬Æä Google Scholar ÒýÓÃÁ¿Áè¼Ý7Íò´Î£¬£¬£¬£¬£¬£¬·´Ó¦³öÆÕ±éµÄѧÊõÓ°ÏìÁ¦¡£¡£¡£¡£¡£¡£¡£
ËûÔÚ¶àÏî¹ú¼Ò¼¶¿ÆÑÐÏîÄ¿Öе£µ±ÈÏÕæÈË»òÖ÷Òª¼ÓÈëÕߣ¬£¬£¬£¬£¬£¬Ôø»ñ½ÌÓý²¿×ÔÈ»¿ÆÑ§Ò»µÈ½±¡¢ÖйúÖÐÎÄÐÅϢѧ»áǮ೤ÖÐÎÄÐÅÏ¢´¦Öóͷ£¿ÆÑ§ÊÖÒÕ½±Ò»µÈ½±¡¢ÌìÏ»¥ÁªÍøÁìÏȿƼ¼Ð§¹û½±¡¢±±¾©ÊÐÇàÄê½ÌѧÃûʦ½±µÈ¶àÏî¿ÆÑн±Àø£¬£¬£¬£¬£¬£¬²¢ÈëÑ¡°üÀ¨¹ú¼ÒÇàÄêÈ˲ÅÍýÏë¡¢Elsevier Öйú¸ß±»ÒýѧÕß¡¢¡¶ÂéÊ¡Àí¹¤¿Æ¼¼Ì¸ÂÛ¡·ÖйúÇø¡°35 ËêÒÔÏ¿Ƽ¼Á¢Òì 35 È˰ñµ¥¡±¼°Öйú¿ÆÐÇàÄêÈ˲ÅÍоٹ¤³ÌµÈÈ˲ÅÏîÄ¿¡£¡£¡£¡£¡£¡£¡£
![]()
²Î¿¼µØµã£ºhttps://nlp.csai.tsinghua.edu.cn/~lzy/zh.html
º«Ðñ£¬£¬£¬£¬£¬£¬ËûÊÇÇ廪´óѧÅÌËã»ú¿ÆÑ§ÓëÊÖÒÕϵÖúÀíÑо¿Ô±£¬£¬£¬£¬£¬£¬Ò²ÊÇ´óÄ£×Ó¿ªÔ´ÉçÇø OpenBMB µÄ½¹µãÌᳫÈËÓëºã¾ÃÈÏÕæÈËÖ®Ò»¡£¡£¡£¡£¡£¡£¡£
º«Ðñºã¾Ã´ÓÊ´óÄ£×ÓÊÖÒÕ¡¢×ÔÈ»ÓïÑÔ´¦Öóͷ£¡¢ÖªÊ¶¹¤³ÌµÈ·½ÃæµÄÑо¿£¬£¬£¬£¬£¬£¬²¿·ÖÑо¿Ò²Éæ¼°²¢ÐÐÅÌËã¡¢Ò칹ϵͳÓÅ»¯µÈÆ«Ïò£¬£¬£¬£¬£¬£¬ÔÚ¹ú¼Ê¶¥¼¶Ñ§Êõ¾Û»á¼°ÆÚ¿¯½ÒÏþÂÛÎÄÊýʮƪ£¬£¬£¬£¬£¬£¬Google Scholar ËûÒý 1.6 ÍòÓà´Î£¬£¬£¬£¬£¬£¬Ôø»ñ½ÌÓý²¿×ÔÈ»¿ÆÑ§Ò»µÈ½±¡¢ÌìÏ»¥ÁªÍø´ó»áÁìÏȿƼ¼½±£¬£¬£¬£¬£¬£¬²¢ÈëÑ¡ÖйúÅÌËã»úѧ»á£¨CCF£©ÓŲ©¼¤ÀøÍýÏë¡¢Ç廪ÓÅÒ첩ʿºó¡¢¡¶ÂéÊ¡Àí¹¤¿Æ¼¼Ì¸ÂÛ¡·ÖйúÇø¡°35 ËêÒÔÏ¿Ƽ¼Á¢Òì 35 È˰ñµ¥¡±¡¢¼°²©Ê¿ºóÁ¢ÒìÈ˲ÅÖ§³ÖÍýÏë¡£¡£¡£¡£¡£¡£¡£
![]()
²Î¿¼Á´½Ó£ºhttps://www.cs.tsinghua.edu.cn/info/1114/6422.htm
Ф³¯¾ü£¬£¬£¬£¬£¬£¬ËûÊÇÇ廪´óѧÅÌËã»úϵ²©Ê¿ºó£¬£¬£¬£¬£¬£¬Ö÷ÒªÑо¿Æ«ÏòΪ¸ßЧ´óÄ£×Ӽܹ¹£¬£¬£¬£¬£¬£¬ÔÚNature Machine Intelligence¡¢ICML¡¢NeurIPS¡¢ICLR¡¢ACLµÈ¹ú¼Ê¶¥¼¶¾Û»á¼°ÆÚ¿¯½ÒÏþÂÛÎÄ¶àÆª£¬£¬£¬£¬£¬£¬Ôø»ñǮ೤ÖÐÎÄÐÅÏ¢´¦Öóͷ£¿ÆÑ§ÊÖÒÕ½±Ò»µÈ½±£¬£¬£¬£¬£¬£¬²©Ê¿ºóÁ¢ÒìÈ˲ÅÖ§³ÖÍýÏ룬£¬£¬£¬£¬£¬Ç廪´óѧˮľѧÕߣ¬£¬£¬£¬£¬£¬Ç廪´óѧÓÅÒ첩ʿÂÛÎĵÈÉùÓþ¡£¡£¡£¡£¡£¡£¡£
![]()
²Î¿¼Á´½Ó£ºhttps://x¼ÓÄôóÕ¹Íûpc2.8ÉñͯչÍûÆÊÎöcjthu.github.io/
Èý¹úÖ¾£º×ÇÊÀèÉÐÛ´óС£¡£¡£¡£¡£¡£¡£º95.84M°æ±¾£ºvip5.1.33ÏÂÔØ
İͷսʿ´óС£¡£¡£¡£¡£¡£¡£º35.56M°æ±¾£ºvip6.6.19ÏÂÔØ
ÃÔʧӢÐÛ´óС£¡£¡£¡£¡£¡£¡£º60.24M°æ±¾£ºvip8.8.61ÏÂÔØ
˪֮½µÁÙ´óС£¡£¡£¡£¡£¡£¡£º36.98M°æ±¾£ºvip4.3.71ÏÂÔØ
Celestial Warriors´óС£¡£¡£¡£¡£¡£¡£º58.84M°æ±¾£ºvip5.7.7ÏÂÔØ
İͷɱÊÖ´óС£¡£¡£¡£¡£¡£¡£º40.92M°æ±¾£ºvip1.6.62ÏÂÔØ

²»°Üİͷµç×Ó¾º¼¼34.84Mvip9.4.96
ÏÂÔØ
Ðǽç·¾¶ÏàÖúÓÎÏ·13.90Mvip8.6.21
ÏÂÔØ
Ìúȶ·Ê¿Space50.66Mvip1.6.46
ÏÂÔØ
Á¿×ÓʹÃüÀíÏë27.19Mvip2.1.90
ÏÂÔØ
ÃÍ»ðÈ»÷»ØºÏÖÆ28.69Mvip4.5.91
ÏÂÔØ
·çÔÆ°ÔÒµCo-opCompetitive63.62Mvip1.4.12
ÏÂÔØ
ÌìÃüÖ®×ÓMMO-MassivelyMultiplayerOnline16.79Mvip5.0.77
ÏÂÔØ
ÌúÈÆÆÏþÄ£Äâı»®27.80Mvip2.1.90
ÏÂÔØ
Èý¹úÖ¾£ºÈºÐÛÕù°ÔÌåÓýÓÎÏ·22.47Mvip5.0.96
ÏÂÔØ
˪֮½µÁÙAdventure85.79Mvip3.9.2
ÏÂÔØ