ÈË´ó&ͨÒ壺IterResearchÓÃ40KÉÏÏÂÎÄÇáËÉʵÏÖ2048ÂÖ½»»¥²»ÍË»¯
2026-03-04 01:52:56

ÒÔ 40K ÉÏÏÂÎÄ £¬£¬£¬£¬ £¬Èà Agent ËÑË÷ 2048 ÂÖ £¬£¬£¬£¬ £¬ÐÔÄÜ»¹ÄÜÒ»ÆðÕÇ£¿ £¿£¿£¿£¿ £¿£¿£¿ÕâÏÕЩÊDz»¿ÉÏëÏóµÄ¡£¡£¡£¡£¡£

Ä¿½ñÖ÷Á÷µÄ Search Agent ¶¼ÃæÁÙͳһ¸öÞÏÞΣºAgent ÐèÒªÖØ¸´ËÑË÷ÍøÒ³¡¢±È¶ÔÏßË÷¡¢ÑéÖ¤¼ÙÉè¡¢»ØËÝÐÞÕý £¬£¬£¬£¬ £¬½»»¥Âִζ¯éüÊýÊ®ÉϰÙÂÖ¡£¡£¡£¡£¡£µ«ÒÔ ReAct Ϊ´ú±íµÄ¹Å°å·¶Ê½ £¬£¬£¬£¬ £¬°ÑÿһÂÖµÄ˼Ë÷ºÍ¹¤¾ß·µ»ØÐ§¹ûÒ»Ö±×·¼Óµ½Í³Ò»¸öÉÏÏÂÎÄ´°¿ÚÖÐ ¡ª¡ª ×öµÃÔ½¶à £¬£¬£¬£¬ £¬ÉÏÏÂÎÄÔ½Ó·Ö× £¬£¬£¬£¬ £¬Áô¸øÍÆÀíµÄ¿Õ¼äÔ½ÉÙ £¬£¬£¬£¬ £¬ÔçÆÚµÄÔëÉùºÍ¹ýʧ·¾¶»¹±»ÓÀÊÀ¡¸º¸ËÀ¡¹ÔÚÓ°ÏóÀï¡£¡£¡£¡£¡£

Ч¹û¾ÍÊÇ£ºAgent ËѵÃÔ½ÉîÈë £¬£¬£¬£¬ £¬·´¶ø¡¸Ï롹µÃÔ½ºýÍ¿¡£¡£¡£¡£¡£

Äܲ»¿ÉÈà Agent ÔÚ̽Ë÷Àú³ÌÖÐÒ»Ö±¡¸ÕûÀíÊÂÇę́¡¹ £¬£¬£¬£¬ £¬Ê¼ÖÕÔÚÒ»¸öÇå½àµÄ¿Õ¼äÀï˼Ë÷£¿ £¿£¿£¿£¿ £¿£¿£¿

À´×ÔÖйúÈËÃñ´óѧÓë°¢Àï°Í°ÍͨÒåʵÑéÊÒµÄÑо¿ÍŶÓÌá³öÁË IterResearch £¬£¬£¬£¬ £¬Ò»ÖÖȫеĵü´úʽÉî¶ÈÑо¿·¶Ê½¡£¡£¡£¡£¡£

ͨ¹ýÂí¶û¿É·òʽµÄÊÂÇé¿Õ¼äÖØ¹¹ £¬£¬£¬£¬ £¬IterResearch Èà Agent ÔÚ½ö 40K ÉÏÏÂÎij¤¶ÈÏÂÍê³ÉÁË 2048 ´Î¹¤¾ß½»»¥ÇÒÐÔÄܲ»Ë¥¼õ £¬£¬£¬£¬ £¬ÔÚ BrowseComp ÉÏ´Ó 3.5% Ò»ÆðÅÊÉýÖÁ 42.5%¡£¡£¡£¡£¡£

ÏÖÔÚ £¬£¬£¬£¬ £¬¸ÃÂÛÎÄÒѱ» ICLR 2026 ÎüÊÕ¡£¡£¡£¡£¡£

ÂÛÎÄÁ´½Ó£ºhttps://arxiv.org/pdf/2511.07327´úÂëÁ´½Ó£ºhttps://github.com/Chen-GX/IterResearch

¡¸¶ÑÉÏÏÂÎÄ¡¹ÎªÊ²Ã´ÄÑÒÔʵÏÖ Interaction Scaling£¿ £¿£¿£¿£¿ £¿£¿£¿

ÔÚ Search Agent ³¡¾°Ï £¬£¬£¬£¬ £¬Agent µÄÊÂÇéʵÖÊÉÏÊÇÒ»¸öÓëÍⲿÇéÐÎÒ»Ö±½»»¥µÄÑ­»·¡£¡£¡£¡£¡£¹Å°å ReAct ·¶Ê½½«ÕâÒ»Àú³Ì½¨Ä£Îª¡¸µ¥ÉÏÏÂÎĶѵþ¡¹£ºÃ¿Ò»ÂÖµÄÍÆÀíºÍ¹¤¾ß·µ»Ø±»Ò»Á¬×·¼Óµ½Í³Ò»¸öÉÏÏÂÎÄ´°¿ÚÖÐ £¬£¬£¬£¬ £¬ÐγÉÏßÐÔÔöÌíµÄÓ°ÏóÁ´¡£¡£¡£¡£¡£

ÕâÖÖ¿´ËÆ×ÔÈ»µÄÉè¼Æ £¬£¬£¬£¬ £¬ÔÚ³¤³ÌʹÃüÖлáÒý·¢Á½¸ö½á¹¹ÐÔÎÊÌ⣺

ÆäÒ»ÊÇÉÏÏÂÎÄÖÏÏ¢£¨context suffocation£©£ºÉÏÏÂÎÄ´°¿ÚµÄ×ÜÈÝÁ¿ÊÇÓÐÏÞµÄ £¬£¬£¬£¬ £¬ÀúÊ·ÐÅϢһֱȺ¼¯Òâζ×ÅÁô¸øºóÐøÍÆÀíµÄ¡¸ÌìÉúÔ¤Ë㡹±»Ò»Á¬Ñ¹Ëõ¡£¡£¡£¡£¡£Agent ±»ÆÈ¸ø³ö¸ü¶Ì¡¢¸üdzµÄ»Ø¸² £¬£¬£¬£¬ £¬×îÖÕ»¬Ïòç¢Â©µÄ½áÂÛ£» £» £»£»£»£»£» £»Æä¶þÊÇÔëÉùÎÛȾ£¨noise contamination£©£ºËÑË÷Àú³ÌÖб¬·¢µÄ´ó×ÚÍøÒ³ÕªÒª¡¢ÔçÆÚµÄ¹ýʧ·¾¶ºÍÎÞ¹ØÏßË÷±»ÓÀÊÀдÈëÉÏÏÂÎÄ £¬£¬£¬£¬ £¬¶ÔºóÐøÍÆÌêÍ·Éú¼¶Áª×ÌÈÅ £¬£¬£¬£¬ £¬ÐÅÔë±ÈÒ»Á¬×ߵ͡£¡£¡£¡£¡£

ÉçÇøÒѾ­Òâʶµ½ÁËÕâЩÎÊÌâ £¬£¬£¬£¬ £¬Â½ÐøÌá³öÁË context folding¡¢summary µÈ»º½âÕ½ÂÔ £¬£¬£¬£¬ £¬ÊÔͼΪҡҡÓû×¹µÄÉÏÏÂÎÄ¡¸ÐøÃü¡¹¡£¡£¡£¡£¡£µ«ÕâЩҪÁìʵÖÊÉÏÊÇÔÚµ÷½â £¬£¬£¬£¬ £¬²¢Î´´Ó»ù´¡ÉϸıäÉÏÏÂÎÄÏßÐÔÔöÌíµÄ½á¹¹ ¡ª¡ª ¸ø Agent 256K ÉõÖÁ¸ü³¤µÄ´°¿Ú £¬£¬£¬£¬ £¬Ò²Ö´ÙÇÍÆ³ÙÍ߽⠣¬£¬£¬£¬ £¬¶ø·Ç×èÖ¹Í߽⡣¡£¡£¡£¡£

²»ÔÙ¡¸¶Ñµþ¡¹ £¬£¬£¬£¬ £¬¶øÊÇ¡¸Öع¹¡¹£ºIterResearch µÄ½¹µã˼Ð÷

IterResearch ¶ÔÕâÒ»ÎÊÌâµÄ»ØÓ¦²»ÊÇÐÞÐÞ²¹²¹ £¬£¬£¬£¬ £¬¶øÊÇ´Ó·¶Ê½²ãÃæÖØÐÂ˼Ë÷£ºÓëÆäÒ»Ö±ÍùÉÏÏÂÎÄÀïÈû¹¤¾ß £¬£¬£¬£¬ £¬²»ÈçÈà Agent ѧ»á¡¸±ß×ö±ßÕûÀí¡¹¡£¡£¡£¡£¡£

Ñо¿ÍŶӽ«³¤³ÌÑо¿Àú³ÌÐÎʽ»¯ÎªÒ»¸öÂí¶û¿É·ò¾öÒéÀú³Ì£¨MDP£©¡£¡£¡£¡£¡£½¹µãÍ·ÄÔÊÇ£ºAgent ²»ÔÙά»¤Ò»¸öÒ»Ö±ÅòÕ͵ÄÍêÕûÀúÊ· £¬£¬£¬£¬ £¬¶øÊÇͨ¹ýÒ»¸öÒ»Á¬½ø»¯µÄ¡¸ÑݽøÊ½±¨¸æ¡¹£¨evolving report£©À´×ÛºÏÒÑÓÐÓùû¡¢Ñ¹ËõÎÞ¹ØÐÅÏ¢¡¢¸üÐÂÍÆÀí״̬¡£¡£¡£¡£¡£Ã¿Ò»ÂÖÍÆÀí¶¼ÔÚÒ»¸ö±»Öع¹¹ýµÄ¡¢ºã¶¨ÖØÆ¯ºóµÄÊÂÇé¿Õ¼äÖÐÕö¿ª¡£¡£¡£¡£¡£

ÏêϸÀ´Ëµ £¬£¬£¬£¬ £¬Agent µÄÿһ²½°üÀ¨Á½¸ö½¹µãÐж¯£º

¾öÒé½×¶Î£ºAgent »ùÓÚÄ¿½ñ״̬ £¬£¬£¬£¬ £¬Êä³öÈý²¿·Ö ¡ª¡ª ˼Ë÷Àú³Ì£¨Think£©¡¢¸üкóµÄÑݽø±¨¸æ£¨Report£©ºÍ±¾ÂÖ¹¤¾ßŲÓÃÇëÇó£¨Action£©¡£¡£¡£¡£¡£±¨¸æÔÚÕâÀïÊÎÑÝÁË¡¸Ñ¹ËõÓ°Ï󡹵ĽÇÉ« £¬£¬£¬£¬ £¬Agent ÐèÒªÔÚÿһÂÖ×Ô¶¯¾öÒéÄÄЩÐÅÏ¢ÖµµÃ±£´æ £¬£¬£¬£¬ £¬ÄÄЩӦ¸Ã±»ÑïÆú¡£¡£¡£¡£¡£×´Ì¬×ªÒƽ׶Σº½øÈëÏÂÒ»ÂÖʱ £¬£¬£¬£¬ £¬ÍêÕûµÄÀúÊ·¹ì¼£±»ÓÐÒâÑïÆú £¬£¬£¬£¬ £¬Agent ½ö±£´æ¸üкóµÄ±¨¸æ¡¢ÉÏÒ»ÂֵŤ¾ßŲÓü°Æä·µ»ØÐ§¹û £¬£¬£¬£¬ £¬ÈýÕßÅäºÏ×é³ÉеÄÍÆÀíÆðµã¡£¡£¡£¡£¡£

´ÓÉÏÏÂÎÄÖÎÀíµÄÊӽǿ´ £¬£¬£¬£¬ £¬¹Å°å ReAct µÄ״̬¿Õ¼äËæ½»»¥ÂÖ´Î t ÏßÐÔÔöÌí£¨O (t)£© £¬£¬£¬£¬ £¬¶ø IterResearch µÄÊÂÇé¿Õ¼äʼÖÕ¼á³Öºã¶¨£¨O (1)£©¡£¡£¡£¡£¡£

Ñо¿ÍŶÓÖ¸³ö £¬£¬£¬£¬ £¬ÕâÖÖ»úÖÆÓë RNN/LSTM ÖеÄÒþ״̬¸üÐÂÓнṹÉϵÄÏàËÆÐÔ ¡ª¡ª ¶¼Í¨¹ýÒ»¸öÒþ״̬À´³ÐÔØÓ°Ïó²¢Ö𲽸üС£¡£¡£¡£¡£²î±ðÖ®´¦ÔÚÓÚ £¬£¬£¬£¬ £¬IterResearch µÄ¡¸Òþ״̬¡¹ÊÇÒ»·ÝÏÔʽ¡¢¿ÉÚ¹Ê͵ÄÑо¿±¨¸æ £¬£¬£¬£¬ £¬¼ÈÄÜŨËõÀúÊ· £¬£¬£¬£¬ £¬ÓÖÄÜΪÏÂÒ»²½ÍÆÀíÌṩÇåÎúµÄÆðµã¡£¡£¡£¡£¡£

40K ÉÏÏÂÎÄ £¬£¬£¬£¬ £¬2048 ÂÖ½»»¥²»ÍË»¯£ºInteraction Scaling µÄÍþÁ¦

ÕâÏîÊÂÇéÖÐ×î½¹µãµÄ·¢Ã÷ £¬£¬£¬£¬ £¬¾ÍÊÇ Interaction Scaling ÌØÕ÷ ¡ª¡ª¸ø Agent ¸ü¶àµÄ½»»¥Ô¤Ëã £¬£¬£¬£¬ £¬ÐÔÄܾÍÄÜÒ»Á¬ÌáÉý £¬£¬£¬£¬ £¬¶ø²»»áÏñ¹Å°åÒªÁìÄÇÑùÓÉÓÚÉÏÏÂÎÄÒç³ö¶øÍ߽⡣¡£¡£¡£¡£

ÔÚ BrowseComp »ù×¼ÉÏ £¬£¬£¬£¬ £¬Ñо¿ÍŶӽ« Agent µÄ×î´ó½»»¥ÂÖ´Î´Ó 2 Öð²½·Å¿íµ½ 2048¡£¡£¡£¡£¡£Ð§¹ûÏÔʾ £¬£¬£¬£¬ £¬IterResearch µÄ׼ȷÂÊ´Ó 3.5% Ò»ÆðÅÊÉýµ½ 42.5% £¬£¬£¬£¬ £¬ÇÒÔÚ 2048 ÂÖʱÒÀȻûÓзºÆðÏÔ×ŵÄÍË»¯¼£Ï󡣡£¡£¡£¡£¶ø¹Å°åµ¥ÉÏÏÂÎÄÒªÁìÔÚ¼¸Ê®ÂÖºó¾ÍÒѾ­²»¿°Öظº¡£¡£¡£¡£¡£

ÖµµÃÇ¿µ÷µÄÊÇ £¬£¬£¬£¬ £¬2048 ²¢·Ç IterResearch µÄ½»»¥ÉÏÏÞ £¬£¬£¬£¬ £¬¶ø½öÊÇʵÑéÆÀ²â¹æÄ£µÄÖյ㡣¡£¡£¡£¡£Ä£×ÓÔÚ 2048 ÂÖʱÐÔÄÜÇúÏßÈÔ¼á³ÖÉÏÉýÇ÷ÊÆ £¬£¬£¬£¬ £¬Åú×¢¸Ã·¶Ê½ÔÚÀíÂÛÉϾ߱¸½øÒ»²½À©Õ¹µÄDZÁ¦¡£¡£¡£¡£¡£

ÕâһЧ¹ûת´ïÁËÒ»¸öÖ÷ÒªÐźţº³¤³ÌʹÃüµÄ¡¸ÄÑ¡¹ £¬£¬£¬£¬ £¬¿ÉÄܲ¢·ÇÍêÈ«À´×ÔÄ£×ÓÍÆÀíÄÜÁ¦È±·¦ £¬£¬£¬£¬ £¬¸üÓпÉÄÜÊÇ̽Ë÷Éî¶ÈÊÜÏÞ¡£¡£¡£¡£¡£µ± Agent ÓµÓÐÒ»¸öÇå½àµÄÍ·ÄԿռ䲢±»ÔÊÐí³ä·Ö̽Ë÷ʱ £¬£¬£¬£¬ £¬ËüȷʵÓÐÄÜÁ¦ÔÚ³¬³¤Ê¹ÃüÖÐÒ»Á¬Ç°½ø¡£¡£¡£¡£¡£

ÁíÒ»¸öÓÐÒâ˼µÄ·¢Ã÷ÊÇ£ºÖ»¹Ü×î´óÂִα»ÉèÖÃΪ 2048 £¬£¬£¬£¬ £¬Agent ÏÖʵÉÏÆ½¾ùÖ»ÓÃÁËÔ¼ 80 ÂÖ¡£¡£¡£¡£¡£Ëüѧ»áÁËÔÚ»ñÈ¡×ã¹»ÐÅÏ¢ºó×Ô¶¯ÖÕÖ¹ £¬£¬£¬£¬ £¬¶ø·Ç»úеµØºÄ¾¡Ô¤Ëã ¡ª¡ª Õâ˵Ã÷Agent ²»µ«Ñ§»áÁË¡¸×ßµÃÔ¶¡¹ £¬£¬£¬£¬ £¬»¹Ñ§»áÁË¡¸ÖªµÀºÎʱͣ¡¹¡£¡£¡£¡£¡£

¡¸¼´²å¼´Óá¹µÄÍÆÀí·¶Ê½£º²»ÑµÁ·Ò²ÄÜÌáÉý±ÕÔ´Ä£×Ó

ÈôÊǽö°Ñ IterResearch µÄµü´úÂß¼­×÷ΪÌáÐÑÕ½ÂÔ£¨prompting strategy£© £¬£¬£¬£¬ £¬Ö±½ÓÓ¦ÓÃÓÚ±ÕÔ´Ä£×Ó¶ø²»×öÈκÎѵÁ· £¬£¬£¬£¬ £¬Ð§¹û»áÔõÑù£¿ £¿£¿£¿£¿ £¿£¿£¿

Ñо¿ÍŶÓÔÚ o3 ºÍ DeepSeek-V3.1 ÉÏ×öÁËÑéÖ¤¡£¡£¡£¡£¡£ÔÚÍêÈ«ÏàͬµÄʹÃüÉ趨Ï £¬£¬£¬£¬ £¬Ïà±È¹Å°åµÄ ReAct ÌáÊ÷ģʽ £¬£¬£¬£¬ £¬IterResearch ÔÚ×î¾ßÌôÕ½Ð﵀ BrowseComp ÉÏ»®·ÖΪ o3 ´øÀ´ÁË 12.7 ¸ö°Ù·Öµã¡¢Îª DeepSeek-V3.1 ´øÀ´ÁË 19.2 ¸ö°Ù·ÖµãµÄÌáÉý¡£¡£¡£¡£¡£

Õâ˵Ã÷IterResearch µÄ½¹µãÓÅÊÆÔÚÓڽṹÐÔµÄÈÏÖª»úÖÆ £¬£¬£¬£¬ £¬¶ø·ÇÒÀÀµÌض¨Ãü¾Ý»ò΢µ÷¼¼ÇÉ¡£¡£¡£¡£¡£ÎÞÂ۵ײãÄ£×ÓÊÇʲô¼Ü¹¹ £¬£¬£¬£¬ £¬Ëü´¥¼°µÄ¶¼Êdz¤³ÌÍÆÀíÖеĹ²ÐÔÆ¿¾±¡£¡£¡£¡£¡£

×ܽá

IterResearch Ìá³öÁËÒ»¸ö¾«Á·¶øÓÐÓõķ¶Ê½×ª»»£ºÓëÆäÒ»Ö±ÐÞ²¹Ò»¸ö×¢¶¨»áÍß½âµÄÏßÐÔÉÏÏÂÎÄ £¬£¬£¬£¬ £¬²»Èç´Ó½á¹¹ÉÏÈà Agent ѧ»á¡¸±ß×ö±ßÖØ¹¹Ïëά¡¹¡£¡£¡£¡£¡£

Õâһ˼Ð÷ÔÚѵÁ·¿ò¼Ü¡¢ÌáÐÑÕ½ÂԺͿ緶ʽǨáãÈý¸ö²ãÃæ¶¼Õ¹ÏÖÁËÒ»ÖµÄÓÐÓÃÐÔ £¬£¬£¬£¬ £¬¶øÆäÕ¹ÏÖµÄ Interaction Scaling ÌØÕ÷¸üÊÇΪ³¤³Ì Agent µÄÄÜÁ¦½çÏß·­¿ªÁËеÄÏëÏó¿Õ¼ä¡£¡£¡£¡£¡£ÔÚ Agent ×ßÏòÕæÕýºã¾Ã¡¢Ò»Á¬ÔËÐеÄδÀ´ £¬£¬£¬£¬ £¬IterResearch ÌṩÁËÒ»¸öÖµµÃ¹Ø×¢µÄÆ«Ïò¡£¡£¡£¡£¡£

×÷ÕßÏÈÈÝ

µÚÒ»×÷Õ߳¹úöÎ £¬£¬£¬£¬ £¬ÖйúÈËÃñ´óѧδÀ´¿µ½¡ÓÃÆ·ÓÐÏÞ¹«Ë¾¸ßê²È˹¤ÖÇÄÜѧԺ²©Ê¿Éú £¬£¬£¬£¬ £¬µ¼Ê¦ÎªÕÔöνÌÊÚºÍËÎ £» £»£»£»£»£» £»ª½ÌÊÚ £¬£¬£¬£¬ £¬Ñо¿Æ«ÏòΪ LLM ÍÆÀíÓë Agent £¬£¬£¬£¬ £¬¾Û½¹ËÑË÷ÖÇÄÜÌåÓë´úÂëÖÇÄÜÌå¡£¡£¡£¡£¡£ÔøÔÚ°¢Àï°Í°ÍͨÒåʵÑéÊҵȻú¹¹ÊµÏ° £¬£¬£¬£¬ £¬ÔÚ ICLR¡¢ICML¡¢NeurIPS¡¢ACL µÈ¶¥¼¶¾Û»á½ÒÏþ¶àƪÂÛÎÄ¡£¡£¡£¡£¡£±¾ÊÂÇéÓÉÖйúÈËÃñ´óѧÓë°¢Àï°Í°ÍͨÒåʵÑéÊÒÏàÖúÍê³É¡£¡£¡£¡£¡£