³¬6ÍòGitHubÏîĿʵ²â£ºAgentд´úÂëЧÂʱ©ÕÇ£¬£¬ £¬£¬£¬£¬Í¨¹ýÂÊÈÔÂäÎéÈËÀà
2026-03-01 10:42:22

µ± AI Óà 3 ÌìÍê³ÉÈËÀà¾û¿Æ×Ô¶¯»¯×±±¸ÓÐÏÞ¹«Ë¾³ÌÐòÔ±Ô­±¾3ÄêµÄ´úÂëʹÃüÁ¿£¬£¬ £¬£¬£¬£¬ÈËÀàµÄ½ÇÉ«»á±¬·¢ÔõÑùµÄת±ä£¿£¿£¿£¿ £¿

Ä¿½ñ£¬£¬ £¬£¬£¬£¬AI ÕýÔÚ´Ó¹¤¾ß±äΪÈËÀàµÄ¡°¶ÓÓÑ¡±¡£¡£¡£¡£¡£¡£¡£Ëæ×Å´óÄ£×ӵļÓËÙÉú³¤£¬£¬ £¬£¬£¬£¬AI ÔÚÈí¼þ¹¤³ÌÁìÓòµÄ×÷ÓÃÒѲ»ÔÙÊǸ¨Öú´úÂ벹ȫ£¬£¬ £¬£¬£¬£¬¶øÊÇÕýÔÚ³ÉΪ¿É×ÔÖ÷±àÂëµÄÖÇÄÜÌ壨Agent£©¡£¡£¡£¡£¡£¡£¡£

ÏÖÔÚ£¬£¬ £¬£¬£¬£¬ÎÒÃÇÖ»ÐèÏò AI ÐÎò´úÂëÏëҪʵÏֵĹ¦Ð§£¬£¬ £¬£¬£¬£¬Ëü¾ÍÄÜ×Ô¶¯ÌìÉúÍêÕû´úÂ룻£»£»£»£»£»£»£»½èÖú Agent£¬£¬ £¬£¬£¬£¬ÉõÖÁÄÜÔÚÊ®¼¸·ÖÖÓÄÚÍê³ÉǧÐм¶±ðµÄ´úÂëÌìÉú»òÐ޸ġ£¡£¡£¡£¡£¡£¡£

½üÆÚ£¬£¬ £¬£¬£¬£¬¼ÓÄôóÅ®Íõ´óѧ²©Ê¿ºóÀîºÀÓëËùÔÚÍŶÓÔÚÒ»ÏîÑо¿ÖÐÊ״ι¹½¨ÁËÒ»¸ö´ó¹æÄ£Êý¾Ý¼¯ AIDev£¬£¬ £¬£¬£¬£¬ÏµÍ³ÆÊÎöºÍͳ¼ÆÁË×ÔÖ÷±àÂë Agent ÔÚ 7,000 ¶à¸ö½ÏÊ¢ÐеÄÈí¼þÖеÄÏÖʵÌåÏÖºÍÓ°Ïì¡£¡£¡£¡£¡£¡£¡£

ÆäÁýÕÖ¹æÄ£°üÀ¨ÔÚ GitHub ƽ̨ÉÏÒÑÌá½»µÄ³¬ 45.6 ÍòÌõ Agent ´úÂëºÏ²¢ÇëÇó£¨PR£¬£¬ £¬£¬£¬£¬pull requests£©£¬£¬ £¬£¬£¬£¬º­¸Ç 6.1 Íò¸ö´úÂë¿âºÍ 4.7 ÍòÃû¿ª·¢Õߣ¬£¬ £¬£¬£¬£¬°üÀ¨Ö÷Á÷µÄ AI ±àÂ빤¾ß OpenAI Codex¡¢GitHub Copilot¡¢Devin¡¢Cursor ºÍ Claude Code¡£¡£¡£¡£¡£¡£¡£

ͼحÀîºÀ£¨ÈªÔ´£ºÊÜ·ÃÕߣ©

Ñо¿Ö°Ô±ÔÚ AI ÁìÓòºÍÈí¼þ¹¤³Ì×öÏà¹ØÑо¿Ê±£¬£¬ £¬£¬£¬£¬ÍùÍù»áÑ¡ÔñÓà SWE-bench ×ö²âÊÔ£¬£¬ £¬£¬£¬£¬Í¨¹ý½»¸ø AI һЩ¸ßÖÊÁ¿¡¢ÓвâÊÔÑùÀýµÄʹÃü£¬£¬ £¬£¬£¬£¬À´ÓÅ»¯ AI ÐÔÄÜÒÔ¼°ÓÅ»¯ÏµÍ³Éè¼ÆµÈ¡£¡£¡£¡£¡£¡£¡£

µ«ÕâÒ²´øÀ´ÁËÐí¶àÌôÕ½ÐÔµÄÎÊÌâ¡£¡£¡£¡£¡£¡£¡£ÀýÈ磬£¬ £¬£¬£¬£¬Ò»¼Ò¹«Ë¾ÈôÊǽ«²âÊÔÎÊÌâÓÃÓÚѵÁ·Ä£×Ó£¬£¬ £¬£¬£¬£¬¼«ÓпÉÄÜÒò¡°×÷±×¡±µ¼Ö·ÖÊýÐé¸ß¡£¡£¡£¡£¡£¡£¡£±ðµÄ£¬£¬ £¬£¬£¬£¬ÓÉÓÚ SWE-bench ÊÇÒ»¸ö¾²Ì¬µÄ»ù×¼¼¯£¨benchmark£©£¬£¬ £¬£¬£¬£¬²¿·ÖÊý¾ÝÓпÉÄܱ£´æÒ»¶¨ÖͺóÐÔ¡£¡£¡£¡£¡£¡£¡£

ÀîºÀÖ¸³ö£¬£¬ £¬£¬£¬£¬¸ÃÑо¿×î´óµÄ²î±ðµãÔÚÓÚ£¬£¬ £¬£¬£¬£¬AIDev ÊÇÕæÊµÌìÏ¡¢´ó¹æÄ£¡¢ÊµÊ±ÊÕÂÞÊý¾ÝµÄÊý¾Ý¼¯£¬£¬ £¬£¬£¬£¬¸üÌù½üÓÚÒµ½çʵ¼ùºÍÉú²ú¡£¡£¡£¡£¡£¡£¡£±ðµÄ£¬£¬ £¬£¬£¬£¬Ñо¿Ö°Ô±»¹¿ÉÒÔʹÓøÃÊý¾Ý¼¯´òÔì¸üÐ嵀 benchmark¡£¡£¡£¡£¡£¡£¡£

£¨ÈªÔ´£ºarXiv£©

Ñо¿ÍŶÓÔÚ AI ±àÂë Agent µÄËÙÂʺÍÖÊÁ¿·½ÃæÕÒµ½ÁËÓÐȤµÄ·¢Ã÷¡£¡£¡£¡£¡£¡£¡£Ò»Ïî¸öÀýÆÊÎöЧ¹ûÏÔʾ£¬£¬ £¬£¬£¬£¬Óпª·¢ÕßÔÚʹÓà AI ±àÂë Agent ºó£¬£¬ £¬£¬£¬£¬3 ÌìÄÚÍê³ÉµÄʹÃüÁ¿¿¿½üÆäÒÑÍù 3 ÄêµÄ×ÜÁ¿¡£¡£¡£¡£¡£¡£¡£

¶ø AI ÔÚ×ÔÈ»ÓïÑÔ´¦Öóͷ£·½ÃæµÄÓÅÊÆ£¬£¬ £¬£¬£¬£¬Ò²Í¬ÑùÖµµÃ¹Ø×¢¡£¡£¡£¡£¡£¡£¡£ËûÃÇ·¢Ã÷£¬£¬ £¬£¬£¬£¬AI ÔÚ±àд´úÂë»òÎı¾·½ÃæµÄʹÃüÖÐÌåÏÖÓÅÒ죬£¬ £¬£¬£¬£¬ÀýÈç´ÓÎĵµÏà¹ØµÄºÏ²¢ÇëÇó½ÓÊÜÂÊÀ´¿´£¬£¬ £¬£¬£¬£¬OpenAI Codex ºÍ Claude Code »®·ÖΪ 88.6% ºÍ 85.7%£¬£¬ £¬£¬£¬£¬¶øÈËÀàÔڸ÷½ÃæÌåÏÖΪ 76.5%¡£¡£¡£¡£¡£¡£¡£

£¨ÈªÔ´£ºarXiv£©

ºÏ²¢ÇëÇó½ÓÊÜÂÊÊÇȨºâ AI ²ú³öÖÊÁ¿ºÍ¿ÉÐŶȵÄÒªº¦Ö¸±ê£¬£¬ £¬£¬£¬£¬ËüÓëÈËÀ࿪·¢Õß/ÏîĿά»¤Õß¶Ô AI Т˳µÄÈϿɶÈÇ×½üÏà¹Ø¡£¡£¡£¡£¡£¡£¡£¸ÃÍŶӻ¹·¢Ã÷£¬£¬ £¬£¬£¬£¬±àÂë Agent µÄºÏ²¢ÇëÇó½ÓÊÜÂʱÈÈËÀ࿪·¢ÕßµÍ 15% ÖÁ 40%£¨²î±ðʹÃüÀàÐÍÏÂÇø¼ä²î±ðÏÔÖø£©£¬£¬ £¬£¬£¬£¬ÓÈÆäÊÇÔÚй¦Ð§¿ª·¢¡¢ÐÞ¸´ Bug µÈÖØ´óµÄʹÃü·½Ãæ¡£¡£¡£¡£¡£¡£¡£ÀýÈ磬£¬ £¬£¬£¬£¬OpenAI Codex µÄ PR ½ÓÊÜÂÊΪ 64%£¬£¬ £¬£¬£¬£¬¶øÈËÀ࿪·¢ÕßµÄ PR ½ÓÊÜÂʸߴï 76.8%¡£¡£¡£¡£¡£¡£¡£

ÕâÒâζ×Å£¬£¬ £¬£¬£¬£¬AI д´úÂë²¢·ÇÖÜÈ«ÓâÔ½ÁËÈËÀà¡£¡£¡£¡£¡£¡£¡£ÐèÒª¿´µ½µÄÊÇ£¬£¬ £¬£¬£¬£¬Ö»¹ÜÏÖÔÚ AI ±àÂë Agent ÌìÉúËÙÂʺܿ죬£¬ £¬£¬£¬£¬µ«ÐÔÄÜ·½ÃæÉÐÓÐһЩȱÏÝ£¬£¬ £¬£¬£¬£¬ÔڽṹÉÏÒ²Ïà¶Ô½Ï¼òÆÓ£¬£¬ £¬£¬£¬£¬ÐèÒªÑо¿Ö°Ô±¼ÌÐø¶ÔÆä¾ÙÐÐÔöÇ¿£¬£¬ £¬£¬£¬£¬ÒÔÈ·±£´úÂëµÄºã¾Ã¿Éά»¤ÐÔ¡£¡£¡£¡£¡£¡£¡£

ÀîºÀ¶Ô DeepTech ÌåÏÖ£º¡°¶ÌÆÚ¿´£¬£¬ £¬£¬£¬£¬AI Agent µÄ´úÂë½ÓÊÜÂÊÏà¶ÔÈËÀà½ÏµÍ£¬£¬ £¬£¬£¬£¬Ð§ÂÊÓëÖÊÁ¿µÄÈ¡ÉáÈÔÐèȨºâ£¨trade-off£©£¬£¬ £¬£¬£¬£¬µ«ÕâÖÖÄ¥ºÏÆÚ¶ÔÓ¦µÄÊÇÊý¾Ý·ÉÂֵįô¶¯½×¶Î£¬£¬ £¬£¬£¬£¬ÐγɷÉÂÖЧӦºó£¬£¬ £¬£¬£¬£¬ÎÒÃÇÓÐÍû»ñµÃÉú²úÁ¦µÄÏÔÖøÌáÉý¡£¡£¡£¡£¡£¡£¡£¡±

£¨ÈªÔ´£ºarXiv£©

¸ÃÑо¿Í¨Ì«¹ýÎö×ÔÖ÷±àÂë Agent µÄÌåÏÖ£¬£¬ £¬£¬£¬£¬ÎªÎ´À´¸üºÃµØÓÅ»¯ÈËÓë AI Э×÷ÌṩÁËÊý¾Ý»ù´¡¡£¡£¡£¡£¡£¡£¡£ÕâÒ²´øÀ´ÁËÒ»ÖÖȫеÄÌìÉúģʽ£¬£¬ £¬£¬£¬£¬¿ª·¢ÕßÃæÁÙµÄÎÊÌâ²»ÊÇÔõÑùд¸ü¶àµÄ´úÂ룬£¬ £¬£¬£¬£¬¶øÊǽӵ½Ò»ÏîʹÃüºó£¬£¬ £¬£¬£¬£¬ÔõÑù²ð·Ö³É¸üϸµÄʹÃü£¬£¬ £¬£¬£¬£¬ÔÙÖÎÀíÕâЩ AI ¸üºÃµØÖ´ÐС£¡£¡£¡£¡£¡£¡£

¡°¸ÃÆ«ÏòÔÚѧ½çºÍ¹¤Òµ½ç»¹±£´æ½Ï´óµÄ¿Õȱ¡£¡£¡£¡£¡£¡£¡£±à³ÌÖ°Ô±µÄ½ÇɫҲ»áÖð½¥´Óд´úÂëµÄÈË£¬£¬ £¬£¬£¬£¬×ª»»³ÉÌṩ´úÂëÉó²é»òÌṩÖÎÀíģʽµÄÈË¡£¡£¡£¡£¡£¡£¡£ÏÖÔÚ£¬£¬ £¬£¬£¬£¬ÎÒÃÇÒ²ÔÚ×öÏà¹ØµÄÑо¿£¬£¬ £¬£¬£¬£¬À´Ì½Ë÷ÐÂÒ»´úÈí¼þ¿ª·¢Á÷³ÌÀ´Ö§³Ö¿ª·¢ÕßÃÇʹÓà AI Agent¡£¡£¡£¡£¡£¡£¡£¡±ÀîºÀÌåÏÖ¡£¡£¡£¡£¡£¡£¡£

±ðµÄÑо¿»¹Õ¹ÏÖ³ö£¬£¬ £¬£¬£¬£¬Ö»¹Ü AI µÄ·ºÆðÍÆ¶¯ÁËÈË»úЭͬÉó²éÁ÷³Ì£¬£¬ £¬£¬£¬£¬µ«Í¬Ê±Ò²¿ÉÄÜ»á´øÀ´Ë½¼ûµÈÎÊÌâ¡£¡£¡£¡£¡£¡£¡£ÀýÈ磬£¬ £¬£¬£¬£¬ÈôÊÇ AI д´úÂëµÄ Agent ÓëÉó²é´úÂëµÄ»úеÈË×Ôͳһ¹«Ë¾£¬£¬ £¬£¬£¬£¬ºÜÓпÉÄÜÔÚAIÉó²é»·½ÚºöÊÓÄ³Ð©ÌØ¶¨ÀàÐ͵Ĺýʧ¡£¡£¡£¡£¡£¡£¡£

ÔÚδÀ´µÄÑо¿ÖУ¬£¬ £¬£¬£¬£¬¸ÃÍŶÓÍýÏ뽨Éè¸üÖÜÈ«µÄ benchmark£¬£¬ £¬£¬£¬£¬¶Ô AI ±à³Ì Agent ¾ÙÐÐÕæÊµµÄÌåÏÖÆÀ²â¡£¡£¡£¡£¡£¡£¡£ËûÃÇ»¹ÍýÏ뽨ÉèÐÂ֪ʶ¿â£¬£¬ £¬£¬£¬£¬Íƶ¯ÁìÓòÄÚµÄÑо¿Ö°Ô±ÅäºÏË¢ÐÂÏà¹ØÆ«Ïò£¬£¬ £¬£¬£¬£¬°üÀ¨ÔõÑù¸üºÃµØÕ¹ÍûºÍÆÊÎöAI¿ÉÄܵÄʧ°Ü³¡¾°£¬£¬ £¬£¬£¬£¬ÒÔ¼°Ê§°ÜÔµ¹ÊÔ­ÓɵÈ¡£¡£¡£¡£¡£¡£¡£´Ó¸ü¾ÃÔ¶µÄÉú³¤À´¿´£¬£¬ £¬£¬£¬£¬Ì½Ë÷¸ü×Ô¶¯»¯Óë±ê×¼»¯µÄÉó²é»úÖÆ£¬£¬ £¬£¬£¬£¬Ò²ÊÇÒ»¸öÖµµÃÉîÈëÑо¿µÄÆ«Ïò¡£¡£¡£¡£¡£¡£¡£

Ïà¹ØÂÛÎÄÒÔ¡¶Èí¼þ¹¤³Ì 3.0 ÖÐ AI ¶ÓÓѵÄáÈÆð£º×ÔÖ÷±àÂë Agent ÔõÑùÖØËÜÈí¼þ¹¤³Ì¡·£¨The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering£©ÎªÌâ½ÒÏþÔÚ arXiv[1]¡£¡£¡£¡£¡£¡£¡£ÏÖÔÚ£¬£¬ £¬£¬£¬£¬Ïà¹Ø´úÂëÒÑÔÚ GitHub ¿ªÔ´¡£¡£¡£¡£¡£¡£¡£

²Î¿¼×ÊÁÏ£º

1.Ïà¹ØÂÛÎÄ£ºhttps://arxiv.org/abs/2507.15003v1

2.AIDev Êý¾Ý¼¯»ñÈ¡£¡£¡£¡£¡£¡£¡£ºhttps://github.com/SAILResearch/AI_Teammates_in_SE3

ÅŰ棺ºúÀò»¨

¾û¿Æ×Ô¶¯»¯×±±¸ÓÐÏÞ¹«Ë¾