
¿ËÈÕ£¬£¬£¬£¬£¬£¬£¬£¬ÃÀÍÅÍÆ³öȫжàģ̬ͳһ´óÄ£×Ӽƻ® STAR£¨STacked AutoRegressive Scheme for Unified Multimodal Learning£©£¬£¬£¬£¬£¬£¬£¬£¬ÒÀ¸½Á¢ÒìµÄ"¶Ñµþ×Իعé¼Ü¹¹ + ʹÃüµÝ½øÑµÁ·" Ë«½¹µãÉè¼Æ£¬£¬£¬£¬£¬£¬£¬£¬ÊµÏÖÁË"Ã÷È·ÄÜÁ¦²»´òÕÛ¡¢ÌìÉúÄÜÁ¦´ï¶¥¼â" µÄË«ÖØÍ»ÆÆ¡£¡£¡£¡£¡£¡£¡£
ÔÚ GenEval£¨Îı¾ - ͼÏñ¶ÔÆë£©¡¢DPG-Bench£¨Öش󳡾°ÌìÉú£©¡¢ImgEdit£¨Í¼Ïñ±à¼£©µÈ benchmark ÖУ¬£¬£¬£¬£¬£¬£¬£¬STAR ʵÏÖÁË SOTA ÐÔÄÜ£»£»£»£»£»ÓÃ×î¼òѵÁ·Âß¼Óë½ô´ÕÄ£×ÓÉè¼ÆÈÃͳһ¶àģ̬´óÄ£×ÓÕæÕý×ßÏò¹¤Òµ¼¶Â䵨¡£¡£¡£¡£¡£¡£¡£

ÂÛÎÄÎÊÌ⣺STAR: Stacked AutoRegressive Scheme for Unified Multimodal LearningÂÛÎÄÁ´½Ó£ºhttps://arxiv.org/pdf/2512.13752ÏîÄ¿Ö÷Ò³£ºhttps://star-mm-ai.github.io´úÂëµØµã£ºhttps://github.com/MM-MVR/STARÒªº¦´Ê£ºÍ³Ò»¶àģ̬¡¢¶Ñµþ×Իع顢ʹÃü½¥½øÊ½ÑµÁ·

Ò»¡¢ÐÐҵʹµã£ºÍ³Ò»¶àģ̬´óÄ£× ¡°ÄÜÁ¦×çÖ䡱
ÔÚͨÏò AGI µÄÀú³ÌÖУ¬£¬£¬£¬£¬£¬£¬£¬½« ¡°ÊÓ¾õÃ÷È·¡± Óë ¡°Í¼ÏñÌìÉú¡± ͳһÓÚ¼òµ¥²ÎÊý¿Õ¼ä±»ÊÓΪ¶àģ̬´óÄ£×ÓµÄÊ¥±£¬£¬£¬£¬£¬£¬£¬£¬È»¶øÊµ¼ù²ãÃæÈ´ºã¾ÃÊÜÖÆÓÚ ¡°ÄÜÁ¦×çÖ䡱£¬£¬£¬£¬£¬£¬£¬£¬ÏêϸÌåÏÖΪÈýÖØÃ¬¶Ü¡£¡£¡£¡£¡£¡£¡£
1. ÓÅ»¯Ä¿µÄ»¥³â ¡ª¡ª ÓïÒå¶ÔÆëÓëÏñËØ±£ÕæµÄÁãºÍ²©ÞÄ
Ã÷ȷʹÃüµÄ½¹µãÊÇ"ÓïÒå¶ÔÆëÓëÂß¼ÍÆÀí"¡ª¡ª ºÃ±Èʶ±ðͼÏñÖеÄÎïÌå¡¢»Ø¸²Í¼ÎÄÏà¹ØÎÊÌ⣬£¬£¬£¬£¬£¬£¬£¬ÐèҪģ×Ó¾«×¼²¶»ñ¿çģ̬µÄÓïÒ幨Áª£»£»£»£»£»¶øÌìÉúʹÃüµÄ½¹µãÊÇ"ÏñËØ±£ÕæÓë´´Òâ±í´ï"¡ª¡ª ºÃ±Èƾ֤Îı¾ÐÎòÌìÉú¸ßÇåͼÏñ£¬£¬£¬£¬£¬£¬£¬£¬ÐèҪģ×Ó¼æ¹Ëϸ½Ú»¹ÔÓëÄÚÈÝÁ¬¹áÐÔ¡£¡£¡£¡£¡£¡£¡£Á½ÕßµÄÓÅ»¯Ä¿µÄ¡¢ÌØÕ÷¿Õ¼äÏÔÖø²î±ð£¬£¬£¬£¬£¬£¬£¬£¬µ¼ÖÂÍŽáѵÁ·ÏÝÈëÁãºÍ²©ÞÄ£ºÇ¿»¯ÌìÉúÄÜÁ¦£¬£¬£¬£¬£¬£¬£¬£¬Ã÷ȷ׼ȷÂÊ»áϽµ£»£»£»£»£»Éî¸ûÃ÷ȷʹÃü£¬£¬£¬£¬£¬£¬£¬£¬ÌìÉúͼÏñµÄÇåÎú¶È¡¢ÓïÒåÒ»ÖÂÐÔ»á´òÕÛ¡£¡£¡£¡£¡£¡£¡£
2. ѵÁ··¶Ê½·±¸´ ¡ª¡ª ´ÓÁãѵÁ·Óë»ìÏý¼Ü¹¹µÄË«ÖØÆ¿¾±
ÏÖÓÐÁ½ÌõÊÖÒÕõè¾¶¾ùÃæÁٸ߰ºÑµÁ·±¾Ç®£º
(1) ¶Ëµ½¶Ë´ÓÁãѵÁ·ÐèÔÚÒÚ¼¶Í¼ÎÄ - ÌìÉúÅä¶ÔÊý¾ÝÉÏ×ö¶àʹÃüƽºâ£¬£¬£¬£¬£¬£¬£¬£¬ÓÅ»¯¿Õ¼äά¶È¸ß´ïǧά£¬£¬£¬£¬£¬£¬£¬£¬³¬²ÎÃô¸ÐÐÔ³ÊÖ¸Êý¼¶·Å´ó£¬£¬£¬£¬£¬£¬£¬£¬ÑµÁ·ÖÜÆÚ³£ÒÔ ¡°Ô¡± Ϊµ¥Î»£»£»£»£»£»
(2) »ìÏý¼Ü¹¹Í¨¹ýÀ©É¢Ä£×ÓÓë×ԻعéÄ£×ÓµÄ×éºÏʵÏÖ¹¦Ð§ÁýÕÖ£¬£¬£¬£¬£¬£¬£¬£¬µ«ÐèÒªÉè¼ÆÖØ´óµÄÌØÕ÷ת»»ÇÅ£¨feature bridge£©¡¢ÌØÁíÍâÊÊÅäÆ÷£¨adapter£©»ò¸´ºÏËðʧ£¨hybrid loss£©£¬£¬£¬£¬£¬£¬£¬£¬ÔöÌíÁËÕûÌåµ÷²ÎÄѶȡ£¡£¡£¡£¡£¡£¡£
3. ÄÜÁ¦À©Õ¹ÍË»¯ ¡ª¡ª ÔÖÄÑÐÔÒÅÍüÓëÈÝÁ¿±¥ºÍ
ÔÚԤѵÁ·Ã÷È·Ö÷¸ÉÉÏÔöÁ¿ÒýÈëÌìÉúʹÃüʱ£¬£¬£¬£¬£¬£¬£¬£¬Ä£×Ó·ºÆðµä·¶µÄÔÖÄÑÐÔÒÅÍü£¨catastrophic forgetting£©£¬£¬£¬£¬£¬£¬£¬£¬Ô±¾ÉÆÓÚµÄͼÏñÎÊ´ð¡¢Âß¼ÍÆÀíÄÜÁ¦»áÏÔÖøÏ½µ¡£¡£¡£¡£¡£¡£¡£ÆäȪԴÔÚÓÚ²ÎÊýÈÝÁ¿±¥ºÍÓë±íÕ÷×ÌÈÅ ¡ª¡ª ÌìÉúʹÃüµÄÏñËØ¼¶ÈŶ¯ÔÚÌØÕ÷¿Õ¼äÐγÉÔëÉù£¬£¬£¬£¬£¬£¬£¬£¬¸Ä±äÁËÔçÆÚ¶ÔÆëµÄÓïÒåÌØÕ÷£¬£¬£¬£¬£¬£¬£¬£¬ÖÂʹ ¡°ÍòÄÜÀ©Õ¹¡± ³ÉΪ ¡°ÂÖ»»×¨¾«¡±¡£¡£¡£¡£¡£¡£¡£
ÃæÁÙÕâЩÐÐҵʹµã£¬£¬£¬£¬£¬£¬£¬£¬ÃÀÍÅ MM ÍŶÓÌá³öÁËÒ»¸öÖ±»÷½¹µãµÄÎÊÌ⣺ÄÜ·ñÔÚÍêÈ«±£´æ¶àģ̬Ã÷È·ÄÜÁ¦µÄÌõ¼þÏ£¬£¬£¬£¬£¬£¬£¬£¬Ò»Á¬¡¢¸ßЧµØÔöǿģ×ÓµÄÌìÉúÓë±à¼ÄÜÁ¦£¿£¿£¿£¿£¿£¿£¿£¿STAR ¼Æ»®µÄ½µÉú£¬£¬£¬£¬£¬£¬£¬£¬¸ø³öÁËÒ»¶¨ÇÒ¿ÉÀ©Õ¹µÄ½â´ð¡£¡£¡£¡£¡£¡£¡£
¶þ¡¢½¹µãÁ¢Òì£ºÖØ¹¹¶àģ̬ѧϰµÄ"ÄÜÁ¦Éú³¤¹æÔò"
STAR µÄÒªº¦²»ÊǼòµ¥ÊÖÒÕÍ»ÆÆ£¬£¬£¬£¬£¬£¬£¬£¬¶øÊǹ¹½¨ÁËÒ»Ì× ¡°ÄÜÁ¦µþ¼Ó²»³åÍ»¡± µÄ¶àģ̬ѧϰϵͳ£¬£¬£¬£¬£¬£¬£¬£¬½¹µãÎ§ÈÆ¡¸¶³½á»ù´¡ + ¶ÑµþÀ©Õ¹ + ·Ö½×ѵÁ·¡¹·¶Ê½£¬£¬£¬£¬£¬£¬£¬£¬Í¨¹ýÈý´ó½¹µãÉè¼ÆÊµÏÖ¡¸Ã÷È·¡¢ÌìÉú¡¢±à¼¡¹Èý´óÄÜÁ¦µÄͳһ£¬£¬£¬£¬£¬£¬£¬£¬Í¬Ê±×èÖ¹Ï໥×ÌÈÅ¡£¡£¡£¡£¡£¡£¡£Õû¸ö¿ò¼ÜÓÉ ¡°¶Ñµþͬ¹¹ AR Ä£×Ó + ʹÃüµÝ½øÑµÁ· + ¸¨ÖúÔöÇ¿»úÖÆ¡± Èý´ó²¿·ÖÐͬ×é³É¡£¡£¡£¡£¡£¡£¡£
1¡¢½¹µã¼Ü¹¹£º¶Ñµþͬ¹¹ AR Ä£×Ó£¨Stacked-Isomorphic AR£©
STAR µÄ½¹µã¼Ü¹¹Á¢Ò죬£¬£¬£¬£¬£¬£¬£¬ÊÇÆä"¶Ñµþͬ¹¹ AR Ä£¿£¿£¿£¿£¿£¿£¿£¿é" µÄÉè¼Æ£¬£¬£¬£¬£¬£¬£¬£¬³¹µ×¼ò»¯Á˶àģ̬ÄÜÁ¦À©Õ¹µÄÖØÆ¯ºó£¬£¬£¬£¬£¬£¬£¬£¬¾ÍÏñ¸øÄ£×Ó"´î»ýľ" Ò»ÑùÎÞа¸ßЧ£º
£¨1£©Í¬¹¹Éè¼Æ£¬£¬£¬£¬£¬£¬£¬£¬ÁãÊÊÅ䱾Ǯ£ºÐÂÔöµÄ¶ÑµþÄ£¿£¿£¿£¿£¿£¿£¿£¿éÓë»ù´¡ AR Ä£×Ó½ÓÄÉÍêÈ«ÏàͬµÄ¼Ü¹¹£¨×Ô×¢ÖØÁ¦»úÖÆ + ǰÀ¡Éñ¾ÍøÂ磩£¬£¬£¬£¬£¬£¬£¬£¬²ÎÊý³õʼ»¯Ö±½Ó¸´Óûù´¡Ä£×ӵĶ¥²ã²ÎÊý¡£¡£¡£¡£¡£¡£¡£ÕâÒâζ×ÅÐÂÔöÄ£¿£¿£¿£¿£¿£¿£¿£¿éÎÞÐèÖØÐÂѧϰ»ù´¡ÌØÕ÷£¬£¬£¬£¬£¬£¬£¬£¬ÄÜ¿ìËÙÊÊÅäÏÖÓÐÄ£×ÓµÄÌØÕ÷¿Õ¼ä£¬£¬£¬£¬£¬£¬£¬£¬×èÖ¹Á˹Űå»ìÏý¼Ü¹¹ÖÐ"ÌØÕ÷ת»»ÇÅ" µÄÖØ´óÉè¼Æ£»£»£»£»£»
£¨2£©µ¥Ä¿µÄѵÁ·£¬£¬£¬£¬£¬£¬£¬£¬¼«¼òÓÅ»¯£ºÎÞÐèÉè¼ÆÌØÁíÍâËðʧº¯Êý£¬£¬£¬£¬£¬£¬£¬£¬½öͨ¹ý±ê×¼µÄ"ÏÂÒ»¸ö token Õ¹Íû" Ä¿µÄ¼´¿ÉÍê³ÉÌìÉúÓë±à¼ÄÜÁ¦µÄѵÁ·¡£¡£¡£¡£¡£¡£¡£ÕâһĿµÄÓë»ù´¡Ä£×ÓµÄѵÁ·Ä¿µÄÍêȫһÖ£¬£¬£¬£¬£¬£¬£¬£¬È·±£ÁËѵÁ·Àú³ÌµÄÎȹÌÐÔ£¬£¬£¬£¬£¬£¬£¬£¬´ó·ù½µµÍµ÷²ÎÄѶȣ»£»£»£»£»
£¨3£©²ÎÊý½ô´Õ£¬£¬£¬£¬£¬£¬£¬£¬Â䵨ÓѺãºSTAR-3B ½öÔÚ Qwen2.5-VL-3B »ù´¡ÉÏÐÂÔö 1.2B ²ÎÊý£¨16 ²ã¶ÑµþÄ£¿£¿£¿£¿£¿£¿£¿£¿é£©£¬£¬£¬£¬£¬£¬£¬£¬STAR-7B ÐÂÔö 3B ²ÎÊý£¨14 ²ã¶ÑµþÄ£¿£¿£¿£¿£¿£¿£¿£¿é£©£¬£¬£¬£¬£¬£¬£¬£¬È´ÊµÏÖÁËÌìÉúÄÜÁ¦µÄ¿çԽʽÌáÉý¡£¡£¡£¡£¡£¡£¡£STAR µÄ½ô´ÕÉè¼ÆºÜÊÇÊʺϹ¤Òµ»¯°²ÅÅ£¬£¬£¬£¬£¬£¬£¬£¬ÄÜÓÐÓýµµÍÍÆÀí±¾Ç®¡£¡£¡£¡£¡£¡£¡£

2¡¢½¹µã·¶Ê½£ºÊ¹ÃüµÝ½øÊ½ÑµÁ·£¨Task-Progressive Training£©
STAR Í»ÆÆÁË´«Ò»ÇÐһģ×Ó ¡°»ìÔÚÒ»ÆðѵÁ·¡± µÄģʽ£¬£¬£¬£¬£¬£¬£¬£¬°Ñ¶àģ̬ѧϰ²ð³ÉËĽ׶εݽøÁ÷³Ì£¬£¬£¬£¬£¬£¬£¬£¬Ã¿Ò»²½¶¼¶³½áÒÑÓн¹µãÄÜÁ¦£¬£¬£¬£¬£¬£¬£¬£¬À©Õ¹ÐÂÊÖÒÕ£º
£¨1£©µÚÒ»½×¶Î£¨VQ ѵÁ·£©£ºÏÈѵÁ· ¡°Í¼Ïñ·Ö´Ê¡± ÄÜÁ¦£¬£¬£¬£¬£¬£¬£¬£¬ÑµÁ· STAR-VQ °ÑͼƬ²ð³ÉϸÁ£¶ÈÀëÉ¢ token£¬£¬£¬£¬£¬£¬£¬£¬ÎªºóÐøÌìÉú / ±à¼´òÏ»ù´¡£¡£¡£¡£¡£¡£¡£»£»£»£»£»
£¨2£©µÚ¶þ½×¶Î£¨Îı¾ÉúͼԤѵÁ·£©£ºÔÚ¶³½áµÄÃ÷È·Ä£×ÓÉÏ£¬£¬£¬£¬£¬£¬£¬£¬¶Ñµþ AR Ä£¿£¿£¿£¿£¿£¿£¿£¿éרÃÅѧÎÄÉúͼʹÃü£¬£¬£¬£¬£¬£¬£¬£¬Ö»¸üÐÂÐÂÄ£¿£¿£¿£¿£¿£¿£¿£¿é²ÎÊý£¬£¬£¬£¬£¬£¬£¬£¬²»ÅöÔÓÐÃ÷È·ÄÜÁ¦£»£»£»£»£»
£¨3£©µÚÈý½×¶Î£¨AR - À©É¢¶ÔÆëѵÁ·£©£ºµ¥¶ÀÓÅ»¯À©É¢½âÂëÆ÷£¬£¬£¬£¬£¬£¬£¬£¬ÈÃÌìÉúµÄͼƬ¸üÇåÎú£¬£¬£¬£¬£¬£¬£¬£¬ÆäËûÄ£¿£¿£¿£¿£¿£¿£¿£¿é¼á³Ö¶³½á£»£»£»£»£»
£¨4£©µÚËĽ׶Σ¨Í³Ò»Ö¸Áî΢µ÷£©£ºÍŽáѵÁ·¶Ñµþ AR ºÍÀ©É¢½âÂëÆ÷£¬£¬£¬£¬£¬£¬£¬£¬Í¬Ê±ÕÆÎÕ ¡°Éúͼ + ±à¼¡±£¬£¬£¬£¬£¬£¬£¬£¬ÓÃÌݶÈ×èÖ¹»úÖÆ×èÖ¹ÐÂʹÃü×ÌÈžÉÄÜÁ¦¡£¡£¡£¡£¡£¡£¡£
STAR ͨ¹ýʹÃüµÝ½øÊ½ÑµÁ·£¬£¬£¬£¬£¬£¬£¬£¬ÈÃÿһ²½ÐÂÄÜÁ¦µÄѧϰ¶¼²»ÆÆËðÒÑÓгÉÄÜÁ¦£¬£¬£¬£¬£¬£¬£¬£¬ÊµÏÖ ¡°Ã÷È·ÄÜÁ¦²»ÍË»¯£¬£¬£¬£¬£¬£¬£¬£¬ÌìÉú / ±à¼ÄÜÁ¦Öð²½ÔöÇ¿¡±¡£¡£¡£¡£¡£¡£¡£

3¡¢¸¨ÖúÔöÇ¿»úÖÆ£ºÁ½´ó¸Åº¦ÓÅ»¯
1. ¸ßÈÝÁ¿Í¼ÏñÁ¿»¯Æ÷£¨STAR-VQ£©
¹Å°å VQ Ä£×Ó²ð·ÖͼƬ´Ö¡¢Ï¸½Úɥʧ¶à£¬£¬£¬£¬£¬£¬£¬£¬STAR-VQ ×öÁËÁ½´óÉý¼¶£º
£¨1£©¹æÄ£À©ÈÝ£º´úÂë±¾¹æÄ£´Ó 16384 ÌáÉýµ½ 65536£¬£¬£¬£¬£¬£¬£¬£¬ÏòÁ¿Î¬¶È´Ó 8 άÌáÉýµ½ 512 ά£¬£¬£¬£¬£¬£¬£¬£¬Äܲ¶»ñ¸ü¶àͼÏñϸ½Ú£»£»£»£»£»
£¨2£©×èÖ¹Í߽⣺ͨ¹ýÐÂÔö codebook Ó³Éä²ã£¬£¬£¬£¬£¬£¬£¬£¬½â¾ö´ó codebook ѵÁ·Öг£¼ûµÄÂë±¾Íß½âÎÊÌ⣬£¬£¬£¬£¬£¬£¬£¬°ü¹ÜËùÓÐ token ¶¼Äܱ»ÓÐÓÃʹÓ㻣»£»£»£»
£¨3£©½¹µã×÷ÓãºÌìÉú¸ü¾«×¼µÄÊÓ¾õ token£¬£¬£¬£¬£¬£¬£¬£¬ÈúóÐøÌìÉú / ±à¼Ê¹ÃüÄÜ»¹Ô¸üϸÄåµÄͼÏñϸ½Ú¡£¡£¡£¡£¡£¡£¡£
2. ÒþÊ½ÍÆÀí»úÖÆ£¨Implicit Reasoning£©
ÃæÁÙÖØ´óÌáÐÑ£¬£¬£¬£¬£¬£¬£¬£¬¹Å°åÌìÉúÄ£×ÓÈÝÒ×·ºÆðÓïÒå´íλ¡¢Ï¸½ÚÒÅ©µÄÎÊÌâ¡£¡£¡£¡£¡£¡£¡£STAR µÄÒþÊ½ÍÆÀí»úÖÆ£¬£¬£¬£¬£¬£¬£¬£¬ÈÃÄ£×Óѧ»á"ÏÈÍÆÀí£¬£¬£¬£¬£¬£¬£¬£¬ÔÙÌìÉú"£º
£¨1£©µ±ÎüÊÕµ½ÖØ´óÌáÐÑʱ£¬£¬£¬£¬£¬£¬£¬£¬¶³½áµÄ»ù´¡ AR Ä£×ÓÏȾÙÐÐÍÆÀí£¬£¬£¬£¬£¬£¬£¬£¬ÌìÉúÔ̺¬½¹µã֪ʶµÄÒþʽ latent tokens£»£»£»£»£»
£¨2£©ÕâЩ latent tokens ×÷ΪÌõ¼þÊäÈ룬£¬£¬£¬£¬£¬£¬£¬Ö¸µ¼¶ÑµþÄ£¿£¿£¿£¿£¿£¿£¿£¿é¾ÙÐÐͼÏñÌìÉú¡£¡£¡£¡£¡£¡£¡£ÕâÒ»Éè¼ÆÊµÏÖÁË"ÓïÒåÍÆÀí" Óë"ÏñËØÌìÉú" µÄ½âñ£¬£¬£¬£¬£¬£¬£¬ÈÃÌìÉúÀú³Ì¸üÓÐÂß¼£¬£¬£¬£¬£¬£¬£¬£¬´ó·ùÌáÉýÁËÖØ´ó³¡¾°ÏµÄÓïÒå¶ÔÆë¶È¡£¡£¡£¡£¡£¡£¡£
Èý¡¢ÊµÑéЧ¹û
STAR µÄÍ»ÆÆÐÔÌåÏÖ£¬£¬£¬£¬£¬£¬£¬£¬»ñµÃÁËȨÍþ benchmark µÄÖÜÈ«ÑéÖ¤£¬£¬£¬£¬£¬£¬£¬£¬ÔÚÃ÷È·¡¢ÌìÉú¡¢±à¼Èý´óʹÃüÖоùÕ¹ÏÖ³ö¶¥¼âʵÁ¦¡£¡£¡£¡£¡£¡£¡£
1. ÌìÉúʹÃü£º
ÔÚÎı¾ - ͼÏñÌìÉúµÄ½¹µã benchmark ÖУ¬£¬£¬£¬£¬£¬£¬£¬STAR µÄÌåÏÖ¾ªÑÞ£º
£¨1£©GenEval£¨ÓïÒå¶ÔÆëȨÍþ benchmark£©£ºSTAR-7B ÒÔ 0.91 µÄ×ۺϵ÷ÖˢРSOTA¡£¡£¡£¡£¡£¡£¡£ÔÚÎïÌ弯Êý¡¢ÑÕÉ«ÊôÐÔ¡¢¿Õ¼ä¹ØÏµ¡¢ÊµÌåÊôÐÔµÈ 6 ¸ö×ÓʹÃüÖУ¬£¬£¬£¬£¬£¬£¬£¬STAR ÓÐ 5 ÏîÅÅÃûµÚÒ»£»£»£»£»£»
£¨2£©DPG-Bench£¨Öش󳡾°ÌìÉú benchmark£©£ºSTAR-7B ÒÔ 87.44 µÄµÃ·ÖÁìÏÈ£¬£¬£¬£¬£¬£¬£¬£¬ÔÚ¶àÎïÌå×éºÏ¡¢Öش󳡾°ÐÎòµÈʹÃüÖÐÌåÏÖÍ»³ö£¬£¬£¬£¬£¬£¬£¬£¬ÌìÉúµÄͼÏñ²»µ«Ï¸½Ú¸»ºñ£¬£¬£¬£¬£¬£¬£¬£¬»¹Äܾ«×¼»¹ÔÎı¾ÖеÄÂß¼¹ØÏµ£»£»£»£»£»
£¨3£©WISEBench£¨ÌìÏÂÖªÊ¶ÍÆÀí benchmark£©£ºSTAR-7B ÒÔ 0.66 µÄ×ۺϵ÷֣¬£¬£¬£¬£¬£¬£¬£¬ÓâԽͬÀàͳһģ×Ó£¬£¬£¬£¬£¬£¬£¬£¬Ö¤ÊµÆäÒþÊ½ÍÆÀí»úÖÆÄÜÓÐÓÃʹÓÃÌìÏÂ֪ʶ£¬£¬£¬£¬£¬£¬£¬£¬ÌáÉýÖØ´óÌáÐѵÄÌìÉúÖÊÁ¿¡£¡£¡£¡£¡£¡£¡£


2. ±à¼Ê¹Ãü£º
ÔÚͼÏñ±à¼ benchmark ÖУ¬£¬£¬£¬£¬£¬£¬£¬STAR Õ¹ÏÖ³öǿʢµÄÎÞаÊÊÅäÄÜÁ¦£¬£¬£¬£¬£¬£¬£¬£¬Äܾ«×¼ÏìÓ¦"Ìí¼ÓÎïÌå¡¢Ìæ»»Åä¾°¡¢µ÷½âÆø¸Å¡¢É¾³ýÔªËØ" µÈÖÖÖÖ±à¼Ö¸Á
£¨1£©ImgEdit£¨ÁýÕÖ 9 Àà±à¼Ê¹Ãü£©£ºSTAR-7B ÒÔ 4.34 µÄ×ۺϵ÷ÖˢРSOTA¡£¡£¡£¡£¡£¡£¡£ÔÚ"ÎïÌåÌáÈ¡"" Ðж¯±à¼" µÈ×ÓʹÃüÖУ¬£¬£¬£¬£¬£¬£¬£¬µÃ·Ö»®·ÖµÖ´ï 4.19¡¢4.60£¬£¬£¬£¬£¬£¬£¬£¬ÁìÏÈͬÀàÄ£×Ó£»£»£»£»£»
£¨2£©MagicBrush£¨ÓïÒå±à¼ benchmark£©£ºSTAR-7B µÄ CLIP-I µÃ·Ö´ï 0.934£¨ÓïÒåÒ»ÖÂÐÔ£©£¬£¬£¬£¬£¬£¬£¬£¬L1 Îó²îµÍÖÁ 0.056£¨ÏñËØ±£Õæ¶È£©¡£¡£¡£¡£¡£¡£¡£ÕâÒâζ×Å STAR ÔÚÍê³É±à¼Ê¹ÃüµÄͬʱ£¬£¬£¬£¬£¬£¬£¬£¬ÄÜ×îºéÁ÷ƽ±£´æÔͼµÄ½¹µãÄÚÈÝ£¬£¬£¬£¬£¬£¬£¬£¬×èÖ¹"Ì«¹ý±à¼" »ò"ÓïÒ寫Àë"¡£¡£¡£¡£¡£¡£¡£


3. Ã÷ȷʹÃü£º
¼´±ãרעÓÚÔöÇ¿ÌìÉúÓë±à¼ÄÜÁ¦£¬£¬£¬£¬£¬£¬£¬£¬STAR µÄÃ÷È·ÄÜÁ¦ÒÀÈ»¼á³Ö¶¥¼âˮƽ¡£¡£¡£¡£¡£¡£¡£ÔÚ 9 ´óȨÍþÃ÷È· benchmark ÖУ¬£¬£¬£¬£¬£¬£¬£¬STAR µÄÌåÏÖÁìÏÈÓÚͬÀà¶àģ̬ģ×Ó¡£¡£¡£¡£¡£¡£¡£

ËÄ¡¢×ܽáÓëÕ¹Íû
STAR µÄʵÖÊÊÇ ¡°ÓÃ×Á·µÄ½á¹¹ÊµÏÖ×îÖÜÈ«µÄÄÜÁ¦Í³Ò»¡±£ºÍ¨¹ý ¡°Ê¹ÃüµÝ½ø¡± ½â¾öѵÁ·³åÍ»£¬£¬£¬£¬£¬£¬£¬£¬Í¨¹ý ¡°¶Ñµþͬ¹¹ AR¡± ½µµÍÀ©Õ¹±¾Ç®£¬£¬£¬£¬£¬£¬£¬£¬Í¨¹ý ¡°STAR-VQ + ÒþÊ½ÍÆÀí¡± ÌáÉýÄÜÁ¦ÉÏÏÞ£¬£¬£¬£¬£¬£¬£¬£¬×îÖÕʵÏÖ ¡°Ã÷È·¡¢ÌìÉú¡¢±à¼¡± Èý´óʹÃüµÄ¶¥¼âÐÔÄÜ£¬£¬£¬£¬£¬£¬£¬£¬Îª¶àģ̬ģ×ӵĿÉÒ»Á¬À©Õ¹ÌṩÁËÈ«ÐÂ˼Ð÷¡£¡£¡£¡£¡£¡£¡£
STAR Ϊ¶àģ̬ģ×ÓµÄÎÞ×ÌÈÅ¡¢¿ÉÀ©Õ¹À©Õ¹ÌṩÁËÈ«ÐÂÊÖÒÕ·¾¶£¬£¬£¬£¬£¬£¬£¬£¬ºóÐø¿É´ÓÒÔÏÂÆ«Ïò½øÒ»²½Ì½Ë÷£º
£¨1£©ÄÜÁ¦½çÏßÀ©Õ¹£ºÔÚÏÖÓÐÃ÷È·¡¢ÌìÉú¡¢±à¼»ù´¡ÉÏ£¬£¬£¬£¬£¬£¬£¬£¬ÄÉÈëÊÓÆµÌìÉú¡¢3D ÖØÐ޵ȸüÖØ´óµÄ¶àģ̬ʹÃü£¬£¬£¬£¬£¬£¬£¬£¬ÑéÖ¤¿ò¼ÜµÄ·º»¯ÐÔ£»£»£»£»£»
£¨2£©Ð§ÂÊÓÅ»¯£ºÄ¿½ñÄ£×ÓÈÔÐè¶à½×¶ÎѵÁ·£¬£¬£¬£¬£¬£¬£¬£¬Î´À´¿É̽Ë÷¸ü¸ßЧµÄÍŽáѵÁ·Õ½ÂÔ£¬£¬£¬£¬£¬£¬£¬£¬»òÇáÁ¿»¯¶ÑµþÄ£¿£¿£¿£¿£¿£¿£¿£¿éÒÔ½µµÍ°²Åű¾Ç®£»£»£»£»£»
£¨3£©ÍÆÀíÄÜÁ¦É£º½øÒ»²½Ç¿»¯ÒþÊ½ÍÆÀí»úÖÆ£¬£¬£¬£¬£¬£¬£¬£¬ÍŽáÍⲿ֪ʶ¿â»òÇ¿»¯Ñ§Ï°£¬£¬£¬£¬£¬£¬£¬£¬ÌáÉýÄ£×ÓÔÚ³¬ÖØ´óÂß¼¡¢¿çÁìÓò֪ʶ³¡¾°ÏµÄÌìÉú׼ȷÐÔ£»£»£»£»£»
£¨4£©¶àģ̬ÈÚºÏÉý¼¶£ºÍØÕ¹ÎÄÖØÇìÓ庣¸Ûº½ÎïÁ÷ÓÐÏÞ¹«Ë¾±¾¡¢Í¼ÏñÖ®ÍâµÄģ̬£¨ÈçÓïÒô¡¢´¥¾õ£©£¬£¬£¬£¬£¬£¬£¬£¬¹¹½¨¸üÖÜÈ«µÄͨÓöàģ̬ϵͳ£¬£¬£¬£¬£¬£¬£¬£¬ÍƸÐÈ˹¤Í¨ÓÃÖÇÄÜ£¨AGI£©µÄÉú³¤¡£¡£¡£¡£¡£¡£¡£