
?28jndÕ¹Íûpc?ΪÄãÌṩ28jndÕ¹ÍûpcAPP°²×¿°æÏÂÔØ£¬£¬£¬£¬£¬£¬ÀúÊ·°æ±¾¡¢¾É°æÏÂÔØ£¬£¬£¬£¬£¬£¬Éó²é×îÐÂ28jndÕ¹ÍûpcÊÖ»ú°æÏÈÈÝ¡¢Ó¦ÓýØÍ¼¡¢ÍøÓÑ̸ÂÛ£¬£¬£¬£¬£¬£¬Àû±ã¿ì½ÝµÄ½«°²×¿°æ28jndÕ¹ÍûpcÓ¦ÓÃÃâ·ÑÏÂÔØµ½ÊÖ»ú¡£¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿îÕæÊµµÄ¼ÝʻģÄâÓÎÏ·£¬£¬£¬£¬£¬£¬ÓÎÏ·ÈÃÄãÕæÊµÌåÑé¼ÝÊ»µÄÐËȤ£¬£¬£¬£¬£¬£¬²¢ÇÒÓÐʵʱµÄ¼ÝÊ»ÒôЧ´úÈ룬£¬£¬£¬£¬£¬»¹ÄܸÄ×°×Ô¼ºµÄ³µÁ¾£¬£¬£¬£¬£¬£¬Ìá¸ßÆû³µµÄÐÔÄÜ£¬£¬£¬£¬£¬£¬È«ÐµÄ3D»ÖÊÉè¼Æ£¬£¬£¬£¬£¬£¬ÏßÉϵØÍ¼Éè¼ÆÖØ´ó£¬£¬£¬£¬£¬£¬ÈÃÄãí§ÒâÕö¿ªåÛÓΣ¬£¬£¬£¬£¬£¬Ï²»¶¾ÍÀ´ÓÎÏ·ÀïʵÑéһϰɡ£¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿îÊ®·Ö´Ì¼¤µÄðÏÕ½âÃÕÌÓ×ßÓÎÏ·£¬£¬£¬£¬£¬£¬Íæ¼ÒÐèҪͨ¹ý̽Ë÷Õâ¸ö¿Õ¼ä£¬£¬£¬£¬£¬£¬×ÐϸµØÊÓ²ìÇéÐΣ¬£¬£¬£¬£¬£¬²¢ÔËÓÃ×Ô¼ºµÄÍÆÀíÄÜÁ¦£¬£¬£¬£¬£¬£¬½«ÕâЩÏßË÷ÁªÏµÆðÀ´£¬£¬£¬£¬£¬£¬ÒÔ»ñµÃÌÓ×ßµÄÒªº¦ÐÅÏ¢¡£¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿î¶þ´ÎÔª¿¨ÅÆÓÎÏ·£¬£¬£¬£¬£¬£¬ÓÎÏ·ÓµÓи»ºñµÄʳÁ飬£¬£¬£¬£¬£¬Ï¸ÄåµÄ»Ã棬£¬£¬£¬£¬£¬¸»ºñµÄÓÎÏ·¸£Àû£¬£¬£¬£¬£¬£¬¶àÑùµÄÓÎÏ·Íæ·¨£¬£¬£¬£¬£¬£¬¿ÉÒÔ¸øÓèÍæ¼Ò·×ÆçÑùµÄÓÎÏ·ÌåÑé¼°ÐËȤ¡£¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿î¿¨ÅÆÀྺ¼¼ÓÎÏ·£¬£¬£¬£¬£¬£¬Íæ¼ÒÄܹ»ÍøÂçÖÖÖÖ¸÷ÑùµÄ¹ÖÎï¿¨ÅÆ£¬£¬£¬£¬£¬£¬²¢ÔÚÕ½¶·ÖÐ×éºÏ³ö×îǿʢµÄ¿¨×飬£¬£¬£¬£¬£¬×ÔÓɵÄÈ¥Íê³ÉÖÖÖֹؿ¨Ã°ÏÕ£¬£¬£¬£¬£¬£¬»ñµÃÎÞÏÞµÄÕ½¶·Òâ¼ûÒâÒå¡£¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿îÕ½ÕùÕ½ÂÔÓÎÏ·£¬£¬£¬£¬£¬£¬ÓÎÏ·ÓµÓÐȫʱ¶¯Ì¬Õ½¶·£¬£¬£¬£¬£¬£¬¸»ºñ¶à²ÊµÄÍæ·¨£¬£¬£¬£¬£¬£¬×ª±äβâµÄÕ½¶·µÈÌØÉ«£¬£¬£¬£¬£¬£¬¿ÉÒÔ¸øÓèÍæ¼Ò¼¤ÇéÒâ¼ûÒâÒåÓÎÏ·ÌåÑé¡£¡£¡£¡£¡£¡£¡£¡£
ÊÇÒ»¿îȫд´ÒâÁíÀàÍæ·¨µÄ¶ÔÕ½¾º¼¼Ã°ÏÕÊÖÓΣ¬£¬£¬£¬£¬£¬Âܲ·µ¶×ÅÊöÕ½ÓÎÏ·½ÓÄÉ¿¨Í¨¸´¹ÅÏñËØ»³¾É»ÖÊ£¬£¬£¬£¬£¬£¬¹Ø¿¨³¡¾°¸»ºñ´Ì¼¤¾«²Ê£¬£¬£¬£¬£¬£¬Âܲ·µ¶×ÅÊöÕ½ÓÎÏ·Ê®·ÖÄÍÍæÓÐÒâ˼¡£¡£¡£¡£¡£¡£¡£¡£
»úе֮Ðı༲¿
28jndÕ¹ÍûpcÀ©É¢ÓïÑÔÄ£×Ó£¨Diffusion Language Models, DLLMs£©ÒòÆä¶àÖÖDZÔÚµÄÌØÕ÷¶ø±¸ÊܹØ×¢£¬£¬£¬£¬£¬£¬ÈçÄܼÓËٵķÇ×Իغϲ¢ÐÐÌìÉúÌØÕ÷£¬£¬£¬£¬£¬£¬ÄÜÖ±½ÓÆð²Ý±à¼µÄÌØÕ÷£¬£¬£¬£¬£¬£¬ÄÜÊý¾ÝÔöÇ¿µÄÌØÕ÷¡£¡£¡£¡£¡£¡£¡£¡£È»¶ø£¬£¬£¬£¬£¬£¬ÆäÄ£×ÓÄÜÁ¦ÍùÍùÂäÎéÓÚÒ»ÂɹæÄ£µÄÇ¿Á¦×Իع飨AR£©Ä£×Ó¡£¡£¡£¡£¡£¡£¡£¡£
¿ËÈÕ£¬£¬£¬£¬£¬£¬»ªÖпƼ¼´óѧºÍ×Ö½ÚÌø¶¯ÍŽáÍÆ³öÁËStable-DiffCoder¡£¡£¡£¡£¡£¡£¡£¡£Õâ²»µ«½öÊÇÒ»¸öеÄÀ©É¢´úÂëÄ£×Ó£¬£¬£¬£¬£¬£¬¸üÊÇÒ»´Î¹ØÓÚ ¡¸À©É¢ÑµÁ·ÄÜ·ñÌáÉýÄ£×ÓÄÜÁ¦ÉÏÏÞ¡¹ µÄÉî¶È̽Ë÷¡£¡£¡£¡£¡£¡£¡£¡£
Stable-DiffCoder ÔÚÍêÈ«¸´Óà Seed-Coder ¼Ü¹¹¡¢Êý¾ÝµÄÌõ¼þÏ£¬£¬£¬£¬£¬£¬Í¨¹ýÒýÈëBlock Diffusion Ò»Á¬Ô¤ÑµÁ·£¨CPT£©¼°Ò»ÏµÁÐÎȹÌÐÔÓÅ»¯Õ½ÂÔ£¬£¬£¬£¬£¬£¬ÀÖ³ÉʵÏÖÁËÐÔÄÜ·´³¬¡£¡£¡£¡£¡£¡£¡£¡£ÔÚ ¶à¸ö Code Ö÷Á÷°ñµ¥ÉÏ£¨Èç MBPP£¬£¬£¬£¬£¬£¬BigCodeBench µÈ£©£¬£¬£¬£¬£¬£¬Ëü²»µ«»÷°ÜÁËÆä AR ÔÐÍ£¬£¬£¬£¬£¬£¬¸üÔÚ 8B ¹æÄ£ÏÂÓâÔ½ÁË Qwen2.5-Coder £¬£¬£¬£¬£¬£¬Qwen3£¬£¬£¬£¬£¬£¬DeepSeek-Coder µÈÒ»ÖÚÇ¿Á¦¿ªÔ´Ä£×Ó£¬£¬£¬£¬£¬£¬Ö¤ÊµÎúÀ©É¢ÑµÁ··¶Ê½×Ô¼º¾ÍÊÇÒ»ÖÖǿʢµÄÊý¾ÝÔöÇ¿ÊֶΡ£¡£¡£¡£¡£¡£¡£¡£
![]()
ÂÛÎÄÎÊÌ⣺Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language ModelÂÛÎÄÁ´½Ó: https://arxiv.org/pdf/2601.15892Github Á´½Ó: https://github.com/ByteDance-Seed/Stable-DiffCoderÄ£×ÓÁ´½Ó: https://huggingface.co/collections/ByteDance-Seed/stable-diffcoder
![]()
À©É¢Àú³ÌÄÑÒÔ¸ßЧѧϰÑù±¾ÖªÊ¶
À©É¢Àú³ÌËäÈ»ÍâòÉÏ¿ÉÒÔÀ©³äÐí´ó¶¼¾Ý£¬£¬£¬£¬£¬£¬¿ÉÒÔ×÷Ϊһ¸öÊý¾ÝÔöÇ¿µÄÊֶΣ¬£¬£¬£¬£¬£¬¿ÉÊÇÏÖʵÉÏ»áÒýÈëÐí¶àÔëÉùÉõÖÁ¹ýʧ֪ʶµÄѧϰ¡£¡£¡£¡£¡£¡£¡£¡£
ÀýÈçÏÂÃæµÄÀý×Ó£º
½«Æä mask ³É
![]()
¿ÉÒÔ·¢Ã÷¹ØÓÚ×îºóÒ»¸ö mask_n£¬£¬£¬£¬£¬£¬ÆäÖ»ÄÜÔÚÍû¼û a=1£¬£¬£¬£¬£¬£¬b=2 µÄÇéÐÎÏÂȥѧϰ a+b=7£¬£¬£¬£¬£¬£¬»áÐγɹýʧµÄ֪ʶӳÉä¡£¡£¡£¡£¡£¡£¡£¡£×îºó³äÆäÁ¿Ò²Ö»ÄÜѧµ½£¬£¬£¬£¬£¬£¬a=3£¬£¬£¬£¬£¬£¬b=4 ÔÚ a+b = Õâ¸öÓï¾³ÏµĹ²ÏÖ¸ÅÂʸü´óÒ»µã£¬£¬£¬£¬£¬£¬²»¿Éѧµ½Ã÷È·µÄ¼Ó¹æÔòÔò¡£¡£¡£¡£¡£¡£¡£¡£
token ÍÆÀíµÄ֪ʶºÍÁ÷³ÌÉè¼Æ
ÂÛÎÄͨ¹ý½¨Ä£Õâ¸ö֪ʶµÄѧϰÀ´Ú¹ÊÍÕâ¸öÕ÷Ïó£º
![]()
¼ÙÉè c ÊÇÄ¿½ñ¿É¼ûµÄÑù±¾£¬£¬£¬£¬£¬£¬Æ¾Ö¤ÕæÊµÂþÑÜͨ¹ýÕâЩÑù±¾ÔÚÄ¿½ñλÖÃÄܹ»ÍÆÀí³öµÄ token ÜöÝÍΪ C (c)£¬£¬£¬£¬£¬£¬¾ÞϸΪ K (c)£¨ÕâÀï¶à¸ö token Í¬Ê±ÍÆÀíµÄÇé¾°Ò»Ö£¬£¬£¬£¬£¬£¬Òò´ËÖ»¼òÆÓµÄ˼Á¿µ¥¸ö token ÍÆÀí£©¡£¡£¡£¡£¡£¡£¡£¡£ÓÉÓÚʹÓõÄÕæÊµÂþÑÜÀ´½ç˵µÄ£¬£¬£¬£¬£¬£¬ÒÔÊÇ c Ô½¶àÔ½Çå½àµÄʱ¼ä£¬£¬£¬£¬£¬£¬K (c) ԽС¡£¡£¡£¡£¡£¡£¡£¡£
![]()
Òò´Ë£¬£¬£¬£¬£¬£¬ÈôÊÇÓô¿Ë«ÏòµÄÀ©É¢Àú³Ì£¬£¬£¬£¬£¬£¬ÔÚ mask ±ÈÀý½Ï´óµÄʱ¼ä£¬£¬£¬£¬£¬£¬Ä¿½ñ token ¼ûµ½µÄ c ±äС£¬£¬£¬£¬£¬£¬²»Çå½àµÄ¸ÅÂʱä´ó£¬£¬£¬£¬£¬£¬µ¼Ö K (c) ±ä´ó£¬£¬£¬£¬£¬£¬ÄÑÒÔÓ³Éäµ½ÇåÎúµÄ¹æÔò¡£¡£¡£¡£¡£¡£¡£¡£Í¬Ê±Æä»á±¬·¢»á±¬·¢ÖÖÖÖ¸÷ÑùµÄ c£¬£¬£¬£¬£¬£¬Æ½¾ùÿ¸ö c µÄѧϰÁ¿»á¼õС¡£¡£¡£¡£¡£¡£¡£¡£ÁíÍ⣬£¬£¬£¬£¬£¬»¹Òª°ü¹ÜѵÁ·²ÉÑùµÄ c ¸úÍÆÀíÓÃµÄ c ÊÇÒ»Öµģ¬£¬£¬£¬£¬£¬²Å»ª¸üºÃµÄʹÓÃѵÁ·Ñ§Ï°µÄ֪ʶ¡£¡£¡£¡£¡£¡£¡£¡£
½ÓÏÂÀ´ÂÛÎÄͨ¹ýÔÚ 2.5B µÄÄ£×ÓÉè¼ÆÊµÑéÀ´½øÒ»²½²ûÊͲ¢Ö¤ÊµÕâ¸ö½áÂÛ¡£¡£¡£¡£¡£¡£¡£¡£ÂÛÎÄ´ÓÒ»¸ö AR model ³õʼ»¯£¬£¬£¬£¬£¬£¬È»ºóѵÁ·Ò»¶ÎеÄ֪ʶ¡£¡£¡£¡£¡£¡£¡£¡£ÂÛÎÄÉè¼ÆÁË 3 ¸öѵÁ··½·¨À´Ì½Ë÷£º
![]()
£¨1£©AR->BiDLLM: Óà AR µÄ·½·¨¼ÌÐøÑµÁ·£¬£¬£¬£¬£¬£¬ÔÚ 100k step µÄʱ¼ä CPT ³ÉË«ÏòµÄ DLLM¡£¡£¡£¡£¡£¡£¡£¡£
£¨2£©ARDLLM->BiDLLM: Óà AR µÄ½á¹¹£¬£¬£¬£¬£¬£¬¿ÉÊÇʹÓô¿Ë«ÏòµÄ²ÉÑùģʽÀ´ÑµÁ·¡£¡£¡£¡£¡£¡£¡£¡£È»ºó 100k step CPT ³É BiDLLM¡£¡£¡£¡£¡£¡£¡£¡£
£¨3£©BiDLLM£ºÊ¹Óô¿Ë«ÏòµÄ DLLM ѵÁ·¡£¡£¡£¡£¡£¡£¡£¡£
¿ÉÒÔ·¢Ã÷£¬£¬£¬£¬£¬£¬×îºóЧ¹ûÊÇ£¨1£©>£¨2£©>£¨3£©£¬£¬£¬£¬£¬£¬ÕâÒ²ÇкÏÇ°ÃæµÄÀíÂÛ¡£¡£¡£¡£¡£¡£¡£¡£²»±ØËæ»ú [MASK] µÄ£¨1£©¼Æ»®¹ØÓÚ֪ʶÓиü¿ìµÄѹËõËÙÂÊ£¬£¬£¬£¬£¬£¬²¢ÇÒת»»³É BiDLLM Ò²¼á³Ö×Å×î¼ÑÐÔÄÜ£¬£¬£¬£¬£¬£¬Õâ¿ÉÒÔ֤ʵÔÚÒª¸ßЧµÄѧºÃÒ»¸ö DLLM£¬£¬£¬£¬£¬£¬¿ÉÒÔÓà AR »òÕßС block size µÄ block diffusion À´¾ÙÐÐ֪ʶѹËõ¡£¡£¡£¡£¡£¡£¡£¡£ÁíÍâÓÐȤµÄÊÇ£¬£¬£¬£¬£¬£¬ÔÚ block=32 ʱ£¨1£©ºÍ£¨2£©µÄÌåÏֱȣ¨3£©²î£¬£¬£¬£¬£¬£¬¿ÉÊÇÔÚ 100k Ö®ºóÌåÏֱȣ¨3£©ºÃ¡£¡£¡£¡£¡£¡£¡£¡£100k ֮ǰ¿ÉÒÔ˵Ã÷£¬£¬£¬£¬£¬£¬AR ²ÉÑùµÄ c ¸ú block size=32 ÍÆÀíÀú³ÌµÄ c ²»Ì«Æ¥Å䣬£¬£¬£¬£¬£¬¿ÉÊÇÓÉÓÚ AR ѹËõÁË´ó×ÚÓÐÓõÄ֪ʶ£¬£¬£¬£¬£¬£¬ÉÔ΢ CPT һϾÍÄÜÊÊÅäÕâÖÖÍÆÀíÀú³Ì¡£¡£¡£¡£¡£¡£¡£¡£Í¬Ê±Ò²¿ÉÒÔ˵Ã÷£¬£¬£¬£¬£¬£¬AR ÕâÖֽṹµÄÏÈÑ飬£¬£¬£¬£¬£¬¿ÉÄܸüÊÊºÏ prompt+response ÕâÖÖ´Ó×ó²à×îÏÈÍÆÀíµÄÀú³Ì¡£¡£¡£¡£¡£¡£¡£¡£
Òò´ËÎÒÃǽ«ÑµÁ·Á÷³ÌÉè¼ÆÎª£¬£¬£¬£¬£¬£¬ÏÈÓà AR ѹËõÒ»±é֪ʶ£¬£¬£¬£¬£¬£¬È»ºóÓà AR ÍË»ðµÄǰһ¸ö checkpoint ¼ÌÐø CPT ³ÉС block µÄ block diffusion£¬£¬£¬£¬£¬£¬À´Ì½Ë÷ diffusion Àú³ÌµÄÊý¾ÝÔöÇ¿ÄÜÁ¦¡£¡£¡£¡£¡£¡£¡£¡£
ÎÈ¹ÌµÄ DLLM warmup Õ½ÂÔÒ»Á¬Ô¤ÑµÁ·Éè¼Æ
À©É¢Ä£×ÓµÄÒ»Á¬Ô¤ÑµÁ·Í¨³£¶Ô³¬²ÎÊýµÄÉè¼Æ£¨ÈçѧϰÂÊ£©ºÜÊÇÃô¸Ð£¬£¬£¬£¬£¬£¬ÈÝÒ×·ºÆð grad norm µÄÒì³£±ä¸ß£¬£¬£¬£¬£¬£¬ÕâÒ²»áÊܵ½ÖÖÖÖѵÁ·¼Ü¹¹µÄÓ°Ïì¡£¡£¡£¡£¡£¡£¡£¡£ÎªÁ˼á³ÖÖÖÖÖѵÁ·¼Ü¹¹µÄѧϰÎȹ̣¬£¬£¬£¬£¬£¬ÒÔ¼°·±Ôӵĵ÷²ÎÀú³Ì£¬£¬£¬£¬£¬£¬ÍŶÓÉè¼ÆÁËÒ»ÖÖÊÊÅäµÄ warmup Õ½ÂÔ¡£¡£¡£¡£¡£¡£¡£¡£
![]()
DLLM µÄ CPT Àú³Ì²»ÎȹÌÖ÷ÒªÊܵ½ÏÂÃæ 3 ¸öÔµ¹ÊÔÓÉÓ°Ï죺
£¨1£©Attention ´Óµ¥ÏòÄð³ÉË«Ïò
£¨2£©Mask ±ä¶àµ¼ÖÂʹÃü±äµÃºÜÄÑ
£¨3£©ÎªÁË¶ÔÆë ELBO£¬£¬£¬£¬£¬£¬»áÔÚ½»Ö¯ìØÇ°Ãæ³ËÉϼÓȨϵÊý¡£¡£¡£¡£¡£¡£¡£¡£ºÃ±ÈÖ» mask ÁËÒ»¸ö token£¬£¬£¬£¬£¬£¬»áµÈ¼ÛÓÚÖ»ÅÌËãÁËÕâ¸ö token µÄ loss£¬£¬£¬£¬£¬£¬»á´ó·ùÔö´óÕâ¸ö token ¹ØÓÚÌݶȵÄÓ°Ï죬£¬£¬£¬£¬£¬½ø¶øÓ°Ïì grad norm ºÍ loss¡£¡£¡£¡£¡£¡£¡£¡£
ÓÉÓÚÍË»ð attention µÄ·½·¨ÄÑÒÔÎÞаÊÊÅä flash attention µÈ¼Ü¹¹£¬£¬£¬£¬£¬£¬¸ÃÍŶÓÕë¶Ô£¨2£©£¨3£©À´Éè¼Æ warmup Àú³Ì¡£¡£¡£¡£¡£¡£¡£¡£ÏêϸµÄ£¬£¬£¬£¬£¬£¬ÔÚ warmup ½×¶Î½« mask ±ÈÀýÉϽçÖð½¥ warmup µ½×î´óÖµ£¬£¬£¬£¬£¬£¬´Ó¶øÊ¹µÃÒ»×îÏÈʹÃü´ÓÒ×±äÄÑ¡£¡£¡£¡£¡£¡£¡£¡£
![]()
Æä´Î£¬£¬£¬£¬£¬£¬ÔÚ warmup ½×¶ÎÈ¥µô½»Ö¯ìØÖмÓȨµÄϵÊý£¬£¬£¬£¬£¬£¬´Ó¶øÈÃÿ¸ö token ¶Ô loss µÄÓ°Ïì¸üƽÎÈ£º
![]()
Block-wise ½Ø¶ÏµÄÔëÉùµ÷Àí
ÔÚʹÓà block diffusion ʱ£¬£¬£¬£¬£¬£¬ÓÉÓÚͨ¹ý cross attention Æ´½ÓÁËÇå½àµÄǰ׺£¬£¬£¬£¬£¬£¬¿ÉÒÔʹµÃÿ¸ö token ¶¼±¬·¢ÓÐÓÃµÄ loss¡£¡£¡£¡£¡£¡£¡£¡£È»¶øÈôÊÇʹÓùŰåµÄ noise schedule »áʹµÃÓÐЩ¿é²»±¬·¢ loss Ðźţ¬£¬£¬£¬£¬£¬Í¨¹ýÇó½â»ý·Ö¿ÉÒÔËã³ö block ²»±¬·¢ÐźŵĸÅÂÊÈçÏ£¬£¬£¬£¬£¬£¬ÕâÔÚС block ʱ»áÌØÊâÏÔ×Å£º
![]()
Òò´ËÍŶÓ×öÁËÁ½¸öÉè¼Æ£º£¨1£©Ç¿ÖÆÃ¿¸ö¿é¶¼²ÉÑùÒ»¸ö token£¨2£©½« noise ²ÉÑùϽçÉèÖÃΪ 1/B£¬£¬£¬£¬£¬£¬ÕâÑù¿ÉÒÔʹµÃÖÁÉÙÆÚÍû²ÉÑùÒ»¸ö token¡£¡£¡£¡£¡£¡£¡£¡£Í¬Ê±¿ÉÒÔ×èÖ¹Ç¿ÖÆ²ÉÑù 1 ¸ö token Ö®ºó£¬£¬£¬£¬£¬£¬Ô±¾¶ÔÓ¦µÄ t ¹ýС£¬£¬£¬£¬£¬£¬´Ó¶øÊ¹µÃ½»Ö¯ìؼÓȨ¹ý´óµÄÎÊÌâ¡£¡£¡£¡£¡£¡£¡£¡£
![]()
ʵÑéЧ¹û£º¶à¸ö´úÂë benchmark ÔÚ 8B ×óÓÒµÄÄ£×Ó¼á³ÖÁìÏÈ
¹ØÓÚ Base Ä£×Ó
![]()
![]()
![]()
Stable-DiffCoder-8B-Base ÔÚ´úÂëÌìÉú£¬£¬£¬£¬£¬£¬¶à´úÂëÓïÑÔÌìÉú£¬£¬£¬£¬£¬£¬´úÂëÍÆÀíÉÏÌåÏÖ¾«²Ê¡£¡£¡£¡£¡£¡£¡£¡£Áè¼ÝһϵÁÐ AR ºÍ diffusion-based µÄÄ£×Ó¡£¡£¡£¡£¡£¡£¡£¡£ÁíÍâ¿ÉÒÔ·¢Ã÷Ä£×ÓÔÚÏ£º±´úÂëÓïÑÔÉÏ£¨Èç C#£¬£¬£¬£¬£¬£¬PHP µÈ£¬£¬£¬£¬£¬£¬Ô¤ÑµÁ·ÖÐÊý¾Ý½ÏÉÙ£©£¬£¬£¬£¬£¬£¬Ïà±ÈÓÚ AR baseline »ñµÃÁË´ó·ùÔöÇ¿£¬£¬£¬£¬£¬£¬¿ÉÒÔ֤ʵ DLLM µÄѵÁ·Àú³ÌÆðµ½ÁËÒ»¶¨µÄÊý¾ÝÔöÇ¿µÄЧ¹û¡£¡£¡£¡£¡£¡£¡£¡£Í¬Ê±ÔÚ´úÂëÍÆÀíÄÜÁ¦ÉÏÒ²»ñµÃÁËÔöÇ¿¡£¡£¡£¡£¡£¡£¡£¡£
¹ØÓÚ Instruct Ä£×Ó
Stable-DiffCoder-8B-Instruct ÔÚ´úÂëÌìÉú£¬£¬£¬£¬£¬£¬´úÂë±à¼£¬£¬£¬£¬£¬£¬´úÂëÍÆÀíµÈʹÃüÉÏ×öÁË×ÛºÏÆÀ²â£¬£¬£¬£¬£¬£¬²¢ÓÐ×ÅÓÅÔ½µÄÌåÏÖ¡£¡£¡£¡£¡£¡£¡£¡£ÆäÖÐÔÚ³£ÓõÄʹÃü£¨humaneval£¬£¬£¬£¬£¬£¬mbpp£©ÉÏ´ó·ùÁè¼ÝÔÓÐ AR baseline ºÍÆäËû 8B ×óÓÒµÄ DLLM model¡£¡£¡£¡£¡£¡£¡£¡£ÔÚ²âÊÔ¼¯±ÕÔ´µÄ MHPP µÖ´ï qwen32B µÄˮƽ£¬£¬£¬£¬£¬£¬BigCodeBench ÉϸüÊÇÁè¼ÝһϵÁÐÄ£×Ó²¢½ö´ÎÓÚ DeepSeek236B µÄÄ£×Ó¡£¡£¡£¡£¡£¡£¡£¡£Í¬Ê±ÔÚ´úÂë±à¼ CanItEdit ʹÃüÉϸüÊÇÓÐמªÑÞµÄЧ¹û¡£¡£¡£¡£¡£¡£¡£¡£
![]()
![]()
![]()
![]()
![]()
×ܽáÓëÕ¹Íû
Stable-DiffCoder µÄÐû²¼£¬£¬£¬£¬£¬£¬Í»ÆÆÁË ¡¸À©É¢Ä£×ÓÖ»ÄÜ×ö²¢ÐмÓËÙ¡¹ µÄ¿Ì°åÓ¡Ï󡣡£¡£¡£¡£¡£¡£¡£Ëü֤ʵÎú£ºÀ©É¢ÑµÁ··¶Ê½×Ô¼º¾ÍÊÇÒ»ÖÖ¼«¼ÑµÄ±íÕ÷ѧϰÊֶΡ£¡£¡£¡£¡£¡£¡£¡£Í¨¹ýºÏÀíµÄ¿Î³ÌÉè¼Æ¼°ÎȹÌÐÔÓÅ»¯£¬£¬£¬£¬£¬£¬À©É¢Ä£×ÓÍêÈ«¿ÉÒÔÔÚ´úÂëÃ÷È·ºÍÌìÉúÖÊÁ¿ÉÏÓâÔ½¹Å°åµÄ AR Ä£×Ó¡£¡£¡£¡£¡£¡£¡£¡£
¹ØÓÚδÀ´µÄ´óÄ£×ÓÑݽø£¬£¬£¬£¬£¬£¬Sta28jndÕ¹Íûpcble-DiffCoder ÌáÐÑÁËÒ»Ìõз¾¶£ºÒ²ÐíÎÒÃDz»ÐèÒªÑïÆú AR£¬£¬£¬£¬£¬£¬¶øÊǽ« AR ×÷Ϊ¸ßЧµÄ֪ʶѹËõÆ÷£¬£¬£¬£¬£¬£¬ÔÙʹÓà Diffusion ×÷Ϊ ¡¸Ç¿»¯¼Á¡¹£¬£¬£¬£¬£¬£¬½øÒ»²½ÍƸßÄ£×ÓµÄÖÇÄÜÉÏÏÞ¡£¡£¡£¡£¡£¡£¡£¡£

