• ? µçÄÔ°æ-ÃÃ×Ó΢ÐŶþάÂëͼƬ-ÀîÑÇÅôØòÀëºó·³ÐÄÊÂÒ»¶Ñ£º£º£ºÅ®¶ù´óÁ˹ܲ»ÁË£¬£¬£¬ÐÂÈüµÀ±»Ö¸¿ÓÃÉÓÕÆ­

    ¿ªÔªÓÎÏ·´óÌüapp

    ɽÎ÷ÐÂÎÅÍø

    ×îÐÂAPP

    ÈÈÃÅAPP

    • µ¶ÇÐÂøÍ·£¡£¡£¡¡°´ó¿Ú¾¶ÐÂÐÍ»ðÅڰ桱054AÁÙ·Ú½¢¶õ¶û¶à˹½¢¹ÙÐûÁÁÏà

      ¼ß-35ûÓв൯²ÖÈ´¼ÓËÙÁ¿²ú£¬£¬£¬¾¿¾¹´æ²»±£´æÖØ´óÕ½Êõ¶Ì°å£¿

      ÕâÊÇÒ»¸ö¹ØÓÚ AI µ×²ãÂß¼­Öع¹µÄʱ¿Ì¡£ºã¾ÃÒÔÀ´£¬£¬£¬Transformer ¼Ü¹¹±»À§ÔÚÒ»¸öÌÚ¹óµÄã£ÂÛÖУº£º£ºÎÒÃÇÓÃ×Å×îÏȽøµÄ GPU ËãÁ¦£¬£¬£¬È¥Èà AI Ä£×Ó " ËÀ¼ÇÓ²±³ " ÄÇЩ²é×Öµä¾ÍÄÜÖªµÀµÄ¾²Ì¬ÖªÊ¶¡£DeepSeek ÁºÎÄ·æÍŶÓÓëÆä±±´óÏàÖúÕßÔÚ½ñÈÕÆÆÏþÐû²¼µÄÖØ°õÂÛÎÄ¡¶Conditional Memory via Scalable Lookup¡·£¬£¬£¬³¹µ×Í»ÆÆÁËÕâÒ»½©¾Ö¡£ËûÃÇÌá³öÁËÒ»ÖÖȫеÄEngram£¨Ó¡¼££©Ä£¿é£¬£¬£¬ÔڹŰåµÄ " Ìõ¼þÅÌËã "£¨MoE£©Ö®Í⣬£¬£¬¿ª·¢Á˵ڶþÌõÏ£º±»¯Õ½Ïß¡ª¡ª" Ìõ¼þÓ°Ïó "¡£Õâ²»µ«ÊÇÒ»´ÎÊÖÒÕÐÞ²¹£¬£¬£¬¶øÊÇÒ»³¡¹ØÓÚÄ£×Ó " ÄÔÈÝÁ¿ " µÄ¹©Ó¦²àˢС£Ëü֤ʵÎú£º£º£ºµ±ÎÒÃǽ« " Ó°Ïó " ´Ó " ÅÌËã " ÖаþÀ룬£¬£¬°Ñ¸Ã±³µÄ½»¸ø " ×Öµä "£¬£¬£¬°Ñ¸ÃËãµÄ½»¸ø´óÄÔ£¬£¬£¬AI µÄÍÆÀíÄÜÁ¦½«Ó­À´·´Ö±¾õµÄ±¬·¢Ê½ÔöÌí¡£DeepSeek ÍýÏëÔÚ 2 Ô´º½ÚǰºóÕýʽÐû²¼ V4£¬£¬£¬¶øÕâÒ»¿Ì»òÐí¾ÍÊÇ DeepSeek V4 ½µÉúµÄǰҹ¡£ ÐòÕ£º£º£ºÁù²ãÉñ¾­ÍøÂçµÄ " ÎÞÓù¦ "¹ÊÊÂµÄÆðµã£¬£¬£¬Ô´ÓÚ DeepSeek ÍÅ¶Ó¶Ô Transformer ÄÚ²¿ÔË×÷»úÖÆµÄÒ»´Î " ºË´Å¹²Õñ " ɨÃè¡£ÔÚÈ˹¤ÖÇÄܵĺںÐ×ÓÀ£¬£¬µ±´óÄ£×Ó¿´µ½ "Diana, Princess of Wales"£¨´÷°²ÄÈ£¬£¬£¬Íþ¶ûÊ¿Íõåú£©Õâ¸ö¶ÌÓïʱ£¬£¬£¬ËüµÄÄÚ²¿±¬·¢ÁËÒ»³¡ÁîÈ˷ѽâÇÒ¼«ÆäÌÚ¹óµÄ " ÄÚÚ§ "¡£Ñо¿Ö°Ô±·¢Ã÷£¬£¬£¬ÎªÁËʶ±ðÕâ¸öÀο¿µÄʵÌ壬£¬£¬Ä£×Ó¾¹È»¶¯ÓÃÁËÕûÕû 6 ²ãÍøÂ磺£º£ºµÚ 1-2 ²ã£º£º£ºÄ£×Ó»¹ÔÚ×ÁÄ¥ "Wales" »òÐíÊÇÒ»¸ö¹ú¼Ò£»£»µÚ 3 ²ã£º£º£ºËüÒâʶµ½ÕâÊÇÅ·ÖÞµÄÒ»¸öµØÀí¿´·¨£»£»µÚ 4 ²ã£º£º£ºËü×îÏÈÆ´¼¯³ö "Princess of Wales" ËÆºõÊÇÒ»¸öÍ·ÏΣ»£»µÚ 5 ²ã£º£º£ºËüåÚÏëµ½ÁË " Íþ¶ûÊ¿Ç×ÍõµÄÆÞ×Ó "£»£»µÚ 6 ²ã£º£º£ºÖ±µ½ÕâÀ£¬£¬Ëü²ÅÖÕÓÚÈ·ÈÏ£¬£¬£¬ÕâÊÇÖ¸ÄÇÎ»ÖøÃûµÄ " ´÷°²ÄÈÍõåú "¡£ÔÚһλ׷Çó¼«ÖÂЧÂʵļܹ¹Ê¦ÑÛÖУ¬£¬£¬Õâ¼òÖ±ÊÇËãÁ¦µÄ±©éåÌìÎï¡£" ´÷°²ÄÈÍõåú " ÊÇÒ»¸ö¿Í¹Û±£´æµÄ¡¢¡¢¾²Ì¬µÄʵÌ壬£¬£¬Ëü²»»áÓÉÓÚÉÏÏÂÎĵÄת±ä¶ø¸Ä±äÆäʵÖÊ¡£ÎªÁËÌáÈ¡Õâ¸öÔ­À´²é×Öµä¾ÍÄÜÖªµÀµÄÊÂʵ£¬£¬£¬Transformer ¾¹È»¶¯ÓÃÁËÕûÕû 6 ²ãÉî¶ÈµÄÌÚ¹ó¾ØÕóÔËËãÈ¥ " ÖØÐÞ " Õâ¸ö¿´·¨¡£Õâ¾ÍÏñÊÇÒ»¸ö¾øÊÀÌì²Å£¬£¬£¬ÔÚÈ¥½â¾ö΢»ý·ÖÄÑÌâ֮ǰ£¬£¬£¬Ã¿´Î¶¼µÃÏÈ»¨°ëСʱĬдһ±é¾Å¾Å³Ë·¨±í¡£ ÕâÖÖ " ÒþʽӰÏó " µÄ»úÖÆ£¬£¬£¬ÆÈʹģ×Ó½«Ãû¹óµÄ²ÎÊýÈÝÁ¿ºÍÍøÂçÉî¶È£¬£¬£¬ÆÌÕÅÔÚÁ˼òÆÓµÄģʽƥÅäÉÏ¡£DeepSeek ÔÚÕâÆª³¤´ï 33 Ò³µÄÂÛÎÄÖУ¬£¬£¬Ìá³öÁËÒ»¸öÖ±»÷Áé»êµÄ¿½ÎÊ£º£º£ºÎªÊ²Ã´²»Ö±½Ó¸ø´óÄ£×ÓÅäÒ»±¾¿ÉÒÔËæ²éËæÓÃµÄ " ³¬µÈ×Öµä "£¿ µÚÒ»Õ£º£º£º¼Ü¹¹ÖØËÜ¡ª¡ª Engram Ä£¿éµÄ±©Á¦ÃÀѧΪÏàʶ¾öÕâ¸öÎÊÌ⣬£¬£¬DeepSeek Ìá³öÁËÒ»ÖÖÃûΪ "Engram£¨Ìõ¼þÓ°Ïó£©" µÄÈ«ÐÂÄ£¿é¡£ÈôÊÇ˵ MoE£¨»ìÏýר¼ÒÄ£×Ó£©ÊÇ°Ñ " ´óÄÔ " ·Ö³ÉÁ˲î±ðµÄÇøÓò£¬£¬£¬Èòî±ðµÄר¼ÒÈÏÕæ²î±ðµÄ˼Ë÷£¨Ìõ¼þÅÌË㣩£»£»ÄÇô Engram ¾ÍÊǸø´óÄÔÍâ¹ÒÁËÒ»¸öÖØ´óµÄ " º£ÂíÌå "£¬£¬£¬×¨ÃÅÈÏÕæ´æ´¢¾²Ì¬ÖªÊ¶£¨Ìõ¼þÓ°Ï󣩡£1. ¸´Éú "N-gram"£º£º£º´Ó¹ÅÀÏÖÇ»ÛÖÐѰÕÒÃÕµ×Engram µÄ½¹µãÁé¸Ð£¬£¬£¬¾¹È»À´×ÔÓÚ NLP£¨×ÔÈ»ÓïÑÔ´¦Àí£©ÁìÓòµÄ " ÉϹÅÉñÆ÷ " ¡ª¡ª N-gram¡£ÔÚÉî¶ÈѧϰͳÖÎÌìÏÂ֮ǰ£¬£¬£¬ÎÒÃǾÍÊÇ¿¿Í³¼Æ "N ¸ö´Êͬʱ·ºÆðµÄ¸ÅÂÊ " À´Ã÷È·ÓïÑԵġ£DeepSeek ½«ÕâÒ»¾­µä¿´·¨¾ÙÐÐÁËÏÖ´ú»¯µÄħ¸Ä£º£º£º¹Å°åµÄ Transformer£º£º£ºÖªÊ¶ÊèÉ¢ÔÚÉñ¾­ÔªµÄÈ¨ÖØ£¨Weights£©À£¬£¬Ìáȡ֪ʶÐèÒª¾­ÓÉÖØ´óµÄÏßÐÔ²ãÅÌË㣬£¬£¬ÖØÆ¯ºó¸ß¡£Engram Ä£¿é£º£º£ºËüÊÇÒ»¸öÖØ´óµÄ¡¢¡¢¿ÉÀ©Õ¹µÄǶÈë±í£¨Embedding Table£©¡£µ±Ä£×Ó¶Áµ½ " ÕÅÖÙ¾° " »òÕß " ËÄ´ó·¢Ã÷ " ÕâÖÖÀο¿´îÅ䣨N-gram£©Ê±£¬£¬£¬²»ÐèÒª¶¯ÓôóÄÔÆ¤²ãÈ¥ÍÆÀí£¬£¬£¬Ö±½Óͨ¹ý¹þÏ£Ë÷Òý£¬£¬£¬ÔÚÄÚ´æ±íÖÐ " ²é " ³ö¶ÔÓ¦µÄÏòÁ¿¡£ÕâÒ»Àú³ÌµÄʱ¼äÖØÆ¯ºóÊÇO ( 1 ) ¡ª¡ªÕâÒâζ×ÅÎÞÂÛ֪ʶ¿âÅòÕ͵½¶à´ó£¨ÄÄÅÂÊÇ 1000 ÒÚ²ÎÊý£©£¬£¬£¬²éÕÒËÙÂÊÏÕЩÎȹÌ£¬£¬£¬ÇÒ¼«¿ì¡£2. Èý´óÊÖÒÕ»¤³ÇºÓ¼ÈÈ»²é±íÕâôºÃ£¬£¬£¬ÎªÊ²Ã´ÒÔǰûÈË×ö£¿ÓÉÓÚÓÐÈý¸öÀ¹Â·»¢£º£º£º´æ´¢±¬Õ¨¡¢¡¢¶àÒå´Ê³åÍ»¡¢¡¢²ÎÊý·ÖÅä¡£DeepSeek ¸ø³öÁ˽̿ÆÊé¼¶µÄ½â¾ö·½°¸£º£º£ºA. ´Ê±íѹËõ£º£º£º¼«ÖµÄÈ¥ÖØÌìÏÂÉϵĴÊ×é×éºÏÊÇÌìÎÄÊý×Ö¡£DeepSeek Ê×ÏÈ×öÁËÒ»²½ " ÎÞËðѹËõ "¡£ÔÚ·Ö´ÊÆ÷£¨Tokenizer£©²ãÃæ£¬£¬£¬Ëü½«ÓïÒåÏàͬµ«Ð´·¨²î±ðµÄ´Ê¾ÙÐÐÁ˹éÒ»»¯¡£ÀýÈ磬£¬£¬"Apple"£¨Ê××Öĸ´óд£©ºÍ "apple"£¨Ð¡Ð´£©ÔÚÓïÒåÉÏͨ³£Ö¸Í³Ò»¸ö¹¤¾ß¡£Í¨¹ýÓ³ÉäºÏ²¢£¬£¬£¬ÓÐÓôʱíÖ±½ÓËõСÁË 23%¡£Õâ²»µ«½ÚÔ¼Á˿ռ䣬£¬£¬¸üÈÃ֪ʶµÄÃܶȴó·ùÌáÉý¡£B. ¶àÍ·¹þÏ££º£º£º½â¾ö " ¹þÏ£³åÍ» "²»¿ÉÄܰÑËùÓÐ N-gram ¶¼´æÏÂÀ´¡£Engram ʹÓÃÁË " ¶àÍ·¹þÏ££¨Multi-Head Hashing£©" ÊÖÒÕ¡£Í¨¹ý¶à¸ö¹þÏ£º£º£º¯Êý£¬£¬£¬½«ÎÞÏÞµÄ N-gram Ó³Éäµ½ÓÐÏÞµÄÄÚ´æ²ÛλÖС£ËäÈ»»áÓйþÏ£³åÍ»£¨¼´Á½¸ö²î±ðµÄ´Ê±»Ó³Éäµ½ÁËͳһ¸öλÖã©£¬£¬£¬µ«Í¨¹ý " ¶àÍ· " Éè¼Æ£¬£¬£¬Ä£×Ó¿ÉÒÔ´Ó¶à¸öºòѡЧ¹ûÖÐÆ´¼¯³ö׼ȷµÄÐÅÏ¢£¬£¬£¬¼«´óµØÌá¸ßÁ˳°ôÐÔ¡£C. ÉÏÏÂÎÄÃſأº£º£º¸øÓ°ÏóÅä¸ö " ²ÃÅÐ "ÕâÊÇ×ÃîµÄÒ»±Ê¡£²é±íÊÇËÀµÄ£¬£¬£¬ÓïÑÔÊÇ»îµÄ¡£ºÃ±È " Æ»¹û " Õâ¸ö´Ê¡£ÔÚ " ³ÔÆ»¹û " µÄÓᄈϣ¬£¬£¬Ëüָˮ¹û£»£»ÔÚ " Æ»¹ûÐû²¼»á " µÄÓᄈϣ¬£¬£¬ËüÖ¸¿Æ¼¼¹«Ë¾¡£Ö±½Ó²é±í¿ÉÄÜ»áÒýÈëÔëÉù¡£DeepSeek Éè¼ÆÁËÒ»¸ö " ÉÏÏÂÎĸÐÖªÃÅ¿Ø "£¨Context-aware Gating£©¡£Query£¨ÅÌÎÊ£©£º£º£ºÄ¿½ñÉÏÏÂÎĵÄÒþ²Ø×´Ì¬£¨Hidden State£©¡£Key/Value£¨¼üÖµ£©£º£º£º²é±í»ñµÃµÄ¾²Ì¬ÏòÁ¿¡£Õâ¸öÃſؾÍÏñÒ»¸ö²ÃÅС£ÈôÊDzé³öÀ´µÄ " ¾²Ì¬ÖªÊ¶ " ºÍÄ¿½ñµÄ " ÉÏÏÂÎÄ " ²»´î£¬£¬£¬²ÃÅоͻá°ÑÈ¨ÖØÑ¹µÍ£¨Gate ÖµÇ÷Ïò 0£©£¬£¬£¬ÈÃÄ£×ÓºöÂÔÕâ¸öÔëÉù£»£»ÈôÊÇÍêÉÆÆõºÏ£¨ºÃ±È " É˺®ÔÓ²¡ÂÛ " ºóËæ×Å " ÕÅÖÙ¾° "£©£¬£¬£¬²ÃÅоͻá°Ñ´óÃÅ·­¿ª£¨Gate ÖµÇ÷Ïò 1£©£¬£¬£¬Ö±½Ó°Ñ֪ʶעÈëÄ£×Ó¡£ µÚ¶þÕ£º£º£º»Æ½ð±ÈÀý¡ª¡ª·¢Ã÷ AI Ä£× "U ÐÍÇúÏß "¼Ü¹¹Éè¼ÆºÃÁË£¬£¬£¬½ÓÏÂÀ´µÄÎÊÌâÊÇ£º£º£ºÔõô·Ö¾Ó²ú£¿¼ÙÉèÎÒÃÇÏÔ¿¨ÀïµÄÏÔ´æÊÇÓÐÏ޵쬣¬£¬×ܲÎÊýÔ¤ËãÒ²ÊÇÀο¿µÄ¡£ÎÒÃÇÓ¦¸Ã°Ñ¼¸¶à²ÎÊý·ÖÅ䏸 MoE µÄ " ר¼Ò "£¨ÈÏÕæÅÌË㣩£¬£¬£¬¼¸¶à²ÎÊý·ÖÅ䏸 Engram µÄ " ×Öµä "£¨ÈÏÕæÓ°Ï󣩣¿ÕâÊÇÒ»¸öµä·¶µÄ×ÊÔ´ÉèÖò©ÞÄ¡£DeepSeek ÍŶӾÙÐÐÁËÒ»³¡´ó¹æÄ£µÄÏûÈÚʵÑ飬£¬£¬É¨ÃèÁË´Ó 0% µ½ 100% µÄ·ÖÅä±ÈÀý£¬£¬£¬Ð§¹û»­³öÁËÒ»ÌõÍêÉÆµÄ "U ÐÍ Scaling Law ÇúÏß "¡£ÕâÕÅͼչÏÖÁË AI Ä£×ÓÉè¼ÆµÄµ×²ã¼ÍÂÉ£º£º£º×ó²à¼«¶Ë£¨´¿ Engram£©£º£º£ºÈôÊǰѲÎÊýÈ«¸ø×ֵ䣬£¬£¬Loss ºÜ¸ß¡£ÓÉÓÚÄ£×ÓÄð³ÉÁË " Êé°×³Õ "£¬£¬£¬¹âÓÐËÀ¼ÇÓ²±³£¬£¬£¬Ã»ÓÐÂß¼­ÍÆÀíÄÜÁ¦¡£ÓҲ༫¶Ë£¨´¿ MoE£©£º£º£ºÈôÊǰѲÎÊýÈ«¸ø×¨¼Ò£¬£¬£¬Loss Ò²ºÜ¸ß¡£ÓÉÓÚר¼ÒÃDZ»ÆÈ°Ñ¾«Éñ¶¼»¨ÔÚ±³Ê飨ӰÏó¾²Ì¬ÖªÊ¶£©ÉÏ£¬£¬£¬Ã»¿Õ¸ÉÕýÊ¡£»£»Æ½ðÖ§½âµã£¨¦Ñ ¡Ö 75%-80%£©£º£º£ºµ±ÎÒÃǽ«Ô¼20%-25% µÄÏ£º±²ÎÊýÔ¤Ëã·Ö¸ø Engram£¬£¬£¬Ê£Ïµĸø MoE ʱ£¬£¬£¬Ä£×ÓµÄÑéÖ¤¼¯ Loss ½µµ½ÁË×îµÍµã¡£ÕâÊÇÒ»¸ö¼«¾ßÖ¸µ¼ÒâÒåµÄ·¢Ã÷£º£º£º¹ØÓÚ¼¸°ÙÒÚ²ÎÊýµÄ´óÄ£×ÓÀ´Ëµ£¬£¬£¬´¿´â¶ÑÆöÅÌË㵥루MoE ר¼Ò£©ÒѾ­ÊDZ߼ÊЧӦµÝ¼õÁË£¬£¬£¬±ØÐèÒýÈëרÃŵľ²Ì¬Ó°ÏóÄ£¿éÀ´ÊµÏÖ " ´æËãÆ½ºâ "¡£ µÚÈýÕ£º£º£º·´Ö±¾õµÄ±¬·¢¡ª¡ªÎªÊ²Ã´ " ²é×Öµä " ÄÜÌá¸ß " ÊýѧЧ¹û "£¿ÈôÊÇ Engram ½ö½öÊÇÈÃÄ£×Ó " ¼ÇÐÔ¸üºÃ "£¬£¬£¬ÕâÆªÂÛÎĵķÖÁ¿»¹È±·¦ÒÔÕð¾ªÉçÇø¡£ÊÂʵ£¬£¬£¬RAG£¨¼ìË÷ÔöÇ¿ÌìÉú£©Ò²Äܽâ¾ö֪ʶÎÊÌâ¡£ÕæÕýÈÃÒµ½ç¸ÐÓ¦Õ𺳵Ä£¬£¬£¬ÊÇʵÑéЧ¹ûÖÐÄÇЩÒâÁÏÖ®ÍâµÄÊÕÒæ¡£DeepSeek ¹¹½¨ÁËÈý¸ö±ÈÕÕÄ£×Ó£¬£¬£¬ÑÏ¿á¿ØÖÆ¼¤»î²ÎÊýÄ¿£¨3.8B£©ºÍѵÁ·Êý¾ÝÁ¿£¨262B tokens£©ÍêȫһÖ£º£º£ºDense-4B£º£º£º¹Å°åµÄŨÃÜÄ£×Ó¡£MoE-27B£º£º£º´¿ MoE Ä£×Ó£¨72 ¸öר¼Ò£©¡£Engram-27B£º£º£º»ìÏýÄ£×Ó£¨55 ¸öר¼Ò + 5.7B Engram ²ÎÊý£©¡£Ð§¹ûÁîÈË´óµøÑÛ¾µ£º£º£º1. ÒâÁÏÖ®ÖУº£º£ºÖªÊ¶ÀàʹÃü°Ô°ñÔÚ MMLU£¨×ÛºÏ֪ʶ£©ÉÏ£¬£¬£¬Engram Ä£×ÓÌáÉýÁË3.4 ·Ö£»£»ÔÚ CMMLU£¨ÖÐÎÄ֪ʶ£©ÉÏ£¬£¬£¬ÌáÉýÁË4.0 ·Ö¡£ÕâºÜºÃÃ÷È·£¬£¬£¬Íâ¹ÒÁË×ֵ䣬£¬£¬ÖªÊ¶×ÔÈ»¸üºÃÁË£¬£¬£¬»Ã¾õ¸üÉÙÁË¡£2. ÒâÁÏÖ®Í⣺£º£ºÂß¼­¡¢¡¢´úÂë¡¢¡¢ÊýѧÖÜÈ«±©Õǰ´Àí˵£¬£¬£¬" ²é×Öµä " ºÍ " ×öÊýѧÌâ " û¹ØÏµ¡£µ«ÔÚ BBH£¨×ÛºÏÍÆÀí£©ÉÏ£¬£¬£¬Engram-27B ¾¹È»±Èͬ²ÎÊýµÄ´¿ MoE »ùÏßÌáÉýÁËÕûÕû5.0 ·Ö£¡£¡£¡MATH£¨Êýѧ£©£º£º£ºÌáÉý2.4 ·Ö¡£HumanEval£¨´úÂëÌìÉú£©£º£º£ºÌáÉý3.0 ·Ö¡£ARC-Challenge£¨ÖØ´óÍÆÀí£©£º£º£ºÌáÉý3.7 ·Ö¡£3. Éî¶ÈÆÊÎö£º£º£ºÓÐÓÃÉî¶È£¨Effective Depth£©ÀíÂÛΪʲô£¿Ò»¸ö " ËÀ¼ÇÓ²±³ " µÄÄ£¿é£¬£¬£¬ÎªÊ²Ã´ÄÜÌá¸ßÖÇÉÌ£¿DeepSeek ÍŶÓʹÓÃLogitLensºÍ "CKA£¨ÖÐÐÄºË¶ÔÆë£©" ÊÖÒÕ£¬£¬£¬¶ÔÄ£×ÓÄÚ²¿¾ÙÐÐÁË " ÆÊ½â "¡£ËûÃÇ·¢Ã÷ÁËÒ»¸ö¾ªÈ˵ÄÕ÷Ï󣺣º£º»¹¼ÇµÃ¿ªÍ·µÄ " ´÷°²ÄÈÍõåú " Âð£¿ÔÚ´¿ MoE Ä£×ÓÖУ¬£¬£¬Ç°¼¸²ãÍøÂç¶¼ÔÚæ×Å " Æ´¼¯¿´·¨ "¡£¶øÔÚ Engram Ä£×ÓÖУ¬£¬£¬ÓÉÓÚµÚ 2 ²ã¾Í²åÈëÁË Engram Ä£¿é£¬£¬£¬¾²Ì¬ÖªÊ¶µÄ¼ìË÷ÔÚ¼«ÔçµÄ½×¶Î¾ÍÍê³ÉÁË¡£ÕâÒâζ×Å£¬£¬£¬Ô­±¾ÓÃÓÚ " ËÀ¼ÇÓ²±³ " µÄǰ¼¸²ãÍøÂç±»½â·ÅÁË£¡£¡£¡ÕâÏ൱ÓÚ¸øÄ£×Ó " ÐéÔö " ÁËÉî¶È¡£ ÄÇЩ±»ÊͷųöÀ´µÄÍøÂç²ãºÍ×¢ÖØÁ¦Í·£¨Attention Heads£©£¬£¬£¬²»ÔÙÐèÒª´¦ÀíààËյľֲ¿ÒÀÀµ£¨ºÃ±Èʶ±ð " ÕÅÖÙ¾° " ÊÇË­£©£¬£¬£¬´Ó¶ø¿ÉÒÔÈ«Éñ¹á×¢µØÍ¶Èëµ½¸üÖØ´óµÄÈ«¾ÖÍÆÀí¡¢¡¢³¤³ÌÂß¼­¹¹½¨ºÍ´úÂëÂß¼­ÌìÉúÖÐÈ¥¡£Engram µÄʵÖÊ£¬£¬£¬²»ÊÇ " Ìæ»» " ÍÆÀí£¬£¬£¬¶øÊÇͨ¹ý " ·ÖÁ÷ " Ôӻ£¬£¬ÈôóÄÔרעÓÚ¸ü¸ßά¶ÈµÄ˼Ë÷¡£ µÚËÄÕ£º£º£º¹¤³ÌÆæ¼£¡£¡£¡ª¡ªÍ»ÆÆÓ¢Î°´ïµÄ " ÏÔ´æ°ÔȨ "¹ØÓÚ»ª¶û½ÖµÄͶ×ÊÕߺÍËãÁ¦ÖÐÐĵÄÔËάÕßÀ´Ëµ£¬£¬£¬ÕâÆªÂÛÎÄ×îÐԸеĵط½²»ÔÚÓÚ Score£¬£¬£¬¶øÔÚÓÚCost£¨±¾Ç®£©¡£ÔÚ AI ʱ´ú£¬£¬£¬×îÌÚ¹óµÄ×ÊÔ´²»ÊÇËãÁ¦£¨FLOPs£©£¬£¬£¬¶øÊÇÏԴ棨HBM£©¡£Ó¢Î°´ï H100 Ö®ÒÔÊǹ󣬣¬£¬ºÜºéÁ÷ƽÉÏÊÇÓÉÓÚÄÇϡȱµÄ HBM3e ÄÚ´æ¡£¶ø Engram ´øÀ´ÁËÒ»¸öÇ㸲ÐÔµÄÌØÕ÷£º£º£º³¹µ×µÄ´æËãÊèÉ¢¡£1. MoE µÄÍ´µã£º£º£ºÏÔ´æÍÌÊÉÕ߹ŰåµÄ MoE Ä£×Ó£¬£¬£¬Æä·ÓÉ»úÖÆ£¨Routing£©ÊǶ¯Ì¬µÄ¡£Ä£×Ó±ØÐèÏÈËã³öÄ¿½ñ Token µÄÌØÕ÷£¬£¬£¬ËãÍêÕâÒ»²ã£¬£¬£¬²ÅÖªµÀÏÂÒ»²ã¸ÃÕÒÄĸöר¼Ò¡£ÕâÒâζ×Å£¬£¬£¬ËùÓеÄר¼ÒÄ£×Ó±ØÐèʱ¿ÌÔÚÌÚ¹óµÄ GPU ÏÔ´æÀï´ýÃü£¬£¬£¬Ëæ½ÐËæµ½¡£2. Engram µÄÍ»ÆÆ£º£º£ºÈ·¶¨µÄÔ¤ÖªEngram µÄ²é±íÂß¼­ÊÇÈ·¶¨ÐԵġ£Ö»ÒªÊäÈëµÄÎı¾È·¶¨ÁË£¨ºÃ±È "A New Axis of Sparsity"£©£¬£¬£¬ÄÇôËü¶ÔÓ¦µÄ N-gram Ë÷Òý¾ÍÈ·¶¨ÁË¡£ÎÒÃÇ»ù´¡²»ÐèÒªµÈÄ£×ÓËãÍêǰһ²ã£¬£¬£¬ÔÚ Token ½øÈëÄ£×ÓµÄÄÇһ˲¼ä£¬£¬£¬ÎÒÃǾÍÖªµÀËüÐèÒª²éÄÄÕűíµÄÄÄÒ»ÐС£3. CPU µÄÄæÏ®£º£º£º°Ñ´óÄ£×ÓÈû½øÄÚ´æÌõÕâÒ»ÌØÕ÷´øÀ´ÁËÖØ´óµÄ¹¤³ÌÓ¯Àû£º£º£ºÐ¶ÔØ£¨Offload£©£º£º£ºÎÒÃÇ¿ÉÒ԰Ѽ¸°ÙÒÚ¡¢¡¢ÉõÖÁÉÏǧÒÚ²ÎÊýµÄ Engram ´Ê±í£¬£¬£¬Ö±½ÓÈÓµ½×ÔÖÆ¡¢¡¢Á¿´ó¡¢¡¢Ò×À©Õ¹µÄ "CPU Äڴ棨DRAM£©" À£¬£¬ÉõÖÁ·ÅÔÚ NVMe SSD ÉÏ¡£Ô¤È¡£¨Prefetching£©£º£º£ºÔÚ GPU Æ´ÃüÅÌËãǰһ²ã Transformer µÄʱ¼ä£¬£¬£¬CPU ʹÓà PCIe ͨµÀ£¬£¬£¬Òì²½µØ°ÑÏÂÒ»²ãÐèÒªµÄÓ°ÏóÊý¾Ý " Ԥȡ " ³öÀ´£¬£¬£¬ÍÆË͵½ GPU¡£ÑÚÊÎÑÓ³Ù£¬£¬£¬²¢Ðд¦Àí¡£DeepSeek ʵ²âÊý¾ÝÏÔʾ£º£º£º×ÝÈ»¹ÒÔØÁË100B£¨Ç§ÒÚ£©²ÎÊýµÄ Engram ±íµ½ CPU Äڴ棬£¬£¬Ïà±ÈÓÚ´¿ GPU ÍÆÀí£¬£¬£¬ÍÌÍÂÁ¿µÄϽµ²»µ½ 3%¡£ÕâÊÇÒ»¸öÈÃËùÓÐÓÉÓÚÂò²»µ½ HBM ¶ø½¹ÂǵÄÈË¿ñϲµÄ½áÂÛ¡£ÕâÒâζ×Å£¬£¬£¬Î´À´µÄ´óÄ£×Ó£¬£¬£¬" Ó°ÏóÈÝÁ¿ " ¿ÉÒԵͳÉÍâµØÎÞÏÞÀ©ÕÅ£¬£¬£¬¶ø²»±Ø±»Ó¢Î°´ïµÄÏԴ濨²±×Ó¡£ µÚÎåÕ£º£º£º³¤Îı¾µÄʤÀû¡ª¡ª NIAH ²âÊÔµÄÔ¾Éý³ýÁËͨÓÃÍÆÀí£¬£¬£¬Engram ÔÚ³¤Îı¾£¨Long Context£©ÁìÓòµÄÌåÏÖͬÑù֤ʵÎú " ·Ö¹¤ " µÄ¼ÛÖµ¡£ÔÚ³¤Îı¾´¦ÀíÖУ¬£¬£¬×¢ÖØÁ¦»úÖÆ£¨Attention£©µÄ´°¿ÚÊÇÓÐÏ޵ġ£ÈôÊÇ×¢ÖØÁ¦±»´ó×ڵľֲ¿ÐÅÏ¢£¨ÈçÀο¿¶Ì

      ÏÂÔØ

    • ÁãÊÛÐÂÇ÷ÊÆ£¬£¬£¬²ØÔÚ¡°ºÐÇø·¿¡±Àï

      ×ÅÃû¿ç¹úÒ©ÆóÈñ¿µµÏÍ˳öÖйúÊг¡

      ÈÕ±¾¼±Ñ°Ï¡ÍÁÌæ»»£¬£¬£¬ÄÑÔÚÄĶù£¿

      ÏÂÔØ

    • ÀÏÏ缦£¬£¬£¬×ż±·É³ö°²»Õ
    • ¡°°ÑÊÖ´Ó¸ñÁêÀ¼µºÉÏÄÿª¡±£º£º£ºµ¤Âó¶à¸ö¶¼»á½«¾ÙÐз´ÃÀʾÍþ

      ãÆÑ§¾§µÄÖÂǸÐÅдµÃºÜºÃ£¬£¬£¬µ«±£´æÒ»¸öÖÂÃüÎÊÌâ

      пîÈÕ²úZÁÁÏàÐÂÔöÊÖ¶¯±äËÙÏäǰÁ³Î¢µ÷

      ÏÂÔØ

    • »ªÎª¸ß¼¶ÕÕÁÏÌïÌΣº£º£ºÆóÒµ³ÉÊìµÄ±ê¼Ç¡ª¡ª°ÑÊ×´´È˹ؽøÖƶȵÄÁý×Ó

      Ã×¹þÓÎͶ×ʵĿ¨Åƹ«Ë¾ÒªÉÏÊÐÁË

      È«ÇòÊ׿îÁ¿²ú¹Ì̬µç³ØÒѽµÉúÔÚ·ÒÀ¼£¬£¬£¬ÖйúÆóҵΪºÎ²»Å£¿

      ÏÂÔØ

    • ¡¶»ªÎªÄêÖÕ½±·ÖÅä²½·¥¡·£º£º£ºÇ¿µ÷Т˳ÀûÈ󣬣¬£¬¿ÉÏòδÀ´½è½±½ð

      ²ÌÒÀÁÖ2ÒÚÑݳª»á¿÷Ëð³¬7ÍòÍò£¬£¬£¬²¿·ÖÍøÓÑÒÔΪÄÚÈÝ»ÄÌÆ½«Æä¾Ù±¨

      ¶àÖ»»ù½ðÐû²¼ÏÞ¹º£¬£¬£¬ÒµÄÚÈËÊ¿£º£º£ºÖ÷ÒªÊÇΪÁË¿ØÖÆ»ù½ð¹æÄ£

      ÏÂÔØ

    • ¶àÖ»»ù½ðÐû²¼ÏÞ¹º£¬£¬£¬ÒµÄÚÈËÊ¿£º£º£ºÖ÷ÒªÊÇΪÁË¿ØÖÆ»ù½ð¹æÄ£

      пîÈÕ²úZÁÁÏàÐÂÔöÊÖ¶¯±äËÙÏäǰÁ³Î¢µ÷

      ÖÇÆ×VSMiniMax£º£º£º¹ú²ú´óÄ£×ÓË«ÐÛÉÏÊмÇ

      ÏÂÔØ

    • ¾Í²îËûÁË£¡£¡£¡°ÔÁè¡¢¡¢¡°ÓÌÌ«Öí¡±¡¢¡¢ÄÉ´âÀñ¡­¡­µÂ¹ú½¾üµÚ26É¡±øÍű»ÆØ³óÎÅ

      »¢Ðá[×÷¡¤ÐáÖ®ÐÇ]ÖܰñµÚ295¡«296ÆÚ

      ´æ´¢Ð¾Æ¬»ò½«ÕǼÛ50%£¬£¬£¬ÊÖ»ú³§É̽ôÆÈ¼õ²úǧԪ»ú

      ÏÂÔØ

    • ίÄÚÈðÀ­À×´ï±øÐÎòµÄ¿Ö²À³¡¾°£¬£¬£¬Í»ÏÔ¼ß-16DµÄÕ½ÂÔÕ½Êõ¼ÛÖµ

      ±ÈÑǵÏÐÂÆ·ÅÆ¡°Áì»ã¡±À´ÁË£¬£¬£¬4¿î³µÐÍÆØ¹â£¡£¡£¡ÖªÇéÈËÊ¿£º£º£º×¨¹©´óÅúÁ¿²É¹ºÐèÇó

      ÉϺ££º£º£ºÖ§³Ö´æÁ¿Ḛ́ìÂ¥ÓîË¢ÐÂÎªÑøÀÏÉèÊ©£¬£¬£¬ÃãÀø¸÷ÇøÆ¾Ö¤´²Î»ÊýÄ¿¸øÓèÒ»´ÎÐÔ½òÌù

      ÏÂÔØ

    • ÒÁÀÊÊ±ÊÆÍ»È»·´×ª£¬£¬£¬ÌØÀÊÆÕÓÖÐÄÉúÒ»¼Æ

      ËÙ¿´£¡£¡£¡»ÆÈÊÑ«CES2026Ñݽ²Íò×Öʵ¼£º£º£ºË¦³ö¡°ÎïÀíAI¡±ÍõÅÆ

      »¨Ò»¸öÔ¡°¿çÄꡱµÄÄêÇáÈË£¬£¬£¬ÊÂʵÔÚ¡°¿ç¡±Ê²Ã´

      ÏÂÔØ

    • ¹ú·À¿Æ¼¼¹¤Òµ¾Ö£º£º£º¼ß10CEÊ×´ÎÈ¡µÃʵսս¹û£¬£¬£¬¿ÕÕ½Öл÷Âä¶à¼ÜÕ½»ú£¬£¬£¬×Ô¼ºÎÞÒ»Ëðʧ

      CES2026¼ûÖ¤ÐÂÒ»´úAIÅãͬ»úеÈËÂ仧ͨË×¼ÒÍ¥

      ÌØÀÊÆÕ7Äêǰ´³´ó»ö£¡£¡£¡¶íÂÞ˹¡°½û¼Éµ¼µ¯¡±Õ¨´©ÎÚ¿ËÀ¼£¬£¬£¬±±Ô¼±»´òãÂ

      ÏÂÔØ

    • ȸ³²±©À×£¬£¬£¬ÐÅÈÎÒÑËÀ

      ÉîÛÚÒ»½ÖµÀÍø¸ñÔ±ÉîÒ¹11µãÈë»§¼ì²éÏû·À£¬£¬£¬Ôâ¾ÜºóÈÔÈëÄÚÕÕÏàÒýͶËߣ¬£¬£¬½ÖµÀ»ØÓ¦

      ÓÅÏÈÔ®ÎÚ£¡£¡£¡Ó¢¹úʱ¸ô60ÄêÔÙ´ÎÑÐÖÆµ¯µÀµ¼µ¯£¬£¬£¬0.2¶Öµ¯Í·500ǧÃ×Éä³Ì

      ÏÂÔØ

    • ³É·É¹ÙÐû¼ß10CEÕ½¼¨ºó£¬£¬£¬Ó¢¹úÖÇ¿âÓÖ¡°²¹µ¶¡±£¬£¬£¬°ÑÓ¡¶ÈÈËÆø»µÁË

      Shopify¹É¼ÛÒòAI¹ºÎï¿´·¨´óÕÇ£¬£¬£¬ÏÖÔÚÕýЯÊÖGeminiÓëCopilot¼Ó´óͶÈë

      º«¹ú¡°179ËÀ¿ÕÄÑ¡±75ÃëºÚÏ»×Ó¼ÒôÊ×´ÎÐû²¼£»£»ÊÓ²ì³ÆÈôÎÞΧǽ¿ÉȫԱÉú»¹

      ÏÂÔØ

    • AloÏë×ölululemom£¬£¬£¬µ«¸üÏë×ömiumiu
    • AloÏë×ölululemom£¬£¬£¬µ«¸üÏë×ömiumiu

      ¾Ó¼Ò½¡ÉíÆ·ÅÆ¡¸ãåСÂí¡¹Íê³É½üÍòÍòÔªPre-AÂÖÈÚ×Ê

      Õ½¶·µµ°¸£º£º£º½É»ñµÂ¾üÎļþÖеÄ̹¿ËÇ鱨

      ÏÂÔØ

    • 2ÂÖ4·Ö£¡£¡£¡ÑÇÖÞ¹Ú¾ü±ÈÖйú¶Ó»¹Î£ÏÕ£º£º£º´òƽº«¹ú=¿ÉÄܳö¾Ö3¶Óͬ»ý5·Ö

      Ó¢¹úͨѶÖÎÀí¾Ö¶ÔXƽ̨Õö¿ªÊÓ²ì

      Ó¢¾üºäÕ¨»ú¿ÕÇÚ±»Òâ¾ü·ý²£¬£¬£¬×ªÔËʱ¾¹ÀÖ³ÉÐ®ÖÆ³ðÈË·É»ú£¬£¬£¬»Ø»ùµØ²îµã±»¡°Åç»ð¡±»÷Âä

      ÏÂÔØ

    • Ó¢¾üºäÕ¨»ú¿ÕÇÚ±»Òâ¾ü·ý²£¬£¬£¬×ªÔËʱ¾¹ÀÖ³ÉÐ®ÖÆ³ðÈË·É»ú£¬£¬£¬»Ø»ùµØ²îµã±»¡°Åç»ð¡±»÷Âä
    • ¶Ô²»Æð³ÂÐÇÐñ£¬£¬£¬Õâ´Î±»36Ëê´úÐñÃÔµ¹ÁË

      Ã÷µÀÉϺ£×â·¿£¬£¬£¬Âò³¬¹óµÄ¼Ò¾ßÈ´Éá²»µÃ×⳵룬£¬£¬×ÔÐгµ·ÅÎÝÀïºÃÂÒ

      ÐÁ°ØÇà±»ÆØÏÖÉí2026ÑëÊÓ´ºÍí²ÊÅÅ£¬£¬£¬¾àÀëÖìæÂæÂÈ¥ÊÀÒÑ8¸öÔÂ

      ÏÂÔØ

    • ²ÌÒÀÁÖ2ÒÚÑݳª»á¿÷Ëð³¬7ÍòÍò£¬£¬£¬²¿·ÖÍøÓÑÒÔΪÄÚÈÝ»ÄÌÆ½«Æä¾Ù±¨
    • 2ÂÖ4·Ö£¡£¡£¡ÑÇÖÞ¹Ú¾ü±ÈÖйú¶Ó»¹Î£ÏÕ£º£º£º´òƽº«¹ú=¿ÉÄܳö¾Ö3¶Óͬ»ý5·Ö

      ÖÇÆ×VSMiniMax£º£º£º¹ú²ú´óÄ£×ÓË«ÐÛÉÏÊмÇ

      Åí½£·æ£º£º£º2026£¡£¡£¡Ã¿Ò»¸öÈ˶¼ÊÇÒ»³¡Á¿×Ó¸ïÃü

      ÏÂÔØ

    ±êÇ©Áбí

    ×îÐÂÁôÑÔ

    ÈÈÃÅÊÖÓÎ

    ×ܽáÈ«Íø738ƪЧ¹û

    ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð

    • Öֱ𣺣º£º ÉúÑÄ·þÎñ
    • ´óС£º£º£º 651.369MB
    • ϵͳ£º£º£º Android
    • ¸üУº£º£º 2026-01-14 00:32
    • ÈËÆø£º£º£º 30492
    • ̸ÂÛ£º£º£º 3547
    °²×¿ÏÂÔØ

    Ó¦ÓýéÉÜ

    • ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð
    • ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð
    • ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð
    °Ù¶È°ü¹Ü£¬£¬£¬ÎªÄúËÑË÷»¤º½wAAAABJRU5ErkJggg==

    ×î¼Ñ»Ø¸²

    1¡¢¡¢ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð?¡ª¡ªTG:Ôݲ»¹ûÕæ¡ª¡ª??????????????????????????

    2¡¢¡¢QQɨÂëͬ³Ç·þÎñ?¡ª¡ªTG:Ôݲ»¹ûÕæ¡ª¡ª?????????????????????????

    3¡¢¡¢?ÖØ°õÐÂÎÅÀ´Ï®£¡£¡£¡??ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð-APPÏÂÔØ?Ö§³Ö:winall/win7/win10/win11?ϵͳÀàÐÍ?:ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð(2025ȫվ)×îа汾IOS/°²×¿¹Ù·½Èë¿ÚN.19.90.78(Ç徲ƽ̨)

    4¡¢¡¢?¶À¼Ò£¡£¡£¡???ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð-APPÏÂÔØ?Ö§³Ö:winall/win7/win10/win11?ϵͳÀàÐÍ?:ÒåÎÚ±´´åСÏï×ÓÍíÉÏ¿ª·ÅÂð(2025ȫվ)×îа汾IOS/°²×¿¹Ù·½Èë¿ÚN.7.71.23(Ç徲ƽ̨)

    u=131048712,165454654&fm=30&app=106&f=JPEG?w=312&h=208&s=B8826397500272E84C385C640300E070 u=3941562296,165454567&fm=30&app=106&f=JPEG?w=312&h=208&s=75BBAD771F20772ECEE5F144030060B1 u=877781737,165125920&fm=30&app=106&f=JPEG?w=312&h=208&s=C0D27A85D64119551181E28A03003097

    Ö©Öë³ØÖеÄ302Ìø×ªÊ¹Óù淶

    ×÷Ϊһ¸öרҵµÄSEOÐÐÒµÕ¾³¤£¬£¬£¬Ïàʶ²¢ÕÆÎÕÖ©Öë³Ø³ÌÐòµÄÔ­ÀíºÍÓÃ;ÊǺÜÊÇÖ÷ÒªµÄ¡£Ö©Öë³ØÊÇÒ»ÖÖÓÃÓÚÄ£ÄâËÑË÷ÒýÇæÖ©Ö루spider£©ÅÀÈ¡ÍøÒ³µÄ¹¤¾ß£¬£¬£¬Ëü¿ÉÒÔÄ£Äâ¶à¸öÖ©Öëͬʱ·ÃÎÊÍøÕ¾£¬£¬£¬²¢ÍøÂçÍøÕ¾ÉϵÄÐÅÏ¢¡£ÔÚSEOÓÅ»¯µÈÁìÓò£¬£¬£¬Ö©Öë³Ø³ÌÐò¿ÉÒÔ×ÊÖúÕ¾³¤¸üºÃµØÏàʶËÑË÷ÒýÇæ¶ÔÍøÕ¾µÄ»á¼ûÇéÐΣ¬£¬£¬´Ó¶ø×ö³öÏìÓ¦µÄÓÅ»¯¡£

    Ö©Öë³Ø³ÌÐòµÄÔ­Àí

    Ö©Öë³Ø³ÌÐòµÄÔ­ÀíÖ÷ÒªÊÇͨ¹ýÄ£Äâ¶à¸öÖ©Öëͬʱ·ÃÎÊÍøÕ¾£¬£¬£¬ÍøÂçÍøÕ¾ÉϵÄÐÅÏ¢¡£ÔÚÏÖʵ²Ù×÷ÖУ¬£¬£¬Õ¾³¤¿ÉÒÔÉèÖÃÖ©Öë³Ø³ÌÐòÄ£Äâ²î±ðËÑË÷ÒýÇæµÄÖ©Ö룬£¬£¬ºÃ±ÈGoogle¡¢¡¢BingµÈ£¬£¬£¬ÒÔ´ËÀ´Ïàʶ²î±ðËÑË÷ÒýÇæ¶ÔÍøÕ¾µÄ»á¼ûÇé¿ö¡£Í¨¹ýÖ©Öë³Ø³ÌÐòÍøÂçµ½µÄÊý¾Ý£¬£¬£¬Õ¾³¤¿ÉÒÔÆÊÎöÍøÕ¾ÔÚËÑË÷ÒýÇæÖеÄÅÅÃûÇéÐΡ¢¡¢ÍøÒ³±»Ë÷ÒýµÄÇéÐεÈ£¬£¬£¬´Ó¶ø¸üºÃµØ¾ÙÐÐSEOÓÅ»¯¡£

    Ö©Öë³Ø³ÌÐòµÄÓÃ;

    Ö©Öë³Ø³ÌÐòÔÚSEOÓÅ»¯ÖÐÓÐ×ÅÆÕ±éµÄÓÃ;¡£Ê×ÏÈ£¬£¬£¬Í¨¹ýÖ©Öë³Ø³ÌÐò¿ÉÒÔÊÓ²ìËÑË÷ÒýÇæÖ©Öë¶ÔÍøÕ¾µÄ»á¼ûÇéÐΣ¬£¬£¬****ÏÖÍøÕ¾±»ÆÁ±Î»ò±»½µÈ¨µÄÇéÐΡ£Æä´Î£¬£¬£¬Ö©Öë³Ø³ÌÐò¿ÉÒÔ¼à¿ØÍøÕ¾µÄË÷ÒýÇéÐΣ¬£¬£¬****ÏÖÄÄЩÒ³ÃæÎ´±»Ë÷Òý»ò±»ÒÅ©¡£×îºó£¬£¬£¬Ö©Öë³Ø³ÌÐò»¹¿ÉÒÔ¸ú×ÙÍøÕ¾Òªº¦´ÊµÄÅÅÃûÇéÐΣ¬£¬£¬ÊµÊ±µ÷ÕûÓÅ»¯Õ½ÂÔ¡£

    ×îºó

    ÕâÊÇÒ»¸ö¹ØÓÚ AI µ×²ãÂß¼­Öع¹µÄʱ¿Ì¡£ºã¾ÃÒÔÀ´£¬£¬£¬Transformer ¼Ü¹¹±»À§ÔÚÒ»¸öÌÚ¹óµÄã£ÂÛÖУº£º£ºÎÒÃÇÓÃ×Å×îÏȽøµÄ GPU ËãÁ¦£¬£¬£¬È¥Èà AI Ä£×Ó " ËÀ¼ÇÓ²±³ " ÄÇЩ²é×Öµä¾ÍÄÜÖªµÀµÄ¾²Ì¬ÖªÊ¶¡£DeepSeek ÁºÎÄ·æÍŶÓÓëÆä±±´óÏàÖúÕßÔÚ½ñÈÕÆÆÏþÐû²¼µÄÖØ°õÂÛÎÄ¡¶Conditional Memory via Scalable Lookup¡·£¬£¬£¬³¹µ×Í»ÆÆÁËÕâÒ»½©¾Ö¡£ËûÃÇÌá³öÁËÒ»ÖÖȫеÄEngram£¨Ó¡¼££©Ä£¿é£¬£¬£¬ÔڹŰåµÄ " Ìõ¼þÅÌËã "£¨MoE£©Ö®Í⣬£¬£¬¿ª·¢Á˵ڶþÌõÏ£º±»¯Õ½Ïß¡ª¡ª" Ìõ¼þÓ°Ïó "¡£Õâ²»µ«ÊÇÒ»´ÎÊÖÒÕÐÞ²¹£¬£¬£¬¶øÊÇÒ»³¡¹ØÓÚÄ£×Ó " ÄÔÈÝÁ¿ " µÄ¹©Ó¦²àˢС£Ëü֤ʵÎú£º£º£ºµ±ÎÒÃǽ« " Ó°Ïó " ´Ó " ÅÌËã " ÖаþÀ룬£¬£¬°Ñ¸Ã±³µÄ½»¸ø " ×Öµä "£¬£¬£¬°Ñ¸ÃËãµÄ½»¸ø´óÄÔ£¬£¬£¬AI µÄÍÆÀíÄÜÁ¦½«Ó­À´·´Ö±¾õµÄ±¬·¢Ê½ÔöÌí¡£DeepSeek ÍýÏëÔÚ 2 Ô´º½ÚǰºóÕýʽÐû²¼ V4£¬£¬£¬¶øÕâÒ»¿Ì»òÐí¾ÍÊÇ DeepSeek V4 ½µÉúµÄǰҹ¡£ ÐòÕ£º£º£ºÁù²ãÉñ¾­ÍøÂçµÄ " ÎÞÓù¦ "¹ÊÊÂµÄÆðµã£¬£¬£¬Ô´ÓÚ DeepSeek ÍÅ¶Ó¶Ô Transformer ÄÚ²¿ÔË×÷»úÖÆµÄÒ»´Î " ºË´Å¹²Õñ " ɨÃè¡£ÔÚÈ˹¤ÖÇÄܵĺںÐ×ÓÀ£¬£¬µ±´óÄ£×Ó¿´µ½ "Diana, Princess of Wales"£¨´÷°²ÄÈ£¬£¬£¬Íþ¶ûÊ¿Íõåú£©Õâ¸ö¶ÌÓïʱ£¬£¬£¬ËüµÄÄÚ²¿±¬·¢ÁËÒ»³¡ÁîÈ˷ѽâÇÒ¼«ÆäÌÚ¹óµÄ " ÄÚÚ§ "¡£Ñо¿Ö°Ô±·¢Ã÷£¬£¬£¬ÎªÁËʶ±ðÕâ¸öÀο¿µÄʵÌ壬£¬£¬Ä£×Ó¾¹È»¶¯ÓÃÁËÕûÕû 6 ²ãÍøÂ磺£º£ºµÚ 1-2 ²ã£º£º£ºÄ£×Ó»¹ÔÚ×ÁÄ¥ "Wales" »òÐíÊÇÒ»¸ö¹ú¼Ò£»£»µÚ 3 ²ã£º£º£ºËüÒâʶµ½ÕâÊÇÅ·ÖÞµÄÒ»¸öµØÀí¿´·¨£»£»µÚ 4 ²ã£º£º£ºËü×îÏÈÆ´¼¯³ö "Princess of Wales" ËÆºõÊÇÒ»¸öÍ·ÏΣ»£»µÚ 5 ²ã£º£º£ºËüåÚÏëµ½ÁË " Íþ¶ûÊ¿Ç×ÍõµÄÆÞ×Ó "£»£»µÚ 6 ²ã£º£º£ºÖ±µ½ÕâÀ£¬£¬Ëü²ÅÖÕÓÚÈ·ÈÏ£¬£¬£¬ÕâÊÇÖ¸ÄÇÎ»ÖøÃûµÄ " ´÷°²ÄÈÍõåú "¡£ÔÚһλ׷Çó¼«ÖÂЧÂʵļܹ¹Ê¦ÑÛÖУ¬£¬£¬Õâ¼òÖ±ÊÇËãÁ¦µÄ±©éåÌìÎï¡£" ´÷°²ÄÈÍõåú " ÊÇÒ»¸ö¿Í¹Û±£´æµÄ¡¢¡¢¾²Ì¬µÄʵÌ壬£¬£¬Ëü²»»áÓÉÓÚÉÏÏÂÎĵÄת±ä¶ø¸Ä±äÆäʵÖÊ¡£ÎªÁËÌáÈ¡Õâ¸öÔ­À´²é×Öµä¾ÍÄÜÖªµÀµÄÊÂʵ£¬£¬£¬Transformer ¾¹È»¶¯ÓÃÁËÕûÕû 6 ²ãÉî¶ÈµÄÌÚ¹ó¾ØÕóÔËËãÈ¥ " ÖØÐÞ " Õâ¸ö¿´·¨¡£Õâ¾ÍÏñÊÇÒ»¸ö¾øÊÀÌì²Å£¬£¬£¬ÔÚÈ¥½â¾ö΢»ý·ÖÄÑÌâ֮ǰ£¬£¬£¬Ã¿´Î¶¼µÃÏÈ»¨°ëСʱĬдһ±é¾Å¾Å³Ë·¨±í¡£ ÕâÖÖ " ÒþʽӰÏó " µÄ»úÖÆ£¬£¬£¬ÆÈʹģ×Ó½«Ãû¹óµÄ²ÎÊýÈÝÁ¿ºÍÍøÂçÉî¶È£¬£¬£¬ÆÌÕÅÔÚÁ˼òÆÓµÄģʽƥÅäÉÏ¡£DeepSeek ÔÚÕâÆª³¤´ï 33 Ò³µÄÂÛÎÄÖУ¬£¬£¬Ìá³öÁËÒ»¸öÖ±»÷Áé»êµÄ¿½ÎÊ£º£º£ºÎªÊ²Ã´²»Ö±½Ó¸ø´óÄ£×ÓÅäÒ»±¾¿ÉÒÔËæ²éËæÓÃµÄ " ³¬µÈ×Öµä "£¿ µÚÒ»Õ£º£º£º¼Ü¹¹ÖØËÜ¡ª¡ª Engram Ä£¿éµÄ±©Á¦ÃÀѧΪÏàʶ¾öÕâ¸öÎÊÌ⣬£¬£¬DeepSeek Ìá³öÁËÒ»ÖÖÃûΪ "Engram£¨Ìõ¼þÓ°Ïó£©" µÄÈ«ÐÂÄ£¿é¡£ÈôÊÇ˵ MoE£¨»ìÏýר¼ÒÄ£×Ó£©ÊÇ°Ñ " ´óÄÔ " ·Ö³ÉÁ˲î±ðµÄÇøÓò£¬£¬£¬Èòî±ðµÄר¼ÒÈÏÕæ²î±ðµÄ˼Ë÷£¨Ìõ¼þÅÌË㣩£»£»ÄÇô Engram ¾ÍÊǸø´óÄÔÍâ¹ÒÁËÒ»¸öÖØ´óµÄ " º£ÂíÌå "£¬£¬£¬×¨ÃÅÈÏÕæ´æ´¢¾²Ì¬ÖªÊ¶£¨Ìõ¼þÓ°Ï󣩡£1. ¸´Éú "N-gram"£º£º£º´Ó¹ÅÀÏÖÇ»ÛÖÐѰÕÒÃÕµ×Engram µÄ½¹µãÁé¸Ð£¬£¬£¬¾¹È»À´×ÔÓÚ NLP£¨×ÔÈ»ÓïÑÔ´¦Àí£©ÁìÓòµÄ " ÉϹÅÉñÆ÷ " ¡ª¡ª N-gram¡£ÔÚÉî¶ÈѧϰͳÖÎÌìÏÂ֮ǰ£¬£¬£¬ÎÒÃǾÍÊÇ¿¿Í³¼Æ "N ¸ö´Êͬʱ·ºÆðµÄ¸ÅÂÊ " À´Ã÷È·ÓïÑԵġ£DeepSeek ½«ÕâÒ»¾­µä¿´·¨¾ÙÐÐÁËÏÖ´ú»¯µÄħ¸Ä£º£º£º¹Å°åµÄ Transformer£º£º£ºÖªÊ¶ÊèÉ¢ÔÚÉñ¾­ÔªµÄÈ¨ÖØ£¨Weights£©À£¬£¬Ìáȡ֪ʶÐèÒª¾­ÓÉÖØ´óµÄÏßÐÔ²ãÅÌË㣬£¬£¬ÖØÆ¯ºó¸ß¡£Engram Ä£¿é£º£º£ºËüÊÇÒ»¸öÖØ´óµÄ¡¢¡¢¿ÉÀ©Õ¹µÄǶÈë±í£¨Embedding Table£©¡£µ±Ä£×Ó¶Áµ½ " ÕÅÖÙ¾° " »òÕß " ËÄ´ó·¢Ã÷ " ÕâÖÖÀο¿´îÅ䣨N-gram£©Ê±£¬£¬£¬²»ÐèÒª¶¯ÓôóÄÔÆ¤²ãÈ¥ÍÆÀí£¬£¬£¬Ö±½Óͨ¹ý¹þÏ£Ë÷Òý£¬£¬£¬ÔÚÄÚ´æ±íÖÐ " ²é " ³ö¶ÔÓ¦µÄÏòÁ¿¡£ÕâÒ»Àú³ÌµÄʱ¼äÖØÆ¯ºóÊÇO ( 1 ) ¡ª¡ªÕâÒâζ×ÅÎÞÂÛ֪ʶ¿âÅòÕ͵½¶à´ó£¨ÄÄÅÂÊÇ 1000 ÒÚ²ÎÊý£©£¬£¬£¬²éÕÒËÙÂÊÏÕЩÎȹÌ£¬£¬£¬ÇÒ¼«¿ì¡£2. Èý´óÊÖÒÕ»¤³ÇºÓ¼ÈÈ»²é±íÕâôºÃ£¬£¬£¬ÎªÊ²Ã´ÒÔǰûÈË×ö£¿ÓÉÓÚÓÐÈý¸öÀ¹Â·»¢£º£º£º´æ´¢±¬Õ¨¡¢¡¢¶àÒå´Ê³åÍ»¡¢¡¢²ÎÊý·ÖÅä¡£DeepSeek ¸ø³öÁ˽̿ÆÊé¼¶µÄ½â¾ö·½°¸£º£º£ºA. ´Ê±íѹËõ£º£º£º¼«ÖµÄÈ¥ÖØÌìÏÂÉϵĴÊ×é×éºÏÊÇÌìÎÄÊý×Ö¡£DeepSeek Ê×ÏÈ×öÁËÒ»²½ " ÎÞËðѹËõ "¡£ÔÚ·Ö´ÊÆ÷£¨Tokenizer£©²ãÃæ£¬£¬£¬Ëü½«ÓïÒåÏàͬµ«Ð´·¨²î±ðµÄ´Ê¾ÙÐÐÁ˹éÒ»»¯¡£ÀýÈ磬£¬£¬"Apple"£¨Ê××Öĸ´óд£©ºÍ "apple"£¨Ð¡Ð´£©ÔÚÓïÒåÉÏͨ³£Ö¸Í³Ò»¸ö¹¤¾ß¡£Í¨¹ýÓ³ÉäºÏ²¢£¬£¬£¬ÓÐÓôʱíÖ±½ÓËõСÁË 23%¡£Õâ²»µ«½ÚÔ¼Á˿ռ䣬£¬£¬¸üÈÃ֪ʶµÄÃܶȴó·ùÌáÉý¡£B. ¶àÍ·¹þÏ££º£º£º½â¾ö " ¹þÏ£³åÍ» "²»¿ÉÄܰÑËùÓÐ N-gram ¶¼´æÏÂÀ´¡£Engram ʹÓÃÁË " ¶àÍ·¹þÏ££¨Multi-Head Hashing£©" ÊÖÒÕ¡£Í¨¹ý¶à¸ö¹þÏ£º£º£º¯Êý£¬£¬£¬½«ÎÞÏÞµÄ N-gram Ó³Éäµ½ÓÐÏÞµÄÄÚ´æ²ÛλÖС£ËäÈ»»áÓйþÏ£³åÍ»£¨¼´Á½¸ö²î±ðµÄ´Ê±»Ó³Éäµ½ÁËͳһ¸öλÖã©£¬£¬£¬µ«Í¨¹ý " ¶àÍ· " Éè¼Æ£¬£¬£¬Ä£×Ó¿ÉÒÔ´Ó¶à¸öºòѡЧ¹ûÖÐÆ´¼¯³ö׼ȷµÄÐÅÏ¢£¬£¬£¬¼«´óµØÌá¸ßÁ˳°ôÐÔ¡£C. ÉÏÏÂÎÄÃſأº£º£º¸øÓ°ÏóÅä¸ö " ²ÃÅÐ "ÕâÊÇ×ÃîµÄÒ»±Ê¡£²é±íÊÇËÀµÄ£¬£¬£¬ÓïÑÔÊÇ»îµÄ¡£ºÃ±È " Æ»¹û " Õâ¸ö´Ê¡£ÔÚ " ³ÔÆ»¹û " µÄÓᄈϣ¬£¬£¬Ëüָˮ¹û£»£»ÔÚ " Æ»¹ûÐû²¼»á " µÄÓᄈϣ¬£¬£¬ËüÖ¸¿Æ¼¼¹«Ë¾¡£Ö±½Ó²é±í¿ÉÄÜ»áÒýÈëÔëÉù¡£DeepSeek Éè¼ÆÁËÒ»¸ö " ÉÏÏÂÎĸÐÖªÃÅ¿Ø "£¨Context-aware Gating£©¡£Query£¨ÅÌÎÊ£©£º£º£ºÄ¿½ñÉÏÏÂÎĵÄÒþ²Ø×´Ì¬£¨Hidden State£©¡£Key/Value£¨¼üÖµ£©£º£º£º²é±í»ñµÃµÄ¾²Ì¬ÏòÁ¿¡£Õâ¸öÃſؾÍÏñÒ»¸ö²ÃÅС£ÈôÊDzé³öÀ´µÄ " ¾²Ì¬ÖªÊ¶ " ºÍÄ¿½ñµÄ " ÉÏÏÂÎÄ " ²»´î£¬£¬£¬²ÃÅоͻá°ÑÈ¨ÖØÑ¹µÍ£¨Gate ÖµÇ÷Ïò 0£©£¬£¬£¬ÈÃÄ£×ÓºöÂÔÕâ¸öÔëÉù£»£»ÈôÊÇÍêÉÆÆõºÏ£¨ºÃ±È " É˺®ÔÓ²¡ÂÛ " ºóËæ×Å " ÕÅÖÙ¾° "£©£¬£¬£¬²ÃÅоͻá°Ñ´óÃÅ·­¿ª£¨Gate ÖµÇ÷Ïò 1£©£¬£¬£¬Ö±½Ó°Ñ֪ʶעÈëÄ£×Ó¡£ µÚ¶þÕ£º£º£º»Æ½ð±ÈÀý¡ª¡ª·¢Ã÷ AI Ä£× "U ÐÍÇúÏß "¼Ü¹¹Éè¼ÆºÃÁË£¬£¬£¬½ÓÏÂÀ´µÄÎÊÌâÊÇ£º£º£ºÔõô·Ö¾Ó²ú£¿¼ÙÉèÎÒÃÇÏÔ¿¨ÀïµÄÏÔ´æÊÇÓÐÏ޵쬣¬£¬×ܲÎÊýÔ¤ËãÒ²ÊÇÀο¿µÄ¡£ÎÒÃÇÓ¦¸Ã°Ñ¼¸¶à²ÎÊý·ÖÅ䏸 MoE µÄ " ר¼Ò "£¨ÈÏÕæÅÌË㣩£¬£¬£¬¼¸¶à²ÎÊý·ÖÅ䏸 Engram µÄ " ×Öµä "£¨ÈÏÕæÓ°Ï󣩣¿ÕâÊÇÒ»¸öµä·¶µÄ×ÊÔ´ÉèÖò©ÞÄ¡£DeepSeek ÍŶӾÙÐÐÁËÒ»³¡´ó¹æÄ£µÄÏûÈÚʵÑ飬£¬£¬É¨ÃèÁË´Ó 0% µ½ 100% µÄ·ÖÅä±ÈÀý£¬£¬£¬Ð§¹û»­³öÁËÒ»ÌõÍêÉÆµÄ "U ÐÍ Scaling Law ÇúÏß "¡£ÕâÕÅͼչÏÖÁË AI Ä£×ÓÉè¼ÆµÄµ×²ã¼ÍÂÉ£º£º£º×ó²à¼«¶Ë£¨´¿ Engram£©£º£º£ºÈôÊǰѲÎÊýÈ«¸ø×ֵ䣬£¬£¬Loss ºÜ¸ß¡£ÓÉÓÚÄ£×ÓÄð³ÉÁË " Êé°×³Õ "£¬£¬£¬¹âÓÐËÀ¼ÇÓ²±³£¬£¬£¬Ã»ÓÐÂß¼­ÍÆÀíÄÜÁ¦¡£ÓҲ༫¶Ë£¨´¿ MoE£©£º£º£ºÈôÊǰѲÎÊýÈ«¸ø×¨¼Ò£¬£¬£¬Loss Ò²ºÜ¸ß¡£ÓÉÓÚר¼ÒÃDZ»ÆÈ°Ñ¾«Éñ¶¼»¨ÔÚ±³Ê飨ӰÏó¾²Ì¬ÖªÊ¶£©ÉÏ£¬£¬£¬Ã»¿Õ¸ÉÕýÊ¡£»£»Æ½ðÖ§½âµã£¨¦Ñ ¡Ö 75%-80%£©£º£º£ºµ±ÎÒÃǽ«Ô¼20%-25% µÄÏ£º±²ÎÊýÔ¤Ëã·Ö¸ø Engram£¬£¬£¬Ê£Ïµĸø MoE ʱ£¬£¬£¬Ä£×ÓµÄÑéÖ¤¼¯ Loss ½µµ½ÁË×îµÍµã¡£ÕâÊÇÒ»¸ö¼«¾ßÖ¸µ¼ÒâÒåµÄ·¢Ã÷£º£º£º¹ØÓÚ¼¸°ÙÒÚ²ÎÊýµÄ´óÄ£×ÓÀ´Ëµ£¬£¬£¬´¿´â¶ÑÆöÅÌË㵥루MoE ר¼Ò£©ÒѾ­ÊDZ߼ÊЧӦµÝ¼õÁË£¬£¬£¬±ØÐèÒýÈëרÃŵľ²Ì¬Ó°ÏóÄ£¿éÀ´ÊµÏÖ " ´æËãÆ½ºâ "¡£ µÚÈýÕ£º£º£º·´Ö±¾õµÄ±¬·¢¡ª¡ªÎªÊ²Ã´ " ²é×Öµä " ÄÜÌá¸ß " ÊýѧЧ¹û "£¿ÈôÊÇ Engram ½ö½öÊÇÈÃÄ£×Ó " ¼ÇÐÔ¸üºÃ "£¬£¬£¬ÕâÆªÂÛÎĵķÖÁ¿»¹È±·¦ÒÔÕð¾ªÉçÇø¡£ÊÂʵ£¬£¬£¬RAG£¨¼ìË÷ÔöÇ¿ÌìÉú£©Ò²Äܽâ¾ö֪ʶÎÊÌâ¡£ÕæÕýÈÃÒµ½ç¸ÐÓ¦Õ𺳵Ä£¬£¬£¬ÊÇʵÑéЧ¹ûÖÐÄÇЩÒâÁÏÖ®ÍâµÄÊÕÒæ¡£DeepSeek ¹¹½¨ÁËÈý¸ö±ÈÕÕÄ£×Ó£¬£¬£¬ÑÏ¿á¿ØÖÆ¼¤»î²ÎÊýÄ¿£¨3.8B£©ºÍѵÁ·Êý¾ÝÁ¿£¨262B tokens£©ÍêȫһÖ£º£º£ºDense-4B£º£º£º¹Å°åµÄŨÃÜÄ£×Ó¡£MoE-27B£º£º£º´¿ MoE Ä£×Ó£¨72 ¸öר¼Ò£©¡£Engram-27B£º£º£º»ìÏýÄ£×Ó£¨55 ¸öר¼Ò + 5.7B Engram ²ÎÊý£©¡£Ð§¹ûÁîÈË´óµøÑÛ¾µ£º£º£º1. ÒâÁÏÖ®ÖУº£º£ºÖªÊ¶ÀàʹÃü°Ô°ñÔÚ MMLU£¨×ÛºÏ֪ʶ£©ÉÏ£¬£¬£¬Engram Ä£×ÓÌáÉýÁË3.4 ·Ö£»£»ÔÚ CMMLU£¨ÖÐÎÄ֪ʶ£©ÉÏ£¬£¬£¬ÌáÉýÁË4.0 ·Ö¡£ÕâºÜºÃÃ÷È·£¬£¬£¬Íâ¹ÒÁË×ֵ䣬£¬£¬ÖªÊ¶×ÔÈ»¸üºÃÁË£¬£¬£¬»Ã¾õ¸üÉÙÁË¡£2. ÒâÁÏÖ®Í⣺£º£ºÂß¼­¡¢¡¢´úÂë¡¢¡¢ÊýѧÖÜÈ«±©Õǰ´Àí˵£¬£¬£¬" ²é×Öµä " ºÍ " ×öÊýѧÌâ " û¹ØÏµ¡£µ«ÔÚ BBH£¨×ÛºÏÍÆÀí£©ÉÏ£¬£¬£¬Engram-27B ¾¹È»±Èͬ²ÎÊýµÄ´¿ MoE »ùÏßÌáÉýÁËÕûÕû5.0 ·Ö£¡£¡£¡MATH£¨Êýѧ£©£º£º£ºÌáÉý2.4 ·Ö¡£HumanEval£¨´úÂëÌìÉú£©£º£º£ºÌáÉý3.0 ·Ö¡£ARC-Challenge£¨ÖØ´óÍÆÀí£©£º£º£ºÌáÉý3.7 ·Ö¡£3. Éî¶ÈÆÊÎö£º£º£ºÓÐÓÃÉî¶È£¨Effective Depth£©ÀíÂÛΪʲô£¿Ò»¸ö " ËÀ¼ÇÓ²±³ " µÄÄ£¿é£¬£¬£¬ÎªÊ²Ã´ÄÜÌá¸ßÖÇÉÌ£¿DeepSeek ÍŶÓʹÓÃLogitLensºÍ "CKA£¨ÖÐÐÄºË¶ÔÆë£©" ÊÖÒÕ£¬£¬£¬¶ÔÄ£×ÓÄÚ²¿¾ÙÐÐÁË " ÆÊ½â "¡£ËûÃÇ·¢Ã÷ÁËÒ»¸ö¾ªÈ˵ÄÕ÷Ï󣺣º£º»¹¼ÇµÃ¿ªÍ·µÄ " ´÷°²ÄÈÍõåú " Âð£¿ÔÚ´¿ MoE Ä£×ÓÖУ¬£¬£¬Ç°¼¸²ãÍøÂç¶¼ÔÚæ×Å " Æ´¼¯¿´·¨ "¡£¶øÔÚ Engram Ä£×ÓÖУ¬£¬£¬ÓÉÓÚµÚ 2 ²ã¾Í²åÈëÁË Engram Ä£¿é£¬£¬£¬¾²Ì¬ÖªÊ¶µÄ¼ìË÷ÔÚ¼«ÔçµÄ½×¶Î¾ÍÍê³ÉÁË¡£ÕâÒâζ×Å£¬£¬£¬Ô­±¾ÓÃÓÚ " ËÀ¼ÇÓ²±³ " µÄǰ¼¸²ãÍøÂç±»½â·ÅÁË£¡£¡£¡ÕâÏ൱ÓÚ¸øÄ£×Ó " ÐéÔö " ÁËÉî¶È¡£ ÄÇЩ±»ÊͷųöÀ´µÄÍøÂç²ãºÍ×¢ÖØÁ¦Í·£¨Attention Heads£©£¬£¬£¬²»ÔÙÐèÒª´¦ÀíààËյľֲ¿ÒÀÀµ£¨ºÃ±Èʶ±ð " ÕÅÖÙ¾° " ÊÇË­£©£¬£¬£¬´Ó¶ø¿ÉÒÔÈ«Éñ¹á×¢µØÍ¶Èëµ½¸üÖØ´óµÄÈ«¾ÖÍÆÀí¡¢¡¢³¤³ÌÂß¼­¹¹½¨ºÍ´úÂëÂß¼­ÌìÉúÖÐÈ¥¡£Engram µÄʵÖÊ£¬£¬£¬²»ÊÇ " Ìæ»» " ÍÆÀí£¬£¬£¬¶øÊÇͨ¹ý " ·ÖÁ÷ " Ôӻ£¬£¬ÈôóÄÔרעÓÚ¸ü¸ßά¶ÈµÄ˼Ë÷¡£ µÚËÄÕ£º£º£º¹¤³ÌÆæ¼£¡£¡£¡ª¡ªÍ»ÆÆÓ¢Î°´ïµÄ " ÏÔ´æ°ÔȨ "¹ØÓÚ»ª¶û½ÖµÄͶ×ÊÕߺÍËãÁ¦ÖÐÐĵÄÔËάÕßÀ´Ëµ£¬£¬£¬ÕâÆªÂÛÎÄ×îÐԸеĵط½²»ÔÚÓÚ Score£¬£¬£¬¶øÔÚÓÚCost£¨±¾Ç®£©¡£ÔÚ AI ʱ´ú£¬£¬£¬×îÌÚ¹óµÄ×ÊÔ´²»ÊÇËãÁ¦£¨FLOPs£©£¬£¬£¬¶øÊÇÏԴ棨HBM£©¡£Ó¢Î°´ï H100 Ö®ÒÔÊǹ󣬣¬£¬ºÜºéÁ÷ƽÉÏÊÇÓÉÓÚÄÇϡȱµÄ HBM3e ÄÚ´æ¡£¶ø Engram ´øÀ´ÁËÒ»¸öÇ㸲ÐÔµÄÌØÕ÷£º£º£º³¹µ×µÄ´æËãÊèÉ¢¡£1. MoE µÄÍ´µã£º£º£ºÏÔ´æÍÌÊÉÕ߹ŰåµÄ MoE Ä£×Ó£¬£¬£¬Æä·ÓÉ»úÖÆ£¨Routing£©ÊǶ¯Ì¬µÄ¡£Ä£×Ó±ØÐèÏÈËã³öÄ¿½ñ Token µÄÌØÕ÷£¬£¬£¬ËãÍêÕâÒ»²ã£¬£¬£¬²ÅÖªµÀÏÂÒ»²ã¸ÃÕÒÄĸöר¼Ò¡£ÕâÒâζ×Å£¬£¬£¬ËùÓеÄר¼ÒÄ£×Ó±ØÐèʱ¿ÌÔÚÌÚ¹óµÄ GPU ÏÔ´æÀï´ýÃü£¬£¬£¬Ëæ½ÐËæµ½¡£2. Engram µÄÍ»ÆÆ£º£º£ºÈ·¶¨µÄÔ¤ÖªEngram µÄ²é±íÂß¼­ÊÇÈ·¶¨ÐԵġ£Ö»ÒªÊäÈëµÄÎı¾È·¶¨ÁË£¨ºÃ±È "A New Axis of Sparsity"£©£¬£¬£¬ÄÇôËü¶ÔÓ¦µÄ N-gram Ë÷Òý¾ÍÈ·¶¨ÁË¡£ÎÒÃÇ»ù´¡²»ÐèÒªµÈÄ£×ÓËãÍêǰһ²ã£¬£¬£¬ÔÚ Token ½øÈëÄ£×ÓµÄÄÇһ˲¼ä£¬£¬£¬ÎÒÃǾÍÖªµÀËüÐèÒª²éÄÄÕűíµÄÄÄÒ»ÐС£3. CPU µÄÄæÏ®£º£º£º°Ñ´óÄ£×ÓÈû½øÄÚ´æÌõÕâÒ»ÌØÕ÷´øÀ´ÁËÖØ´óµÄ¹¤³ÌÓ¯Àû£º£º£ºÐ¶ÔØ£¨Offload£©£º£º£ºÎÒÃÇ¿ÉÒ԰Ѽ¸°ÙÒÚ¡¢¡¢ÉõÖÁÉÏǧÒÚ²ÎÊýµÄ Engram ´Ê±í£¬£¬£¬Ö±½ÓÈÓµ½×ÔÖÆ¡¢¡¢Á¿´ó¡¢¡¢Ò×À©Õ¹µÄ "CPU Äڴ棨DRAM£©" À£¬£¬ÉõÖÁ·ÅÔÚ NVMe SSD ÉÏ¡£Ô¤È¡£¨Prefetching£©£º£º£ºÔÚ GPU Æ´ÃüÅÌËãǰһ²ã Transformer µÄʱ¼ä£¬£¬£¬CPU ʹÓà PCIe ͨµÀ£¬£¬£¬Òì²½µØ°ÑÏÂÒ»²ãÐèÒªµÄÓ°ÏóÊý¾Ý " Ԥȡ " ³öÀ´£¬£¬£¬ÍÆË͵½ GPU¡£ÑÚÊÎÑÓ³Ù£¬£¬£¬²¢Ðд¦Àí¡£DeepSeek ʵ²âÊý¾ÝÏÔʾ£º£º£º×ÝÈ»¹ÒÔØÁË100B£¨Ç§ÒÚ£©²ÎÊýµÄ Engram ±íµ½ CPU Äڴ棬£¬£¬Ïà±ÈÓÚ´¿ GPU ÍÆÀí£¬£¬£¬ÍÌÍÂÁ¿µÄϽµ²»µ½ 3%¡£ÕâÊÇÒ»¸öÈÃËùÓÐÓÉÓÚÂò²»µ½ HBM ¶ø½¹ÂǵÄÈË¿ñϲµÄ½áÂÛ¡£ÕâÒâζ×Å£¬£¬£¬Î´À´µÄ´óÄ£×Ó£¬£¬£¬" Ó°ÏóÈÝÁ¿ " ¿ÉÒԵͳÉÍâµØÎÞÏÞÀ©ÕÅ£¬£¬£¬¶ø²»±Ø±»Ó¢Î°´ïµÄÏԴ濨²±×Ó¡£ µÚÎåÕ£º£º£º³¤Îı¾µÄʤÀû¡ª¡ª NIAH ²âÊÔµÄÔ¾Éý³ýÁËͨÓÃÍÆÀí£¬£¬£¬Engram ÔÚ³¤Îı¾£¨Long Context£©ÁìÓòµÄÌåÏÖͬÑù֤ʵÎú " ·Ö¹¤ " µÄ¼ÛÖµ¡£ÔÚ³¤Îı¾´¦ÀíÖУ¬£¬£¬×¢ÖØÁ¦»úÖÆ£¨Attention£©µÄ´°¿ÚÊÇÓÐÏ޵ġ£ÈôÊÇ×¢ÖØÁ¦±»´ó×ڵľֲ¿ÐÅÏ¢£¨ÈçÀο¿¶Ì

    ±¾ÎÄÁ´½Ó£º£º£º?/p/Products/8480643.html

    °Ù¶ÈÔÊÐí£º£º£ºÈçÓöÐéαڲƭ£¬£¬£¬ÖúÄú****(Ôð±à£º£º£º³ÂÞÈÔ£¡£¡£¡¢¡¢µËΰÏè)

    Ïà¹ØÓ¦ÓÃ

    ¡¾ÍøÕ¾µØÍ¼¡¿
    µçÄÔ°æ-ÃÃ×Ó΢ÐŶþάÂëͼƬ-ÀîÑÇÅôØòÀëºó·³ÐÄÊÂÒ»¶Ñ£º