torch engine support chatglm3 #1159

grimoire · 2024-02-19T12:15:17Z

ChatGLM3 shares most codes with ChatGLM2.

RMSNorm has been replaced with our triton kernel.

Fix batched inference.

Caution

I am not able to add model template in lmdeploy/model.py since the tokenizer can not encode special token to the special ids.

   def encode(self, s: str, bos: bool = False, eos: bool = False) -> List[int]:
       assert type(s) is str
       t = self.sp_model.encode(s)
       if bos:
           t = [self.bos_id] + t
       if eos:
           t = t + [self.eos_id]
       return t

   def decode(self, t: List[int]) -> str:
       text, buffer = "", []
       for token in t:
           if token in self.index_special_tokens:

               if buffer:
                   text += self.sp_model.decode(buffer)
                   buffer = []
               text += self.index_special_tokens[token]
           else:
               buffer.append(token)
       if buffer:
           text += self.sp_model.decode(buffer)
       return text

lvhan028 · 2024-02-22T12:57:38Z

May remove the chat template in model.py if it doesn't work

lvhan028

LGTM

lvhan028 · 2024-02-23T06:41:43Z

The chat template of chatGLM models is still an issue. @AllentDan may follow it up.

AllentDan · 2024-02-23T07:53:48Z

The chat template of chatGLM models is still an issue. @AllentDan may follow it up.

Encoding may not be the hardest part. It also influences incremental decoding.

support chatglm3

d949136

grimoire added the improvement label Feb 20, 2024

lvhan028 requested review from AllentDan and lvhan028 February 22, 2024 12:57

grimoire added 2 commits February 23, 2024 11:01

support chatglm3

a828a68

Merge branch 'main' into torch-chatglm3

5950f61

lvhan028 approved these changes Feb 23, 2024

View reviewed changes

lvhan028 added enhancement New feature or request and removed improvement labels Feb 23, 2024

AllentDan approved these changes Feb 23, 2024

View reviewed changes

lvhan028 merged commit 65d94f6 into InternLM:main Feb 23, 2024
4 checks passed

lvhan028 mentioned this pull request Feb 27, 2024

请问会支持chatglm3吗？ #1147

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

torch engine support chatglm3 #1159

torch engine support chatglm3 #1159

grimoire commented Feb 19, 2024

lvhan028 commented Feb 22, 2024

lvhan028 left a comment

lvhan028 commented Feb 23, 2024

AllentDan commented Feb 23, 2024

torch engine support chatglm3 #1159

torch engine support chatglm3 #1159

Conversation

grimoire commented Feb 19, 2024

lvhan028 commented Feb 22, 2024

lvhan028 left a comment

Choose a reason for hiding this comment

lvhan028 commented Feb 23, 2024

AllentDan commented Feb 23, 2024