
Unfair comparison between ProtBert and ESM #9

@ww-rm

Description

In the ProtTrans paper, the authors state:

> No auxiliary tasks like BERT's next-sentence prediction were used for any model described here.

But in PEER, the [CLS] token is used as the protein-level embedding for ProtBert. Since ProtBert was never trained with a sentence-level objective, its [CLS] token may not be able to represent the whole sequence.

For ProtBert, should we use the same strategy as for ESM (i.e., mean pooling over all residues) to get a fairer comparison?
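For reference, here is a minimal sketch of what mean pooling for ProtBert could look like, assuming the public HuggingFace checkpoint `Rostlab/prot_bert` (PEER may load the weights differently; the sequence and variable names are just for illustration):

```python
import re

import torch
from transformers import BertModel, BertTokenizer

# Public ProtBert checkpoint on HuggingFace (assumption; adjust to the
# weights PEER actually loads).
tokenizer = BertTokenizer.from_pretrained("Rostlab/prot_bert", do_lower_case=False)
model = BertModel.from_pretrained("Rostlab/prot_bert")
model.eval()

sequence = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"  # toy example
# ProtBert expects space-separated residues, with rare amino acids mapped to X.
spaced = " ".join(re.sub(r"[UZOB]", "X", sequence))

inputs = tokenizer(spaced, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state  # (1, L + 2, hidden_dim)

# Drop the [CLS] and [SEP] positions and average the residue states --
# the same pooling typically applied to ESM embeddings.
protein_embedding = hidden[0, 1:-1].mean(dim=0)
```

This would make the readout comparable to ESM's mean pooling, so any remaining performance gap reflects the pretrained representations rather than the choice of pooling.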
