5.5

/10

Rejected4 位审稿人

最低5最高6标准差0.5

3.8

置信度

ICLR 2024

EquiPocket: an E(3)-Equivariant Geometric Graph Neural Network for Ligand Binding Site Prediction

yang zhang,Wenbing Huang,Zhewei Wei,Ye Yuan,Chongxuan Li

OpenReview PDF

提交: 2023-09-19更新: 2024-02-11

摘要

关键词

Pocket DetectionBinding Site PredictionGraph Neural NetworkDrug Discovery

评审与讨论

审稿意见

评分: 5置信度: 42023-10-31

The paper introduces an e(3)-equivariant geometric graph neural network for ligand binding site prediction. This model consists of three modules including the local/global geometric modeling module and surface message passing module, which are used to address several issues of the previous approach. The author compares their method with other methods and shows it has the best performance in terms of model parameters efficiency and accuracy.

优点

The writing is generally great and the description of the proposed method is clear.
The discussion regarding the protein size shift is great.

缺点

Geometric aware E(3)-equivariant GNN has been applied to several very related tasks like protein-ligand docking [1]. Consequently, the novelty of introducing E(3)-equivariant GNNs may be diminished.
I think it might be also necessary to compare with protein-ligand docking methods as binding site prediction is one of their outputs.

[1]: Zhang, Yangtian, et al. "E3bind: An end-to-end equivariant network for protein-ligand docking." arXiv preprint arXiv:2210.06069 (2022).

问题

It would be better to also compare the inference speed of different methods. Because this approach does massage passing on full protein atoms graph, it appears the inference speed would be very slow.
The setting for some baseline methods (EGNN, SchNet) needs to be more clear. For example, what kinds of graphs do you use for the EGNN? Do you also use the surface atom graph?
A more informative ablation would involve comparing EquiPocket, EquiPocket/L, EquiPocket/R, and EquiPocket/LR, where the symbol '/' means exclude. This is because L and R appear to be two feature extractors.

评论- Part 2/2 of Response to Reviewer fzz6

2023-11-19

Q2: The setting for some baseline methods (EGNN, SchNet) needs to be more clear. For example, what kinds of graphs do you use for the EGNN? Do you also use the surface atom graph?

We focused on using EGNN[10] and SchNet[11] solely on the protein structure graph for two main reasons:

Both EGNN and SchNet were originally developed for representing molecular nodes and structures, making them ideal to test whether solely taking protein structural information is sufficient for predicting ligand binding sites.

Our method EquiPocket differs significantly from previous prediction method [4, 5, 6, 7, 8] by employing surface atom graph, which represents an innovative contribution and differentiates from the conventional protein structure graph as baseline models.

Q3: A more informative ablation would involve comparing EquiPocket, EquiPocket/L, EquiPocket/R, and EquiPocket/LR, where the symbol '/' means exclude. This is because L and R appear to be two feature extractors.

Our model predominantly comprises two feature extractors: local geometric modeling module and global structural modeling module, subsequently followed by the surface-EGNN model. In response to your valuable suggestion, we propose the following definitions:

EquiPocket/L: This variant of EquiPocket excludes local geometric modeling module.

EquiPocket/R: This variant of EquiPocket excludes global structural modeling module.

EquiPocket/LR: This variant of EquiPocket excludes both local geometric modeling module and global structural modeling module.

Dataset		COACH420		HOLO4k		PDBbind
Model	Fail Ratio	DCC	DCA	DCC	DCA	DCC	DCA
EquiPocket/L	0.13	0.355	0.546	0.296	0.574	0.465	0.606
EquiPocket/R	0.09	0.364	0.541	0.294	0.598	0.474	0.627
EquiPocket/LR	0.16	0.308	0.502	0.268	0.543	0.409	0.566
EquiPocket	0.05	0.423	0.656	0.337	0.662	0.545	0.721

Analysis shows that omitting any of these modules negatively impacts performance. Specifically, excluding either the local geometric (L) or global structural (R) module leads to a 10%-15% decrease in DCC/DCA metrics; removing both L and R modules results in a more significant drop of 20%-25%. These results highlight the essential role of both feature extractors in predicting ligand binding sites. Notably, the more pronounced performance decline when omitting the local geometric module (L) suggests its higher importance in protein pocket prediction. This finding is consistent with current trends where methods like Fpocket[5], P2rank[4], and DeepSurf[7] primarily utilize geometric features for binding site prediction.

Reference

[1] Zhang Y, Cai H, Shi C, et al. E3bind: An end-to-end equivariant network for protein-ligand docking[J]. arXiv preprint arXiv:2210.06069, 2022.

[2] Lu W, Wu Q, Zhang J, et al. Tankbind: Trigonometry-aware neural networks for drug-protein binding structure prediction[J]. Advances in neural information processing systems, 2022, 35: 7236-7249.

[3] Liao Z, You R, Huang X, et al. DeepDock: enhancing ligand-protein interaction prediction by a combination of ligand and structure information[C]//2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 2019: 311-317.

[4] Krivák R, Hoksza D. P2Rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure[J]. Journal of cheminformatics, 2018, 10: 1-12.

[5] Le Guilloux V, Schmidtke P, Tuffery P. Fpocket: an open source platform for ligand pocket detection[J]. BMC bioinformatics, 2009, 10(1): 1-11.

[6] Stepniewska-Dziubinska M M, Zielenkiewicz P, Siedlecki P. Improving detection of protein-ligand binding sites with 3D segmentation[J]. Scientific reports, 2020, 10(1): 5035.

[7] Mylonas S K, Axenopoulos A, Daras P. DeepSurf: a surface-based deep learning approach for the prediction of ligand binding sites on proteins[J]. Bioinformatics, 2021, 37(12): 1681-1690

[8] Jiménez J, Doerr S, Martínez-Rosell G, et al. DeepSite: protein-binding site predictor using 3D-convolutional neural networks[J]. Bioinformatics, 2017, 33(19): 3036-3042.

[9] Tubiana J, Schneidman-Duhovny D, Wolfson H J. Scannet: A web server for structure-based prediction of protein binding sites with geometric deep learning[J]. Journal of Molecular Biology, 2022, 434(19): 167758.

[10] Satorras V G, Hoogeboom E, Welling M. E (n) equivariant graph neural networks[C]//International conference on machine learning. PMLR, 2021: 9323-9332.

[11] Schütt K T, Sauceda H E, Kindermans P J, et al. Schnet–a deep learning architecture for molecules and materials[J]. The Journal of Chemical Physics, 2018, 148(24).

评论- Part 1/2 of Response to Reviewer fzz6

2023-11-19

Thank you for your valuable feedback! To address your concerns, we have included additional explanations as follows.

W1: Geometric aware E(3)-equivariant GNN has been applied to several very related tasks like protein-ligand docking [1]. Consequently, the novelty of introducing E(3)-equivariant GNNs may be diminished.

Thanks for the comment. There could be a misunderstanding regarding our research task. While it's true that E(3)-equivariant GNNs are used in tasks such as protein-ligand docking (like E3bind [1]), our EquiPocket focuses on protein pocket prediction, a distinct task from docking.

Docking methods like E3Bind [1] and TankBind [2] predict protein-ligand binding poses and affinities, often using method like P2rank [4] to first identify candidate binding sites (which is also called protein pocket in our context). Our EquiPocket and other related works [5, 6, 7, 8, 9] share the same goal as P2rank, identifying the center of candidate binding sites on the protein rather than locating the exact interface between each ligand and the target protein. The area around this center is seen as protein pocket for downstream tasks such as protein-ligand docking that will specify more precise protein-ligand interface around the predicted protein pocket. We have studied P2Rank[4] in our paper (Section A3.1).

Most current deep learning methods [6, 7, 8] for protein pocket prediction use CNNs and require voxelization of protein structures. EquiPocket innovatively uses a geometric-aware E(3)-equivariant GNN, which is a unique model unexplored by other research in this field. As also noted by Reviewer 9vNs, "the idea of using an E(3)-equivariant GNN for binding site prediction is of interest especially in the bioinformatic domain."

W2: I think it might be also necessary to compare with protein-ligand docking methods as binding site prediction is one of their outputs.

Thank your for the comment. Again, we highlight that protein-ligand docking and protein pocket prediction are two different tasks. A general pipeline in current methods (E3Bind [1] and TankBind [2]) is that we first predict the protein pocket, and then conduct protein-ligand docking and infer how specific ligands interact with the atoms around the predicted pocket. Although the protein-ligand docking methods also output binding site, this area is a fine-grained subset of the atoms around the protein pocket. Hence, it is not reasonable to compare with protein-ligand docking methods. We have further introduced the difference between these two tasks in the revision.

Q1: It would be better to also compare the inference speed of different methods. Because this approach does massage passing on full protein atoms graph, it appears the inference speed would be very slow.

Thank you for the feedback. The comparison of various methods for predicting 100 proteins reveals the following:

Method	Type	Time (s) per 100 proteins	Average DCC
fpocket	Geometric-based	23	0.214
Kalasanty	3D-CNN	86	0.321
DeepSurf	3D-CNN	641	0.366
EquiPocket	Ours	37	0.431

Dataset	Average Atom Num	Average Atom in Surface	Average the True Center of Binding Sites
COACH420	2123	1217	1.2
HOLO4K	3845	2052	2.4
PDBbind	3104	1677	1

fpocket[5]: Fastest with only 23 seconds for 100 proteins, leveraging manually defined geometric features. However, its performance metrics are not notable.

Kalasanty[6] and DeepSurf[7]: Both are 3D-CNN-based. DeepSurf, using detailed local grids on protein surfaces, outperforms Kalasanty in metrics but is slower and the least efficient among the methods compared.

EquiPocket: Our method takes 47 seconds per 100 proteins and shows the best DCC metrics. It's faster than 3D-CNN methods but slower than geometric-based ones. This is due to 3D-CNN methods transforming proteins into 3D images (eg. 36 * 36 * 36 grids~[6, 7, 8]), increasing computational costs compared to our using atom information (average 2000-3000 nodes in a protein). EquiPocket also integrates surface features with Surface-EGNN, enhancing efficiency over DeepSurf.

We have included these results and analyses in the revised paper. Thank you for your nice suggestion.

评论- Rebuttal Deadline Reminder

2023-11-23

Dear reviewer fzz6:

Thank you very much for your review.

I would like to kindly remind you that the rebuttal period is coming to an end. Could you please inform us if our responses have resolved your concerns, or if there are any other questions you need us to address?

2023-11-23

Thanks for providing additional details! They are quite valuable.

Thanks for pointing out that E3Bind results would be close to P2Rank. However, I've noticed that the revised manuscript lacks a direct comparison between your approach and P2Rank. I think such a comparison is important, especially considering that P2Rank has demonstrated a significant performance advantage over one of your baseline methods, DeepSite. So, I will keep my score.

评论- Urgent! New Rebuttal Response Reminder

2023-11-23

Dear reviewers and bros fzz6:

We have responded to your concerns and updated the revised paper.

May I ask if this has resolved your additional concern.

Thanks a lot.

2023-11-23

Thanks for adding these results!

It appears to me that this result is not robust enough to show your method is better than P2Rank as its performance is, in fact, lower than P2Rank. While I acknowledge that P2Rank may derive benefits from its more diverse training dataset, quantifying this advantage is less feasible. So I think maybe you need to either train your model on their dataset or re-train their model on your dataset.

It is great to know that your model is better than P2Rank (protrusion), indicating a potential superior ability to capture geometric information compared to P2Rank. However, it is not intuitively clear to me whether the additional information captured by your model isn't already accounted for by the other features used in P2Rank.

Thanks for your hard work! As the results don't actually address my concern, I will maintain the current score.

评论- Response to Reviewer fzz6

2023-11-23

Dear reviewers and bros fzz6:

Thank you for your reply.

I am somewhat heart-broken by this result, mainly because almost all deep learning models for ligand binding site prediction in the experiments did not compare with P2rank, affected by their data differences.

Nonetheless, I greatly appreciate your reply. Thanks a lot.

评论- Response to Reviewer fzz6

2023-11-23

Dear Reviewer fzz6:

Thank you very much for your reply.

We have studied P2Rank in our paper (Section A3.1). P2Rank[4] has great differences in training and validation data with deep learning methods including DeepSite[8], Kalasanty[7], DeepSurf[6], and our EquiPocket. Specifically, P2Rank uses data from CHEN11 and JOINED datasets, while deep learning methods commonly use scPDB[12]. P2Rank's paper mentions that CHEN11 is more diverse than scPDB, affecting model performance.The comparison results are as following:

DCA	COACH420	HOLO4K
P2Rank[protrusion]	0.642	0.593
P2rank	0.683	0.706
DeepSite	0.564	0.456
Kalasanty	0.636	0.515
deepsurf	0.658	0.635
EquiPocket	0.656	0.662

The results in above table show that our method essentially matches or surpasses most deep learning methods, even outperforming P2Rank [protrusion], which uses only geometric information, and slightly trailing behind P2Rank, which benefits from a more diverse dataset.
These results has been added to the revised paper.

Could you please inform us if this response have resolved your concerns, or if there are any other questions you need us to address?