Geometry Informed Tokenization of Molecules for Language Model Generation
摘要
评审与讨论
This paper proposes Geo2Seq, a 3D molecule tokenization method for 3D molecular generation. The authors convert molecules (in 3D space) to 1D sequences while preserving SE(3) invariance, and then train a molecule generative model based on language model architecture. Geo2Seq equipped with various language models show superior performance in molecule generation tasks.
update after rebuttal
I confirm that I have read the rebuttal and finalized my evaluation.
给作者的问题
- Did the authors train GPT and Mamba from scratch or pre-trained checkpoints?
- Can Geo2Seq be applied to text-to-molecule generation tasks, e.g., ChEBI-20 dataset? Such results will further highlight the effectiveness of using language models.
论据与证据
The claims made in the submission are supported by clear and convincing evidence.
方法与评估标准
Proposed methods and/or evaluation criteria (e.g., benchmark datasets) make sense for the problem or application at hand.
理论论述
The theoretical claims seem correct for me (at least at a high level).
实验设计与分析
I checked the soundness/validity of the experimental designs or analyses.
补充材料
I checked the supplementary material (especially additional experiments).
与现有文献的关系
The key contribution of this paper is to relate 3D molecule generation and language models. Previous 3D molecule generation relies on 3D graph generation technique, however, such graph generation tasks show poor performance due to the under-explored 3D graph architectures. This paper shows such drawbacks can be alleviated by tokenizing 3D molecules to 1D sequences, which can be directly incorporated with well-developed language models.
遗漏的重要参考文献
As far as I know, there exist some molecule 3D molecule tokenizing methods, e.g., 3D-MolT5: Towards Unified 3D Molecule-Text Modeling with 3D Molecular Tokenization [Pei et al., 2024] and Tokenizing 3D Molecule Structure with Quantized Spherical Coordinates [Gao et al., 2024], which can be discussed in this paper.
其他优缺点
Strengths
- The problem of interest, 3D molecule tokenization, is important for molecular domain and poses a potential for real-world applications, e.g., drug discovery.
- Proposed method seems reasonable; tokenizing 3D spherical coordinates in an SE(3)-invariant manner.
- The improvements in controllable generation are impressive; Geo2Seq highly outperforms previous 3D graph-based molecule generation techniques.
Weaknesses
- Scalability of Geo2Seq. Compared to graph-based models (which accepts continuous values), discretization technique in Geo2Seq may limit the performance when the molecules become large. The results in Table 1 also show that Geo2Seq works well on smaller molecules (QM9) but not quite well on larger molecules (GEOM).
其他意见或建议
Comments
- Representation can be further improved. Essential experimental ablations are deferred to the supplements, e.g., Table 4. I think the method section can be shortened, e.g., discussion about spherical coordinates.
Typos
L46, right column: "subsequent LMs used. and can seamlessly" -> "subsequent LMs used, and can seamlessly"
L194, left column: "can be be proved" -> "can be proved"
Dear Reviewer CPUN,
Thank you for your appreciation of our work and insightful comments! We have made efforts to thoroughly improve our work accordingly and provide responses for each concern here. Please also refer to our added experiments in this Link and our responses to other reviewers.
Some existing molecule 3D molecule tokenizing methods
- Thank you for the advice. We have included the discussions in the paper revision and also briefly discuss here.
3D-MolT5: Towards Unified 3D Molecule-Text Modeling with 3D Molecular Tokenization: 3D-MolT5 focuses on text-based molecule-related downstream tasks, including molecular property prediction, 3D molecule captioning, and text-based molecule generation. It does not consider 3D molecular generation tasks. thus we do not adopt it as a baseline method. 3D-MolT5 is designed to handle 3D-dependent tasks with text instructions using the T5 model. Its tokenization method is based on the Extended 3D Fingerprint (E3FP) algorithm, where the embeddings of the same atom in both 1D and 3D tokens are summed to form the final joint representation.Tokenizing 3D Molecule Structure with Quantized Spherical Coordinates: This is a concurrent work with our submission, submitted to arXiv in Dec 2024, thus we do not adopt it as a baseline method. This work uses SMILES and coordinates to build sequences and a VQ-VAE model to discretize the continuous coordinates. Compared to our method, this work uses VQ-VAE to learn structure tokens, which lacks guarantee of structural completeness.- In summary, our method differs by (1) extending canonical labeling to encode 3D structural isomorphism, (2) enabling reversibility between sequences and 3D isomorphic structures, and (3) establishing theoretical guarantees of structural completeness and geometric invariance. We believe our formulation complements theirs, and we thank the Reviewer for encouraging this discussion.
W1: Scalability of Geo2Seq
- Thank you for the point. This is an interesting question we have been thinking as well. Indeed, discretization could impose limits on very large molecules due to increased sequence length and reduced resolution. However, on GEOM-DRUGS dataset, Geo2Seq still achieves competitive performances. Moreover, theoretically, if we use larger vocabulary size, the discretization would not be a limitation for scalability. Larger molecules only bring longer context length of upto 750, which can be handled given the capability of LLMs.
- To our understanding, the reason for the suboptimal performances on GEOM-DRUGS could be that the size of the GEOM-DRUGS dataset requires larger LMs than what we are using. The QM9 data size is ~100k and we are using ~90M parameter LMs, while GEOM ~7M and we are using ~100M parameter LMs. While this benefits efficiency, GEOM-DRUGS might perform optimally with larger model sizes. Due to the time limit of the rebuttal, we will explore further improving the performance on GEOM-DRUGS dataset with larger LLMs in the future.
Paper Representation
- Thank you for the suggestion! We have shortened the discussion in Sec.3 and moved essential experimental ablations including Table 4 to the main text.
Typos
- Thank you for pointing out! We have revised the paper and corrected the typos.
Q1: train from scratch or pre-trained?
- We train from scratch. We propose a molecular tokenization method different from the NLP tokenization used by GPT and Mamba. Pre-trained checkpoints need to be used together with the pre-training tokenizer, thus not applicable in our setting. In addition, those checkpoints include little molecular 3D structural knowledge, thus not suitable for our molecular tasks.
Q2: text-to-molecule generation tasks?
- Thank you for your insightful comments! We appreciate your suggestion to explore the interesting applicability of our Geo2Seq towards text-molecule tasks. Indeed, Geo2Seq be applied to text-to-molecule generation tasks. The main difference would be extending the tokenization with for text, which can be enabled with the BPE or SentencePiece tokenizer. On the other hand, text-to-molecule generation tasks poses a different setting with various baselines such as MolT5 and LDMol. We believe this opens a promising future direction and we will include the discussion in the paper revision. Within the time limit of the rebuttal, we focus on the field of 3D molecule generation in this work and leave this exploration for future work.
We sincerely thank you for your time! Hope we have addressed your concerns through practical efforts and shown the contributions and significance of our work. We look forward to your reply and further discussions, thanks!
Sincerely,
Authors
Thank you for the response. At this moment, I do not have further questions, and I lean towards acceptance of this paper.
Dear Reviewer CPUN,
Thank you for your acknowledgment and response. We are grateful for your appreciation of our work and glad to know that we have resolved all your questions. We sincerely thank you for your time and efforts!
Sincerely,
Authors
This paper explores the use of language models (LMs) for generating 3D molecules, a task that has previously been challenging due to the complex geometric structure of molecules. The paper proposes a novel tokenization method called Geo2Seq, which converts 3D molecular structures into SE(3)-invariant 1D discrete sequences that can be effectively processed by LMs. Results show that with various state-of-the-art 3D molecule generation methods, including diffusion-based models like EDM and GEOLDM. Geo2Seq achieves comparable or better results in terms of atom stability, molecule stability, and valid percentage, especially in controlled generation tasks.
给作者的问题
Please see the above sections.
论据与证据
-
Generalizability to Continuous Domains: The paper acknowledges the limitations of the discrete tokenization approach in terms of generalization to the continuous domain of real numbers. However, the analysis provided is limited, and more evidence in various datasets is needed to demonstrate the extent of this limitation and potential solutions.
-
Uniqueness of Generated Molecules:
The paper reports a lower uniqueness percentage for Geo2Seq compared to diffusion-based methods on the QM9 dataset. This could be due to several factors, including the tokenization approach and the size of the dataset. Further investigation is needed to understand the underlying reasons and explore ways to improve uniqueness.
- Error Case Analysis:
The provided error case analysis is limited and focuses on specific examples. A more comprehensive analysis of different types of errors (e.g., syntax errors, repetition, hallucinations) and their causes is needed to better understand the robustness and limitations of the approach.
方法与评估标准
- Geo2Seq Tokenization: The use of canonical labeling and invariant spherical representations is a reasonable approach for converting 3D molecular structures into a format suitable for LMs. However, the paper lacks a comprehensive comparison with alternative tokenization methods, such as graph-based representations directly used with graph neural networks. A more thorough comparison would provide a better understanding of the advantages and limitations of Geo2Seq.
- Language Models: The choice of GPT and Mamba as LMs is reasonable, given their strong sequence modeling capabilities. However, the paper does not explore the potential of larger LMs, like llama and qwen.
- Evaluation Metrics: The paper focuses on atom stability, molecule stability, and valid percentage as primary evaluation metrics. While these metrics are important, they do not fully capture the quality and diversity of the generated molecules. Consider incorporating additional metrics, such as novelty, structural diversity, and property prediction accuracy, to provide a more comprehensive evaluation.
理论论述
-
Validity of the 3D Graph Isomorphism Definition: The paper extends the concept of graph isomorphism to 3D graphs, which is not a standard definition. While the paper provides a definition, it is crucial to establish the validity and soundness of this definition in the context of 3D molecular structures. The proof should clearly justify the extension and demonstrate its consistency with existing graph theory concepts.
-
Completeness of the Proof: The proof in the Appendix seems to focus on demonstrating the sufficiency of the Geo2Seq mapping (i.e., if two molecules are 3D isomorphic, their sequences are identical). However, it does not explicitly address the necessity (i.e., if two molecules have the same sequence, they must be 3D isomorphic). This is a crucial aspect of a bijective mapping, and a more comprehensive proof is needed to establish both sufficiency and necessity.
实验设计与分析
- Comparison with Alternative Tokenization Methods: The paper primarily focuses on comparing Geo2Seq with state-of-the-art 3D point cloud based methods. However, it lacks a comprehensive comparison with alternative tokenization methods, such as graph-based representations directly used with graph neural networks. This limits the understanding of the advantages and limitations of Geo2Seq compared to other approaches.
- Additional Metrics: The evaluation focuses on atom stability, molecule stability, and valid percentage. While these metrics are important, they do not fully capture the quality and diversity of the generated molecules. Incorporating additional metrics, such as novelty, structural diversity, and property prediction accuracy, would provide a more comprehensive evaluation of the generated molecules.
- Comparison with Pre-trained Models: The paper compares the performance of Geo2Seq with models that are not pre-trained on chemical datasets. Exploring the impact of pre-training on the performance of Geo2Seq would be valuable for understanding its effectiveness in leveraging knowledge from large chemical databases.
补充材料
Yes. I have checked the Appendix.
与现有文献的关系
Yes. By applying the proposed methods to generate a more accurate 3D coordinates of molecules, researchers could better find appropriate drug candidates.
遗漏的重要参考文献
As far as I know, there is no essential reference not discussed.
其他优缺点
Strengths:
- Originality: The paper presents a novel approach to 3D molecule generation using language models, which is a relatively unexplored area. The use of Geo2Seq for converting 3D molecular structures into 1D sequences is a creative combination of ideas from graph theory and language modeling.
- Potential Impact: The approach has the potential to significantly impact the field of 3D molecule generation, particularly in drug discovery and materials science. The ability to efficiently generate valid and diverse 3D molecules with desired properties could revolutionize these fields and enable the discovery of new drugs and materials.
- Clarity: The paper is generally well-written and provides a clear explanation of the approach and its benefits. The figures and tables are helpful in illustrating the key concepts and results.
Weaknesses:
- Limited Comparison with Alternative Methods: The paper primarily focuses on comparing Geo2Seq with state-of-the-art 3D point cloud based methods. A more comprehensive comparison with alternative tokenization methods and graph-based approaches would provide a better understanding of the advantages and limitations of Geo2Seq.
- Limited Evaluation of Controlled Generation: The evaluation of controlled generation tasks is limited to specific quantum property values. Exploring the generalizability and robustness of controlled generation across different conditions would be valuable.
- Limited Discussion of Limitations: The paper acknowledges the limitations of the discrete tokenization approach but does not provide a detailed analysis of these limitations and potential solutions. A more thorough discussion of the limitations and their impact on the performance would be beneficial.
其他意见或建议
Figure Captions: Some figure captions could be more informative and provide a clearer explanation of the content. For example, Figure 2 would benefit from a more detailed description of the equivariant frame and invariant spherical representations.
Dear Reviewer aNgc,
Thank you for your appreciation of our work and insightful comments! We address each point here and added experiments are in this Link.
Real number generalizability
We have conducted more studies on this point.
- LMs require tokenization which limits number resolution. To mitigate this, we study flexible number-tokens and show even with coarse discretization, our model maintains strong performance. Please see experiments in
[Link] Table 1. - We also analyze different discretization schemes. See experiments in
Appx C Table 5and[Link] Table 2. - We can extend to additional datasets in future work.
Uniqueness of QM9
- For the uniqueness of QM9, we believe it is because conversion from real numbers to discrete tokens limits 3D search space, especially on small datasets like QM9. Evidence is that we achieves 99.77% uniqueness on GEOM-DRUGS. This reflects that richer database or vocabulary enlarge search space of 3D structures and enhance uniqueness.
- Moreover, following EDM, we emphasize validity/stability, thus setting temperature=0.7. This can be adjusted for validity-diversity trade-off. We can easily improve uniqueness with larger temperature (e.g., temp=1.0 enhances uniqueness from 81.7% to 86.5%).
Error Case
- We now include error case studies. Please see
[Link] Table 7. - We also conduct quantitative studies. Among 100 error cases, 61% stem from incorrect distance-angle combinations, 25% geometric inconsistencies, and 14% subsequences repetition. Errors are rarer if model well converges: when trained for 150 and 250 epochs, the model generates ~15% and <2% invalid samples, respectively.
Comparison with graph-based methods
- Previously we focus on comparing with SoTA methods. Here we compare with more graph-based methods, as in
[Link] Table 3. - Meanwhile, we clarify that our baselines also directly use GNN representations, where GNNs are backbones while other techniques enable generation. E-NF is based on E(n)-GNN, G-Schnet uses GNN SchNet, EDM and GEOLDM parametrized by EGNN, and GDM by non-equivariant MPNN.
LMs like llama/qwen
- We provide experiments extending Geo2Seq with LLaMA. Please see
[Link] Table 4andour Response to Reviewer 39Ws.
More Metrics
Thanks for the advice. We already include these metrics in Appx. D.
Table 6, 7, 8, 9, 10, 11 in Appx. Dreport evaluations including novelty, structural diversity metrics, distribution distance metrics, etc. Property prediction accuracy is asTable 2 of the paper. Vast results show we can outperform other methods across various metrics.
Def 3.4 Validity
We clarify its soundness regarding theoretical alignment with graph theory and physical relevance in molecular modeling.
- Our definition builds upon colored graph isomorphism. Lemma 3.2 show attributed CL retains bijectivity and different-attributed molecules cannot be conflated under this formulation. This is consistent with graph matching literature. We extend to 3D graphs requiring valid SE(3) transformation mapping coordinate matrices. It is formalized in Def 3.4 and supported by Lemma 3.3 & Thm 3.5. Our definition corresponds to label- and coordinate-preserving isomorphism.
- To further substantiate, it aligns with physical indistinguishability in chemistry, where two conformers differing only by spatial orientation are considered identical. While our formulation is novel, it mirrors physical practices in geometric deep learning, which treat structures up to SE(3) equivalence.
Proof Completeness
- We clarify that our proofs do include sufficiency & necessity explicitly. Thm 3.5: lines 883-899 prove sufficiency, and lines 900-928 necessity. Cor 3.6: lines 967-987 sufficiency and 988-1021 necessity.
Pretraining
- We clarify that we do explore the impact of pretraining. See
Appx D.5 and Table 13.
Limited Controlled Generation
- We focus on quantum properties because they are available conditions relating 3D information. Here we provide more experiments exploring generalizability across conditions.
- The first experiment studies generalization across multiple conditions (see
[Link] Table 5). The second experiment tests generalization to unseen conditions (see[Link] Table 6). While generalization brings challenges, we can capture certain multi-condition knowledge with robustness.
Limited Limitations
- We now include a detailed limitations section. Key challenges include discretization loss, high-precision 3D geometry generalization, and solutions include tokenizer learning with continuous embeddings or vector quantized codes.
Figure Captions
- We have included more informative captions. For Figure 2, see
[Link] Figurefor updated caption.
Thank you for your time! Hope we have addressed your concerns with practical efforts and shown our work’s significance. We look forward to further discussions.
Sincerely,
Authors
The paper proposes a method called Geo2Seq to generate 3D molecules using language models. The authors convert each molecule into an SE(3)-invariant discrete sequence of tokens—one token per atom, with tokens containing both atom type and spherical-coordinate information. Once converted to a sequence, any language model can be trained to produce new molecular sequences, which are then mapped back to 3D structures.
给作者的问题
Please refer to other sections.
论据与证据
Much of the theoretical proof (canonical labeling correctness, SE(3)-invariant spherical coordinates) is already known from prior literature on graph isomorphism and spherical transformations. The paper does not fully demonstrate how they guarantee stronger empirical generation beyond giving a valid, lossless tokenization. The paper would benefit from more ablation: e.g., do we see the same advantage if we do a simpler coordinate encoding?
方法与评估标准
The tokenization for unconditional and property-conditional generation does make sense: the method inserts property tokens into the same sequence, so an LM can learn to produce geometries with certain property values.
理论论述
No obvious errors stand out.
实验设计与分析
I believe it remains unclear whether the LM is capturing fundamental 3D chemistry knowledge or if it is mostly reproducing token patterns it has memorized. More analyses, like measuring internal geometry consistency or testing on molecules that deviate strongly from training data, would strengthen the claims.
补充材料
Yes, I have checked all additional experiments.
与现有文献的关系
The paper was built based on prior works on 3D generation and proposed a geometry-aware tokenization to do generation tasks.
遗漏的重要参考文献
NA
其他优缺点
Strengths:
- The proposed method is model-agnostic and requires no special modifications to model architectures.
- The method preserves SE(3)-invariance.
- The paper is well-written and easy to follow.
- The paper conducts comprehensive experiments.
其他意见或建议
NA
Dear Reviewer hgAt,
Thank you for your appreciation of our work and insightful comments! We have made efforts to thoroughly improve our work accordingly and provide responses for each concern here. Please also refer to our added experiments in this Link and our responses to other reviewers.
Claims And Evidence
- Thanks for your valuable comments. Beyond standing as a valid lossless tokenization, Geo2Seq has SE(3)-invariance and adopt spherical design, which can significantly benefit in achieving stronger empirical generation. We have conducted comprehensive ablation studies here. We explain in detail below.
- In
Ablation on 3D representation of Appx. C, we explore using simpler or different coordinate encodings to represent 3D molecular structures. We compare the spherical coordinates in Geo2Seq with directly using the 3D Cartesian coordinates from xyz data files. We also study whether normalizing the xyz coordinates is effective by subtracting the xyz coordinates with the mass-center coordinates of each molecule. Additionally, we compare with using the SE(3)-invariant Cartesian coordinates that are projected to the equivariant frame proposed in Section 3.2. We also explore adopting to manage distances in a more local scheme, which reduces the scale of the distances. - Results in
Table 4 of Appx. Cdemonstrate that LLMs achieve the best performance on spherical coordinates, showing the superiority of invariant spherical coordinates over invariant Cartesian coordinates. We believe this is due to that the numerical values of distances and angles of spherical coordinates lie in a smaller region than coordinates, which reduces outliers and makes it easier for LLMs to capture their correlation. From these empirical results, we can analyze that the representation of azimuth and polar angles has brought sufficient advantage for LM learning over Cartesian coordinates, thus spherical representations with both distance schemes are showing promising performances. - In Sec 3.2, we discuss the advantage of spherical coordinates. Compared to Cartesian coordinates, spherical coordinate values are bounded in a smaller region, namely, a range of /. Given the same decimal place constraints, spherical coordinates require a smaller vocabulary size, and given the same vocabulary size, spherical coordinates present less information loss. This makes spherical coordinates advantageous in discretized representations and thus easier to be modeled by LMs.
Experimental Designs Or Analyses
Thanks for your comment. This is an important point and we have conducted various experiments and analyses including measuring geometry consistency and novelty to verify the capabilities of the method.
Table 6 and 7reports further random/controllable generation results including novelty metrics. Results show that our method achieves reasonably high novelty scores, which demonstrates that our method is not simply memorizing or reproducing token patterns.- Also, experiments indicate that our generated molecules satisfy the internal geometry consistencies well. In addition to our main validity/stability results, we provided more evaluation results on various chemical metrics in
Appx. D, Table 6, 7, 8, 9, 10, and 11, including diversity metrics, distribution distance metrics, bond lengths and angles, reasonable internal energy, steric hindrance, etc. Vast results show that we can outperform existing methods across metrics regarding various chemical constraints and geometry consistencies. - In addition, in
Appendix F.2, we provided UMAP visualizations of learned (atom type, distance, and angle) token embeddings, which indicates that the model has successfully learned structure information in 3D space. Figure 8 shows similar angle tokens (e.g., 1.41° and 1.42°) are placed next to each other, overall structure of all angles is a loop, and π-out-of-phase angles (3.14°, -3.14°, and 0°) are placed near each other. For atom type tokens, the model appears to capture the structure of the periodic table. - We believe the current results are sufficient to prove the correctness and capabilities of our design. Geo2Seq with LMs can model 3D molecular structure distribution and capture the underlying chemical rules.
We sincerely thank you for your time! Hope we have addressed your concerns through practical efforts and shown the contributions and significance of our work. We look forward to your reply and further discussions, thanks!
Sincerely,
Authors
The paper proposes Geo2Seq, which transforms molecular geometries into SE(3)-invariant discrete sequences for molecule generation. Existing language model-based molecule generation works do not consider the 3D molecular geometries in the tokenization process. The proposed paper address this limitations and show that the proposed Geo2Seq improves the performance in molecule gnerations.
给作者的问题
Please refer the above sections.
论据与证据
The paper claims that the tokenization with preserving 3d molecular graph information improves the quality of molecule generation. The claim is well supported by theoretical analysis and experimental results.
方法与评估标准
The methods and evaluation criteria make sense for the problem from my perspective.
理论论述
I've checked the correctness of the proofs in the paper and it seems to be correct from my side.
实验设计与分析
The experimental designs and analyses are valid.
补充材料
I've checked the appendix.
与现有文献的关系
The paper seems novel to me. Even though a recent paper proposes tokenization techniques considering 3D molecular geometric information, it limits its scope to text generation. Different from it, Geo2Seq is designed to generate molecules with theoretical analysis. So, I think it is novel.
遗漏的重要参考文献
I think that the essential references are discussed in this paper.
其他优缺点
Strengths
- The paper is well written and easy to follow.
- The paper well demonstrates the effectiveness of the proposed method Geo2Seq with theoretical analysis and experimental results.
- The visualization map in Figure 6 is interesting to me.
Weaknesses
- I cannot find crucial weaknesses in this paper.
其他意见或建议
- I'm wondering the performance of the proposed Geo2Seq with larger or recent language models such as LLaMa.
Dear Reviewer 39Ws,
Thank you for your appreciation of our work and insightful comments! We have made efforts to thoroughly improve our work accordingly and provide responses for each concern here. Please also refer to our added experiments in this Link and our responses to other reviewers.
Geo2Seq with larger or recent language models such as LLaMa
- Thanks for your valuable comments. We would like to provide more experimental results extending Geo2Seq to LLaMA. Due to the limited time of the rebuttal, here we provide the results of Geo2Seq with LLaMA for the controllable generation task, which have a smaller training data size of 50K. We will update the results of Geo2Seq with larger LLaMA/Qwen models as well as for other tasks and hyperparameter settings in the paper revision later. In scale with our data size, we use the LLaMA implementation from HuggingFace with 768 hidden size, 8 hidden layers, and 8 attention heads. The setting is the same as GPT used in the paper. We train the model for 200 epochs.
| Property (Units) | α (Bohr³) | Δε (meV) | ε_HOMO (meV) | ε_LUMO (meV) | μ (D) | C_v (cal/mol·K) |
|---|---|---|---|---|---|---|
| Data | 0.10 | 64 | 39 | 36 | 0.043 | 0.040 |
| Random | 9.01 | 1470 | 645 | 1457 | 1.616 | 6.857 |
| GEOLDM | 2.37 | 587 | 340 | 522 | 1.108 | 1.025 |
| Geo2Seq with Mamba | 0.46 | 98 | 57 | 71 | 0.164 | 0.275 |
| Geo2Seq with GPT | 0.53 | 102 | 48 | 53 | 0.097 | 0.325 |
| Geo2Seq with LLaMA | 0.71 | 102 | 51 | 98 | 0.324 | 0.357 |
- As shown above, Geo2Seq with LLaMA achieves significantly better results over the baselines and performs similarly as Geo2Seq with GPT for most properties, without careful tuning of hyperparameters. This is expected given the functional architectures of LLaMA and GPT are very similar. We have included the results and discussions in the paper revision.
We sincerely thank you for your time! Hope we have addressed your concerns through practical efforts and shown the contributions and significance of our work. We look forward to your reply and further discussions, thanks!
Sincerely,
Authors
The paper introduces a new method, Geo2Seq, for molecule generation and appropriately provides theoretical analysis and comprehensive experimental evaluation. It is presented clearly and is intuitive. The addressed problem is very important in structural biology and drug design. The method is original and has the potential to be impactful. All reviewers recommend acceptance and the AC agrees. Thanks to the authors for this good submission. Please include the feedback in the rebuttal to the main paper.