We sincerely apologize for the typo in our initial rebuttal, which may have caused a misunderstanding of our method. We appreciate your careful reading and now provide clarifications and detailed responses below:

"What does represent?"

In our paper, denotes the default data distribution learned by the DM. refers to the distribution of local model parameters, and represents the conditional distribution of local model parameters given the client data .

Since the latter two distributions are not related to the diffusion model parameters , it is more appropriate to express them as and . We thank you for pointing out this inaccuracy, and we will revise the notation accordingly in the next version to improve precision and clarity.

"... is the second term still constant?"

Yes. As in our work and many other FL settings, once the local models are uploaded to the server, their parameters remain fixed during aggregation. Since the conditional distribution of the synthetic dataset relies on the fixed local model parameters , the 2nd term is also constant in our analysis.

"... the related work FedDEO is not properly cited in the context of the borrowed content, which significantly degrades the quality of this paper."

We sincerely apologize for not explicitly citing FedDEO at the point of theoretical borrowing. During manuscript preparation, we indeed referred to some of FedDEO’s theoretical formulations to enhance the logical rigor of our paper. We will make the citation explicit and properly acknowledge their contribution in the revised version.

It is important to emphasize that despite some theoretical similarities, our method differs significantly from FedDEO [1] in terms of practical design: FedDEO requires training based on the DMs on the clients, which obviously introduces substantial computation and communication costs. In contrast, our method significantly reducing the client burden. What's more, our method employs local models rather than additional prompts for guiding generation, eliminating the need for compositional diffusion, imposing a lower server computation cost. Below we provide a detailed comparison of model performance and computation costs between FedDEO, OSCAR[2] and our method, which will be included in the final version.

Server computation cost :

	FedDISC	FGL	FedDEO	OSCAR	FedLMG
flops (T)	135.71	102.83	101.78	67.85	38.87

Client accuracy comparison :

	client0	client1	client2	client3	client4	client5	average
FedLMG_FT	48.99	51.66	55.59	52.80	62.41	58.86	55.05
FedLMG_SD	47.60	55.20	61.54	61.83	67.07	59.90	58.86
FedLMG_MD	44.70	53.08	58.67	60.13	64.06	58.06	56.45
FedDEO	51.08	52.53	61.22	62.18	67.31	56.68	58.50
OSCAR	50.89	53.51	60.05	61.98	68.76	56.52	58.61

We once again thank you for your thoughtful review and valuable feedback. Your comments helped us clarify critical aspects of our method and recognize areas where our explanations and citations can be improved. We will revise the corresponding parts accordingly, enrich the paper with further comparisons, and refine our writing to enhance the overall completeness and rigor. We sincerely hope that these improvements will better convey the contributions and practicality of our work.

[1] FedDEO: Description-Enhanced One-Shot Federated Learning with DMs, MM 2024.
[2] One-Shot Federated Learning with Classifier-Free Diffusion Models, ICME 2025.