<?xml version="1.0"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.0//EN" "http://www.ncbi.nlm.nih.gov/entrez/query/static/PubMed.dtd">
<ArticleSet>
  <Article>
    <Journal>
      <PublisherName>Sichuan Knowledgeable Intelligent Sciences</PublisherName>
      <JournalTitle>International Scientific Technical  and Economic Research </JournalTitle>
      <Issn>2959-1309</Issn>
      <Volume>4</Volume>
      <Issue>2</Issue>
      <PubDate PubStatus="epublish">
        <Year>2026</Year>
        <Month>04</Month>
        <Day>03</Day>
      </PubDate>
    </Journal>
    <ArticleTitle>Research on Multi-Agent Collaborative Decision-Making Algorithm for Supply Chain Management</ArticleTitle>
    <FirstPage>21</FirstPage>
    <LastPage>50</LastPage>
    <ELocationID EIdType="doi">10.71451/ISTAER2614</ELocationID>
    <Language>eng</Language>
    <AuthorList>
      <Author>
        <FirstName>Changgeng</FirstName>
        <LastName>Li</LastName>
        <Affiliation>International Operations, Shinhan University, Gyeonggi-do, Republic of Korea</Affiliation>
        <Identifier Source="ORCID">0009-0006-1115-105X</Identifier>
      </Author>
      <Author>
        <FirstName>Zixi</FirstName>
        <LastName>Liu</LastName>
        <Affiliation>International Operations, Shinhan University, Gyeonggi-do, Republic of Korea</Affiliation>
        <Identifier Source="ORCID">0009-0008-0026-2680</Identifier>
      </Author>
    </AuthorList>
    <History>
      <PubDate PubStatus="received">
        <Year>2026</Year>
        <Month>04</Month>
        <Day>03</Day>
      </PubDate>
    </History>
    <Abstract>
Addressing the key challenges of fuzzy credit allocation, low exploration efficiency, and insufficient robustness in multi-node collaborative decision-making in supply chain management, this paper proposes a hybrid local-global credit allocation multi-agent collaborative decision-making algorithm (HGA-MADDPG). This algorithm introduces a hierarchical graph attention mechanism to dynamically represent the state of the supply chain network topology. It quantifies the contribution of individual actions to sub-chain objectives and system-level indicators through local and global credit networks, respectively, and designs an adaptive fusion weight based on marginal returns to dynamically balance local and global credit. Furthermore, an adversarial disturbance and resilient training architecture is constructed, including modeling three types of disturbances: demand mutation, node failure, and transportation delay, as well as adversarial agent injection, a dynamic environment replay buffer, and a two-stage training strategy. In a baseline scenario of a four-level supply chain and a dynamic environment driven by real data based on SCDL and WSN, compared with eight baseline algorithms, experimental results show that HGA-MADDPG achieves a total cost reduction rate of 26.2%, a service level improvement rate of 42.8%, and a stockout rate controlled at 3.2%. In the extreme scenario of triple perturbation, the cost deviation rate (29.6%) and recovery time (58 hours) are significantly better than existing methods. It still maintains a cost reduction rate of 21.5% in a 120-node ultra-large-scale supply chain. Ablation experiments and scalability analysis further verify the effectiveness of each core module.
</Abstract>
  </Article>
</ArticleSet>
