arXiv cs.AI
· Papers
DMind Benchmark: Toward a Holistic Assessment of LLM Capabilities across the Web3 Domain
arXiv:2504.16116v4 Announce Type: replace-cross Abstract: The Web3 ecosystem, underpinned by cryptographic primitives and decentralized consensus, represents a high-stakes environment where software vulnerabilities and incentive misalignments translate directly into financial loss. As Large Language Models (LLMs) are i