Introduction: The 6G world requires connected intelligence, but there is a crucial paradox between the standards of Large Language Model (LLM) and edge constraints. The premium devices have up to 6-12GB of DRAM, whereas the typical 175B models need 350GB of storage, which is 30 times that of the premium version. Literature Survey: It has been proposed that the bandwidth can be reduced by 90 % with Semantic Communication (Scom) and Edge Semantic Cognitive Intelligence (ESCI). Besides, neuromorphic-based Spiking Neural Networks (SNNs)-model quantization (INT4/INT8) are also known to be necessary to achieve order-of-magnitude energy efficiency (J/token) on resource-constrained hardware. Methodology: This paper proposes a Hybrid Cognitive Model utilizing a three-tier Cloud-Edge-Device hierarchy. The model integrates event-driven neuromorphic principles with self-optimizing resource management, utilizing paged KV-cache and resource-aware agents for dynamic task offloading. Results: Quantitative evidence is used to show that the hybrid strategy helps to address the 30x resource gap by attaining a 10-100x energy-per-token efficiency due to event-driven neuromorphic sparsity. Statistical analysis makes it evident that semantic filtering substantially reduces communication overhead and maintains reasoning faithfulness by 90 %, and, effectively, it keeps the thermal conditions of devices stable in the case of prolonged 6G edge communications. This model can be used to make sustainable and multi-step thinking on the edge. The hybrid solution achieves the 6G vision of pervasive intelligence by bridging the hardware-software gap via cross-layer co-design.
Yang H, Lam KY, Xiao L, Xiong Z, Hu H, Niyato D, et al. Lead federated neuromorphic learning for wireless edge artificial intelligence. Nature Communications. 2022;13(1).
2.
Yang B, Cao X, Xiong K, Yuen C, Guan YL, Leng S, et al. Edge Intelligence for Autonomous Driving in 6G Wireless System: Design Challenges and Solutions. IEEE Wireless Communications. 2021;28(2):40–7.
3.
Wei P, Guo K, Li Y, Wang J, Feng W, Jin S, et al. Reinforcement Learning-Empowered Mobile Edge Computing for 6G Edge Intelligence. IEEE Access. 2022;10:65156–92.
4.
Letaief KB, Shi Y, Lu J, Lu J. Edge Artificial Intelligence for 6G: Vision, Enabling Technologies, and Applications. IEEE Journal on Selected Areas in Communications. 2022;40(1):5–36.
5.
Shang X, Huang Y, Liu Z, Yang Y. NVM-Enhanced Machine Learning Inference in 6G Edge Computing. IEEE Transactions on Network Science and Engineering. 2024;11(6):5615–26.
6.
Chaccour C, Saad W. Edge Intelligence in 6G Systems. Computer Communications and Networks. Springer International Publishing; 2021. p. 233–49.
7.
Gupta R, Reebadiya D, Tanwar S. 6G-enabled Edge Intelligence for Ultra -Reliable Low Latency Applications: Vision and Mission. Computer Standards & Interfaces. 2021;77:103521.
8.
Chang Z, Liu S, Xiong X, Cai Z, Tu G. A Survey of Recent Advances in Edge-Computing-Powered Artificial Intelligence of Things. IEEE Internet of Things Journal. 2021;8(18):13849–75.
9.
Bal M, Sengupta A. SpikingBERT: Distilling BERT to Train Spiking Language Models Using Implicit Differentiation. Proceedings of the AAAI Conference on Artificial Intelligence. 2024;38(10):10998–1006.
10.
Bal M, Sengupta A. SpikingBERT: Distilling BERT to Train Spiking Language Models Using Implicit Differentiation. Proceedings of the AAAI Conference on Artificial Intelligence. 2024;38(10):10998–1006.
11.
Gautam A, Patton R, Potok T, Kannan R, Aimone J, Severa W. AI-Powered Knowledge Graphs for Neuromorphic and Energy-Efficient Computing. Proceedings of the Great Lakes Symposium on VLSI 2025. ACM; 2025. p. 996–1001.
12.
Qin H, Hu C, Magno M. Event-Priori-Based Vision-Language Model for Efficient Visual Understanding. Communications in Computer and Information Science. Springer Nature Singapore; 2025. p. 16–30.
13.
Ji S, Li J, Jin H, Dong H, Ge Z, Yang S, et al. Resource Aware Multi-User Task Offloading In Mobile Edge Computing. 2024 IEEE International Conference on Web Services (ICWS). IEEE; 2024. p. 665–75.
14.
Tefera G, She K, Shelke M, Ahmed A. Decentralized adaptive resource-aware computation offloading & caching for multi-access edge computing networks. Sustainable Computing: Informatics and Systems. 2021;30:100555.
15.
Li S, Ma Y, Zhang Y, Xie Y. Towards Enhanced Energy Aware Resource Optimization for Edge Devices Through Multi-cluster Communication Systems. Journal of Grid Computing. 2024;22(2).
16.
Darchini-Tabrizi M, Hemati E, Entezari-Maleki R. RADTO: A Resource-Aware and Dynamic Task Offloading Strategy for Mobile Edge Computing. 2025 29th International Computer Conference, Computer Society of Iran (CSICC). IEEE; 2025. p. 1–5.
17.
Ali M, Arshad M, Uddin I, Ali G, Asim M, ELAffendi M. A Resource Aware Memory Requirement Calculation Model for Memory Constrained Context-Aware Systems. IEEE Access. 2024;12:19320–9.
18.
Xu Y, Xu J. Resource-aware and computation offloading based on space–air–ground–sea integrated network. The Journal of Supercomputing. 2025;81(5).
19.
Shan W, Sheng S, Min L, Bo G, Yuwei W, Fuhong L. Resource-aware probability-based collaborative odor source localization using multiple UAVs. China Communications. 2025;22(12):269–80.
20.
Qin L, Lu H, Chen Y, Gu Z, Zhao D, Wu F. Energy-Efficient Blockchain-Enabled User-Centric Mobile Edge Computing. IEEE Transactions on Cognitive Communications and Networking. 2024;10(4):1452–66.
The statements, opinions and data contained in the journal are solely those of the individual authors and contributors and not of the publisher and the editor(s). We stay neutral with regard to jurisdictional claims in published maps and institutional affiliations.