Disaggregated Computing for AI: Composable Infrastructure Architecture
CXL memory pooling achieves a 3.8x speedup over 200G RDMA and a 6.5x speedup over 100G RDMA when sharing memory across GPU servers running large language model inference. The