India, Oct. 29 -- Cognitive warmup. Is this another DeepSeek moment for AI to contend with? Chinese tech giants Alibaba says their Alibaba Cloud platform has successfully tested a new compute pooling system called Aegaeon, which has reduced the number of Nvidia H20 GPUs required to serve dozens of models of up to 72-billion parameters, from 1,192 to 213 GPUs. This is according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea."Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market," researchers from Peking University and Alibaba Cloud wrote in the paper. This may well be another example of working smarte...