Towards a federated infrastructure for the global data pipeline