Ideas for testing data science workflows on self hosted Linux based HPC cluster.
📰 Reddit r/datascience
Hi all, Mid–Senior Data Scientist here. I currently work in a team that develops and maintains several fairly large-scale data science projects on a self-hosted, multi-user Linux HPC cluster. Both compute and storage are hosted on-premises. Storage is separated into development/test and production environments, with restricted write access in production. Our technology stack includes: * Debian Linux * Python * Perl * Fortra
DeepCamp AI