China SKA Regional Centre prototype
Summary
Resources per team: cluster architecture
Resource access: SSH credentials are sent by email to users/teams. Access must be from a static IP address or via VPN.
Data cube access: The data cube is stored in distributed shared storage and is visible to all teams.
Resource management: Slurm is used for resource and job management to avoid mutual interference between users.
Software management: Any software installation needs to be done by a superuser (administrator). Software can be installed in advance in the teams' directories.
Documentation: Manuals with instructions are available for download (inside SLURM): CSRC-P User Manual.doc
Support: Contacts listed in the Support section
Resource location: China
Technical specifications
Clusters of three architectures are deployed: Intel x86 (multi-core), Nvidia GPU, and ARM.
data storage: After the upgrade in September 2020, the storage was expanded to a total of 4.5 PB, of which more than 2 PB is spare.
network:
A high-speed internet link of up to 5 Gbps is connected to the prototype. This link can be used for massive data transfers.
A dedicated network with a bandwidth of 200 Mbps (from CSTNET) is reserved for SKA activities. Users worldwide can use it to log in to the clusters remotely. SDC users will use this link. It can also be used for downloading data at the GB level.
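To give a feel for what the two link speeds mean in practice, the sketch below computes an idealized transfer time from a data size and a link rate. The function name transfer_seconds is hypothetical, decimal units are assumed (1 GB = 8000 megabits), and the result ignores protocol overhead and link sharing, so real transfers will take longer.

```shell
#!/bin/sh
# transfer_seconds SIZE_GB RATE_MBPS
# Idealized transfer time in whole seconds (hypothetical helper, not a site tool).
transfer_seconds() {
    # 1 GB = 8000 megabits in decimal units; integer division rounds down.
    echo $(( $1 * 8000 / $2 ))
}

transfer_seconds 1 200       # 1 GB over the 200 Mbps dedicated link
transfer_seconds 1000 5000   # 1 TB over the 5 Gbps link
```

Under these assumptions, 1 GB over the 200 Mbps link takes about 40 seconds, and 1 TB over the 5 Gbps link takes about 1600 seconds (roughly 27 minutes).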
User access
Open the accounts
All accounts can be set up at once before the challenge release (email Baoqiang Lao - lbq@shao.ac.cn).
Users must provide a static IP address segment so that they can access the cluster remotely.
The lifetime of each account can be configured. Users who want to keep an account longer can ask us.
Number of accounts available
Please update the number of accounts that will be made available.
When SDC2 starts, each user will be assigned an individual account if the number of users is small; otherwise, 1-2 accounts will be assigned to each team.
Authentication
Credentials are sent by email to users/teams
Logging in
Please use your credentials (username and password) to log in to SHAO's cluster:
ssh username@202.127.3.157 (X86 machine)
ssh username@202.127.3.156 (ARM machine)
It is highly recommended that you reset your password first.
You can do this at https://202.127.3.156 using your initial random password (this is currently the only way to change your password).
Note: Before resetting the password and logging in to SHAO's cluster, please provide a static IP address so that you can access the cluster remotely.
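As a convenience, the two login hosts listed above could be given short aliases in an SSH client configuration file. This is a minimal sketch and not part of the site's instructions: the file name csrc_ssh_config and the aliases csrc-x86 and csrc-arm are hypothetical, and username stands for the account name sent to you by email.

```shell
#!/bin/sh
# Write a small SSH client configuration with one alias per login host.
# File name and Host aliases are illustrative; replace "username" with your account.
cat > csrc_ssh_config <<'EOF'
Host csrc-x86
    HostName 202.127.3.157
    User username

Host csrc-arm
    HostName 202.127.3.156
    User username
EOF

echo "Wrote csrc_ssh_config"
```

With this file in place, running ssh -F csrc_ssh_config csrc-x86 should behave like the first ssh command above. The connection will still only succeed from a registered static IP address or through the VPN.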
How to run a workflow
The supercomputing system uses Slurm for resource and job management to avoid mutual interference and improve operational efficiency. All jobs, whether for program debugging or production computations, must be submitted through one of three commands: srun (interactive parallel), sbatch (batch), or salloc (resource allocation). After submission, related Slurm commands can be used to query the job status. Please do not run jobs directly on the login node (compiling is the only exception), so as not to affect other users.
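A batch submission might look like the following minimal sketch. The job name, resource values, and file names are illustrative assumptions rather than site defaults, and no partition or account is set because this page does not list them; consult the CSRC-P User Manual for the real values. The script only calls sbatch if the command is present, so it can be tried safely off the cluster.

```shell
#!/bin/sh
# Write a small Slurm batch script (all values below are placeholders).
cat > sdc2_job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=sdc2-test
#SBATCH --nodes=1
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=4
#SBATCH --time=00:10:00
#SBATCH --output=sdc2-test-%j.out

# Replace this with the actual workflow command.
srun hostname
EOF

# Submit only where Slurm is installed (i.e. on the cluster login node).
if command -v sbatch >/dev/null 2>&1; then
    sbatch sdc2_job.sh
else
    echo "sbatch not found: copy sdc2_job.sh to the cluster and submit it there"
fi
```

Once a job has been submitted, squeue -u username lists its state and scancel followed by the job ID cancels it.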
Resource management
Statistics
At the end of the SDC, statistics of resource usage for each team/user can be printed for reference.
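This page does not say how these statistics are produced. If Slurm accounting is enabled on the cluster (an assumption), users could get a rough view of their own usage with sacct, as sketched below. The helper name sum_cpu_seconds and the start date are hypothetical placeholders.

```shell
#!/bin/sh
# Sum CPU seconds per user from "User|CPUTimeRAW" lines on standard input.
# Hypothetical helper; expects the pipe-separated output of sacct -P.
sum_cpu_seconds() {
    awk -F'|' '$1 != "" { total[$1] += $2 }
               END { for (u in total) printf "%s %d\n", u, total[u] }' | sort
}

START=2021-01-01   # placeholder start date, adjust to the challenge period

# Query the accounting database only where sacct exists (on the cluster).
if command -v sacct >/dev/null 2>&1; then
    sacct -X -n -P -u "${USER:-$(id -un)}" -S "$START" \
          --format=User,CPUTimeRAW | sum_cpu_seconds
fi
```

Here -X restricts the output to one line per job allocation, -n drops the header, and -P selects pipe-separated output, which keeps the awk parsing simple.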
Limitation of use
To prevent unlimited use and ensure a fair challenge, some constraints will be defined.
Support
For support with our services, please contact us at:
Contacts
Tao An - antao@shao.ac.cn (main contact)
Baoqiang Lao - lbq@shao.ac.cn (software environment)
Shaoguang Guo - sgguo@shao.ac.cn (networking, data transfer)
Ms. Xiaocong Wu - wuxc@shao.ac.cn (software deployment)
Credits and acknowledgements
Teams that use China proto-SRC resources in a publication are asked to add the following sentence to the Acknowledgements:
This work used resources of the China SKA Regional Centre prototype (An, Wu, Hong, Nat Astron, 2019, 3, 1030) funded by the National Key R&D Programme of China (2018YFA0404603) and the Chinese Academy of Sciences (114231KYSB20170003).