Difference between revisions of "Swestore-dCache"

From SNIC Documentation
Jump to: navigation, search
m (fix broken links to collaborators (WLCG))
(Swestore documentation moved)
(Tag: New redirect)
 
(182 intermediate revisions by 10 users not shown)
Line 1: Line 1:
[[Category:Storage]]
+
#REDIRECT[[Swestore Documentation Moved]]
[[Category:SweStore]]
 
SNIC is building a storage infrastructure to complement the computational resources.
 
 
 
Many forms of automated measurements can produce large amounts of data. In scientific areas such as high energy physics (the Large Hadron Collider at CERN), climate modeling, bioinformatics, bioimaging etc., the demands for storage are increasing dramatically. To serve these and other user communities, SNIC has appointed a working group to design a storage strategy, taking into account the needs on many levels and creating a unified storage infrastructure, which is now being implemented.
 
 
 
Swestore is in collaboration with [http://www.ecds.se/ ECDS], [http://snd.gu.se/ SND], Bioimage Sweden, [http://www.bils.se/ BILS], [http://www.uppnex.uu.se/ UPPNEX],[http://wlcg.web.cern.ch/ WLCG], [http://www.nrm.se/ NaturHistoriska RiksMuseet].
 
 
 
= National storage =
 
The aim of the nationally accessible storage is to build a robust, flexible and expandable system that can
 
be used in most cases where access to large scale storage is needed. To the user it should appear as a single large system,
 
while it is desirable that some parts of the system are distributed across all SNIC centra to benefit from the advantages
 
of, among other things, locality and cache effects. The system is intended as a versatile long-term storage system.
 
 
 
==Supported access protocol==
 
; Today SweStore support this protocols
 
: srm://, gsiftp://, http:// (ro), https:// (ro), webdav (rw).
 
; Coming to support this protocols
 
: NFS4.1, iRODS
 
 
 
== Getting access ==
 
; Apply for storage
 
: Please follow instructions [[Apply for storage on SweStore|here]]
 
; Get a client certificate.
 
: Follow the instructions [[Grid_certificates#Requesting_a_certificate|here]] to get your client certificate. For Terena certificates, please make sure you also [[Requesting_a_grid_certificate_using_the_Terena_eScience_Portal#Exporting Terena certificate for use with Grid tools|export the certificate for use with grid tools]]. For Nordugrid certificates, please make sure to also [[Requesting_a_grid_certificate_from_the_Nordugrid_CA#Installing_the_certificate_in_your_browser|install your client certificate in your browser]].
 
; Request membership in the SweGrid VO.
 
: Follow the instructions [[Grid_certificates#Requesting_membership_in_the_SweGrid_VO|here]] to get added to the SweGrid virtual organisation.
 
 
 
== Download and upload data ==
 
; Browse and download data
 
: SweStore is accessible from your web browser, here https://webdav.swegrid.se/. To browse private data you must first install your certificate in your browser (see above). Your data is available at <code><nowiki>https://webdav.swegrid.se/snic/YOUR_PROJECT_NAME</nowiki></code>.
 
; Upload and delete data
 
: Use the ARC client. Please see the instructions for [[Accessing SweStore national storage with the ARC client]].
 
: Use cURL. Please see the instructions for [[Accessing SweStore national storage with cURL]].
 
: Use lftp. Please see the instructions for [[Accessing SweStore national storage with lftp]].
 
: Use globus-url-copy. Please see the instructions for [[Accessing SweStore national storage with globus-url-copy]].
 
 
 
== Examples of storage projects ==
 
Below are some examples of project that are using SweStore today.
 
 
 
{|border="1" style="text-align:left; border-collapse: collapse; border-width: 1px; border-style: solid; border-color: #000" class="wikitable sortable"  valign=top
 
!Allocation name
 
!Size in TB
 
!class="unsortable"|Project full name
 
|-
 
|alice
 
|400
 
|
 
|-
 
|uppnex
 
|140
 
|[https://www.uppnex.uu.se UPPmax NExt Generation Sequencing Cluster & Storage]
 
|-
 
|brain_protein_atlas
 
|10
 
|Mouse brain protein atlas project
 
|-
 
| scims2lab
 
|20
 
| Identification of novel gene models by matching mass spectrometry data against 6-frame translations of the human genome
 
|-
 
|subatom
 
|
 
|Low-energy nuclear theory and experiment
 
|-
 
|genomics-gu
 
|10
 
|Genomics Core Facility, Sahlgrenska academy at University of Gothenburg.
 
|-
 
|Chemo
 
|5TB
 
|Genetic interaction networks in human deseas
 
|-
 
|cesm1_holocene
 
|30
 
|Arctic sea ice in warm climates
 
|}
 
 
 
== More information ==
 
* [[SweStore introduction]]
 
* [http://status.swestore.se/munin/monitor/monitor/ Per Project Monitoring of Swestore usage]
 
* [[Accessing SweStore national storage with the ARC client]]
 
<!-- * [[Mounting SweStore national storage via WebDAV|Mounting SweStore national storage via WebDAV (Not recomendated at the moment)]] -->
 
If you have any issues using SweStore please do not hesitate to contact [mailto:swestore-support@snic.vr.se swestore-support].
 
 
 
== Tools and scripts ==
 
 
 
There exists a number of tools and utilities developed externally that can be useful. Here are some links:
 
 
 
* [https://github.com/samuell/arc_tools ARC_Tools] - Convenience scripts for the arc client (Only a recursive rmdir so far).
 
* [http://sourceforge.net/projects/arc-gui-clients ARC Graphical Clients] - Contains the ARC Storage Explorer (SweStore supported development).
 
* Transfer script, [http://snicdocs.nsc.liu.se/wiki/SweStore/swstrans_arc swetrans_arc], provided by Adam Peplinski / Philipp Schlatter
 
* [http://www.nordugrid.org/documents/SWIG-wrapped-ARC-Python-API.pdf Documentation of the ARC Python API (PDF)]
 
 
 
= Center storage =
 
Centre storage, as defined by the SNIC storage group, is a storage solution that lives independently of the computational resources and can be accessed from all such resources at a centre. Key features include the ability to access the same filesystem the same way on all computational resources at a centre, and a unified structure and nomenclature for all centra. Unlike cluster storage which is tightly associated with a single cluster, and thus has a limited life-time, centre storage does not require the users to migrate their own data when clusters are decommissioned, not even when the storage hardware itself is being replaced.
 
 
 
== Unified environment ==
 
To make the usage more transparent for SNIC users, a set of environment variables are available on all SNIC resources:
 
 
 
* <code>SNIC_BACKUP</code> – the user's primary directory at the centre<br>(the part of the centre storage that is backed up)
 
* <code>SNIC_NOBACKUP</code> – recommended directory for project storage without backup<br>(also on the centre storage)
 
* <code>SNIC_TMP</code> – recommended directory for best performance during a job<br>(local disk on nodes if applicable)
 

Latest revision as of 10:01, 8 February 2023