The Research File System (RFS) at Indiana University
Important: Although Indiana University's Research File System (RFS) is secure enough for storing research-related electronic protected health information (ePHI) regulated by the Health Insurance Portability and Accountability Act of 1996 (HIPAA), the system does not encrypt stored data. Consequently, you must encrypt ePHI research data before storing it on RFS. Even if your files are stored on an encrypted hard drive (e.g., using PGP Whole Disk Encryption), you still must encrypt your files individually (using an application such as AESCrypt) before transferring them to RFS. If you have questions concerning research involving HIPAA-regulated data, contact the UITS Advanced Biomedical IT Core (ABITC).
For other important considerations regarding work with ePHI research data, see the Working with electronic protected health information section (below).
On this page:
- General information
- System information
- Accessing RFS
- Requesting an account
- Best uses for RFS
- Working with data containing PHI
The Indiana University Research File System (RFS) is a centralized storage area based on OpenAFS. The IU RFS is compatible with all major operating systems, and accessible using various methods from on and off campus. RFS data are regularly backed up, and reside in physically secure environments on the IU Bloomington and IUPUI campuses. IU researchers can get 100 GB of disk storage at no charge. In addition to individual user directories, RFS offers project areas well suited for use by collaborative research teams. All users in a project area group have access to that area, and therefore can exchange files and collaborate. Group account owners can tailor access rights for individual users within their projects.
Access is available to IU graduate students, faculty, and staff. Undergraduates and non-IU collaborators must have IU faculty sponsors. For details, see the "Research system accounts (all campuses)" section of What computing accounts are available at IU, and for whom?
Note: To keep pace with the growing needs of Indiana University researchers, Research File System (RFS) storage will be migrating in the near future from OpenAFS to a new technology from DataDirect Networks, Inc. (DDN), based on the General Parallel File System (GPFS). For more, see Information about upcoming changes to the Research File System at IU.
Note: The RFS is offline for regularly scheduled maintenance every Sunday 7am-10am.
|Machine type||Research file system|
|Operating system||Red Hat Enterprise Linux 5|
|Nodes||4 file servers
4 gateway nodes
|Network file system protocols||OpenAFS, CIFS (Samba), SFTP/SCP, HTTPS|
|Total disk space||50 TB|
|Quotas||100 GB (default) per user, 100 GB (default) per project; increases as needed|
|Backup and purge policies||The system is backed up nightly. Data recoveries are possible within 30 days of deletion. The system is never purged.|
|Aggregate I/O||4 Gbs (currently limited by network connection)|
You can access RFS from your personal computer using several methods; choose the method that best fits your work habits, operating system, and location:
Samba: The Samba interface is popular
with Windows and Mac OS X users, because it lets you map
(or mount) RFS on your desktop (like any other disk drive
or a portable storage device). Via Samba, your RFS area appears as a
desktop window filled with your folders and files. See At IU, how do I use Samba to mount or map to the RFS on my workstation?
Note: When connected via Samba to the Research File System (RFS), transferring large files (2.5 GB or larger) to RFS or between RFS directories may cause your connection to drop; this also may cause problems on the RFS server. This is an issue with Samba connections only; if you need to move or copy files that are 2.5 GB or larger, connect to RFS using OpenAFS, SFTP, or RFSWeb. If you need help or have questions, email the Research Storage team.
SFTP: If you routinely copy files that are 1 GB
or larger, use SFTP, a fast, secure tool specialized for
file transfer. SFTP is commonly used from the command line, but
graphical SFTP clients are also available. See At IU, can I use FTP or SFTP to access my RFS space?
RFSWeb: RFSWeb is a secure website that lets you navigate to your RFS storage area using your
web browser. It's accessible from on and off campus, and
compatible with most operating systems. See At IU, what is RFSWeb and how do I use it?
OpenAFS client: The OpenAFS client is a somewhat
advanced tool that works best under Linux, but Windows and
Mac OS X versions are also available. The OpenAFS client displays your
RFS storage area as part of your computer's local directory structure,
and is well suited for high-volume, high-intensity work. Installing
the client on your computer can be challenging, and it's helpful to
have experience with the finer details of your operating system. See
At IU, how do I install and configure OpenAFS on my workstation for use with the RFS?
If you have an account on Quarry, the OpenAFS client is already installed. Your RFS storage area will appear as part of the cluster's directory structure.
Requesting an account
To request an account on the Scholarly Data Archive (SDA) or Research File System (RFS):
- For instructions on requesting an individual SDA or RFS account,
see At IU, if I already have some computing accounts, how do I get others?
- For instructions on requesting an SDA or RFS account for an IU group or department, see Requesting IU computing accounts for groups or departments. Note: In accordance with standards for access control mandated by the HIPAA Security Rule, you are not permitted to access data containing protected health information (PHI) using a group (or departmental) account. To ensure accountability and permit access to authorized users only, IU researchers must use their personal Network ID credentials for all work involving PHI.
After submitting your account request, UITS will notify you via email when your account is ready for use.
For eligibility requirements, see the "Research system accounts (all campuses)" section in What computing accounts are available at IU, and for whom?
Best uses for RFS
RFS is primarily a file system, and not intended as archival storage. Archival storage for IU researchers is available on the Scholarly Data Archive (SDA).
RFS is best suited for:
- Storing relatively small files
- Storing files that are updated frequently
Note: Your applications can open files on RFS directly if you use Samba or the OpenAFS client to access your RFS space.
- Storing frequently accessed files
- Storing files that need to be shared, especially group project work
Do not use RFS for:
- Backup storage; RFS is intended as working space, use the SDA to
- Storing concurrently updated files (e.g., Access databases)
- Storing relational database (e.g., MySQL or Postgre SQL databases)
Working with data containing PHI
The Health Insurance Portability and Accountability Act of 1996 (HIPAA) established rules protecting the privacy and security of individually identifiable health information. The HIPAA Privacy Rule and Security Rule set national standards requiring organizations and individuals to implement certain administrative, physical, and technical safeguards to maintain the confidentiality, integrity, and availability of protected health information (PHI).
This system meets certain requirements established in the HIPAA Security Rule that enable its use for research involving data that contain protected health information (PHI). You may use this resource for research involving data that contain PHI only if you institute additional physical, administrative, and technical safeguards that complement those UITS already has in place. For details, see When using UITS Research Technologies systems and services, what are my legal responsibilities for protecting the privacy and security of data containing protected health information? You can also contact the Advanced Biomedical IT Core for help.
Note: Although PHI is a type of institutional data classified as Critical by the IU Committee of Data Stewards, other types of institutional data classified as Critical are not permitted on Research Technologies systems. Except for PHI, the most sensitive classification of institutional data allowed on Research Technologies resources is Restricted. For help determining which institutional data elements classified as Critical are considered PHI, see Which data elements in the classifications of institutional data are considered protected health information (PHI)?Note: In accordance with standards for access control mandated by the HIPAA Security Rule, you are not permitted to access data containing protected health information (PHI) using a group (or departmental) account. To ensure accountability and permit access to authorized users only, IU researchers must use their personal Network ID credentials for all work involving PHI.