DSpace
Search DSpace at MIT
MIT Libraries
 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

DSpace Home > Policies > Format Support

Format Support

The DSpace team hopes to provide support for as many file formats as possible. Over time, items stored in DSpace will be preserved as is, using a combination of time-honored techniques for data management and best practices for digital preservation. As for specific formats, however, the proprietary nature of many file types makes it impossible to make guarantees. Put simply, our policy for file formats is this:

  • Everything put in DSpace will be retrievable.
  • We will recognize as many files' formats as possible.
  • We will support as many known file formats as possible.

When a file is uploaded to DSpace, we assign it one of the following categories to note the level of support for its format:

  • Supported: DSpace fully supports the format.
  • Known: DSpace can recognize the format, but we cannot guarantee full support.
  • Unsupported: DSpace cannot recognize a format; such formats are listed as "application/octet-stream", or Unknown.

By "support", we mean "make usable in the future, using whatever combination of techniques (such as migration, emulation, etc.) is appropriate given the context of need." For supported formats, we might choose to bulk-transform files from a current format version to a future one, for instance. But we can't predict which services will be necessary down the road, so we'll continually monitor formats and techniques to ensure we can accomodate needs as they arise.

In the meantime, we can choose to "support" a format if we can gather enough documentation to capture how the format works. In particular, we collect file specifications, descriptions, and code samples, and make those available in the DSpace Format Reference Collection below. Unfortunately, this means that proprietary formats for which these materials are not publicly available cannot be supported in DSpace. We will still preserve these files, and in cases where those formats are native to tools supported by MIT Information Systems, we will provide you with guidance on converting your files into formats we do support. It is also likely that for extremely popular but proprietary formats (such as Microsoft .doc, .xls, and .ppt), we will be able to help make files in those formats more useful in the future simply because their prevalence makes it likely tools will be available. Even so, we cannot guarantee this level of service without also having more information about the formats, so we will still list these formats as "known", not "supported."

What to do if your format isn't recognized

We understand that there are always more formats to consider, and we would appreciate your help in identifying and studying the suitability of support for formats you care about. If we can't identify a format, DSpace will record it as "unknown", or "application/octet-stream," but we would like to keep the percentage of supported format materials in DSpace as high as possible. Don't hesitate to contact us if you have any questions or concerns.

DSpace Format Reference Collection

The table below describes how DSpace supports each listed file type. MIME type is the Multipurpose Internet Mail Extensions (MIME) type identifier. For more information on MIME, see the MIME RFCs or the MIME FAQ. Description is what most people use as the name for the format. Extensions are typical file name extensions (the part after the dot, for example, the extension for "index.html" is "html"). These are not case-sensitive in DSpace, so either "sample.XML" or "sample.xml" will be recognized as XML. The Level is DSpace's support level for each format:

  • Supported: DSpace fully supports the format.
  • Known: DSpace can recognize the format, but we cannot guarantee full support.
  • Unsupported: We cannot recognize a format; these will be listed as "application/octet-stream", or Unknown.

MIME type
Description
Extensions
Level
application/marc MARC marc, mrc supported
application/mathematica Mathematica ma known
application/msword Microsoft Word doc known
application/octet-stream Unknown (anything not listed) unsupported
application/pdf Adobe PDF pdf supported
application/postscript Postscript ps, eps, ai supported
application/sgml SGML sgm, sgml known
application/vnd.ms-excel Microsoft Excel xls known
application/vnd.ms-powerpint Microsoft Powerpoint ppt known
application/vnd.ms-project Microsoft Project mpp, mpx, mpd known
application/vnd.visio Microsoft Visio vsd known
application/wordperfect5.1 WordPerfect wpd known
application/x-dvi TeXdvi dvi known
application/x-filemaker FMP3 fm known
application/x-latex LateX latex known
application/x-photoshop Photoshop psd, pdd known
application/x-tex TeX tex known
audio/x-aiff AIFF aiff, aif, aifc supported
audio/basic audio/basic au, snd known
audio/x-mpeg MPEG Audio mpa, abs, mpeg supported
audio/x-pn-realaudio RealAudio ra, ram known
audio/x-wav WAV wav supported
image/gif GIF gif supported
image/jpeg JPEG jpeg, jpg supported
image/png PNG png supported
image/tiff TIFF tiff, tif supported
image/x-ms-bmp BMP bmp known
image/x-photo-cd Photo CD pcd known
text/html HTML html, htm supported
text/plain Text txt supported
text/richtext Rich Text Format rtf supported
text/xml XML xml supported
video/mpeg MPEG mpeg, mpg, mpe supported
video/quicktime Video Quicktime mov, qt known
text/x-sas-syntax SAS Syntax File sas Supported
application/x-sas-system SAS System File sas7bdat, sd1, sd2, sd7, ssd01, ssd, ssd04 Known
application/x-sas-transport SAS Transport File xpt, cport, v5x, v6x, v7x Known
text/x-spss-syntax SPSS Syntax File sps Supported
application/x-spss-sav SPSS system file sav Known
application/x-spss-sav SPSS portable file por Known
text/x-stata-syntax Stata Syntax file do Supported
application/x-stata Stata Binary files dta Known
text/x-r-syntax R syntax file r, R Supported
application/x-rlang-transport R binary file Rdata, rdata Known
text/x-fixed-field fixed field text data dat, asc Supported
text/csv Comma separated values csv Supported
text/tab-separated-values Tab separated values tab Supported

 

 

 

 

invent MIT: The 
			HP-MIT Alliance
Copyright © 2002-2006 MIT Libraries & Hewlett-Packard Company.
DSpace is a trademark of the Massachusetts Institute of Technology.