|
|
DSpace Home
> Policies > Format Support
Format Support
The DSpace team hopes to provide support for as many file formats
as possible. Over time, items stored in DSpace will be preserved
as is, using a combination of time-honored techniques for data management
and best practices for digital preservation. As for specific formats,
however, the proprietary nature of many file types makes it impossible
to make guarantees. Put simply, our policy for file formats is this:
- Everything put in DSpace will be retrievable.
- We will recognize as many files' formats as possible.
- We will support as many known file formats as possible.
When a file is uploaded to DSpace, we assign it one of the following
categories to note the level of support for its format:
- Supported: DSpace fully supports the format.
- Known: DSpace can recognize the format, but we cannot
guarantee full support.
- Unsupported: DSpace cannot recognize a format; such formats
are listed as "application/octet-stream", or Unknown.
By "support", we mean "make usable in the future,
using whatever combination of techniques (such as migration, emulation,
etc.) is appropriate given the context of need." For supported
formats, we might choose to bulk-transform files from a current
format version to a future one, for instance. But we can't predict
which services will be necessary down the road, so we'll continually
monitor formats and techniques to ensure we can accomodate needs
as they arise.
In the meantime, we can choose to "support" a format
if we can gather enough documentation to capture how the format
works. In particular, we collect file specifications, descriptions,
and code samples, and make those available in the DSpace
Format Reference Collection below. Unfortunately, this means
that proprietary formats for which these materials are not publicly
available cannot be supported in DSpace. We will still preserve
these files, and in cases where those formats are native to tools
supported by MIT Information Systems, we will provide you with guidance
on converting your files into formats we do support. It is also
likely that for extremely popular but proprietary formats (such
as Microsoft .doc, .xls, and .ppt), we will be able to help make
files in those formats more useful in the future simply because
their prevalence makes it likely tools will be available. Even so,
we cannot guarantee this level of service without also having more
information about the formats, so we will still list these formats
as "known", not "supported."
What to do if your format isn't recognized
We understand that there are always more formats to consider, and
we would appreciate your help in identifying and studying the suitability
of support for formats you care about. If we can't identify a format,
DSpace will record it as "unknown", or "application/octet-stream,"
but we would like to keep the percentage of supported format materials
in DSpace as high as possible. Don't hesitate to contact us if you
have any questions or concerns.
DSpace Format Reference Collection
The table below describes how DSpace supports each listed file type.
MIME type is the Multipurpose Internet Mail
Extensions (MIME) type identifier. For more information on MIME,
see the MIME RFCs or the MIME FAQ. Description is what most people
use as the name for the format. Extensions are typical file name
extensions (the part after the dot, for example, the extension for "index.html"
is "html"). These are not case-sensitive in DSpace, so
either "sample.XML" or "sample.xml" will be
recognized as XML. The Level is DSpace's support level for each format:
- Supported: DSpace fully supports the format.
- Known: DSpace can recognize the format, but we cannot guarantee full
support.
- Unsupported: We cannot recognize a format; these will be listed
as "application/octet-stream", or Unknown.
MIME type |
Description |
Extensions |
Level |
| application/marc |
MARC |
marc, mrc |
supported |
| application/mathematica |
Mathematica |
ma |
known |
| application/msword |
Microsoft Word |
doc |
known |
| application/octet-stream |
Unknown |
(anything not listed) |
unsupported |
| application/pdf |
Adobe PDF |
pdf |
supported |
| application/postscript |
Postscript |
ps, eps, ai |
supported |
| application/sgml |
SGML |
sgm, sgml |
known |
| application/vnd.ms-excel |
Microsoft Excel |
xls |
known |
| application/vnd.ms-powerpint |
Microsoft Powerpoint |
ppt |
known |
| application/vnd.ms-project |
Microsoft Project |
mpp, mpx, mpd |
known |
| application/vnd.visio |
Microsoft Visio |
vsd |
known |
| application/wordperfect5.1 |
WordPerfect |
wpd |
known |
| application/x-dvi |
TeXdvi |
dvi |
known |
| application/x-filemaker |
FMP3 |
fm |
known |
| application/x-latex |
LateX |
latex |
known |
| application/x-photoshop |
Photoshop |
psd, pdd |
known |
| application/x-tex |
TeX |
tex |
known |
| audio/x-aiff |
AIFF |
aiff, aif, aifc |
supported |
| audio/basic |
audio/basic |
au, snd |
known |
| audio/x-mpeg |
MPEG Audio |
mpa, abs, mpeg |
supported |
| audio/x-pn-realaudio |
RealAudio |
ra, ram |
known |
| audio/x-wav |
WAV |
wav |
supported |
| image/gif |
GIF |
gif |
supported |
| image/jpeg |
JPEG |
jpeg, jpg |
supported |
| image/png |
PNG |
png |
supported |
| image/tiff |
TIFF |
tiff, tif |
supported |
| image/x-ms-bmp |
BMP |
bmp |
known |
| image/x-photo-cd |
Photo CD |
pcd |
known |
| text/html |
HTML |
html, htm |
supported |
| text/plain |
Text |
txt |
supported |
| text/richtext |
Rich Text Format |
rtf |
supported |
| text/xml |
XML |
xml |
supported |
| video/mpeg |
MPEG |
mpeg, mpg, mpe |
supported |
| video/quicktime |
Video Quicktime |
mov, qt |
known |
| text/x-sas-syntax |
SAS Syntax File |
sas |
Supported |
| application/x-sas-system |
SAS System File |
sas7bdat, sd1, sd2, sd7, ssd01, ssd, ssd04 |
Known |
| application/x-sas-transport |
SAS Transport File |
xpt, cport, v5x, v6x, v7x |
Known |
| text/x-spss-syntax |
SPSS Syntax File |
sps |
Supported |
| application/x-spss-sav |
SPSS system file |
sav |
Known |
| application/x-spss-sav |
SPSS portable file |
por |
Known |
| text/x-stata-syntax |
Stata Syntax file |
do |
Supported |
| application/x-stata |
Stata Binary files |
dta |
Known |
| text/x-r-syntax |
R syntax file |
r, R |
Supported |
| application/x-rlang-transport |
R binary file |
Rdata, rdata |
Known |
| text/x-fixed-field |
fixed field text data |
dat, asc |
Supported |
| text/csv |
Comma separated values |
csv |
Supported |
| text/tab-separated-values |
Tab separated values |
tab |
Supported |
|