-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy path2539.xml
162 lines (162 loc) · 6.63 KB
/
2539.xml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
<?xml version="1.0" encoding="UTF-8"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
<teiHeader type="ISBD-ER">
<fileDesc>
<titleStmt>
<title type="main">British Academic Written English Corpus</title>
<author>Nesi, Hilary</author>
<author>Gardner, Sheena</author>
<author>Thompson, Paul</author>
<author>Wickens, Paul</author>
<funder>Economic and Social Research Council (ESRC)</funder>
<respStmt>
<resp/>
<name/>
</respStmt>
</titleStmt>
<editionStmt>
<edition/>
</editionStmt>
<extent>
<seg type="designation">CollectionText</seg>
<seg type="size">206 files: ca. 61.9 MB</seg>
</extent>
<publicationStmt>
<authority>[Depositor details removed]<date>2008-10-01</date></authority>
<authority>[Depositor details removed]<date>2008-10-01</date></authority>
<distributor>
<name key="ota" type="organisation">University of Oxford Text Archive</name>
<name type="place">Oxford</name>
<address>
<addrLine>
<name key="ota" type="organisation">OTA</name>
</addrLine>
<addrLine>OUCS</addrLine>
<addrLine>7-19 Banbury Road</addrLine>
<addrLine>Oxford</addrLine>
<addrLine>OX2 6NN</addrLine>
<addrLine>
<name type="email">[email protected]</name>
</addrLine>
</address>
</distributor>
<idno type="ota">2539</idno>
<availability n="OTA-new" status="restricted" rend="visible">
<p>Available for non-commercial use on condition that this header is included in its
entirety with any copy distributed. Registration or request via our order form is
required.</p>
</availability>
<date>2008-10</date>
</publicationStmt>
<notesStmt>
<note>Title proper taken from OTA Catalogue Form</note>
<note type="summary">The BAWE corpus contains 2761 pieces of proficient assessed student
writing, ranging in length from about 500 words to about 5000 words. Holdings are fairly
evenly distributed across four broad disciplinary areas (Arts and Humanities, Social
Sciences, Life Sciences and Physical Sciences) adn across four levels of study
(undergraduate and taught masters level). Thirty-five disciplines are represented. The
assignments have been annotated using a system devised in accordance with the TEI
guidelines. There is a dtd file which must be kept in the same folder as the corpus files,
named tei_bawe.dtd and the holdings are described in an Excel spreadsheet 'BAWE.xls'. The
transcription and mark-up conventions are described in the BAWE manual document, which is
in PDF format. </note>
</notesStmt>
<sourceDesc>
<p>no source record</p>
<bibl> The original assignments are held anonymised as PDF documents by the project team.
</bibl>
</sourceDesc>
</fileDesc>
<encodingDesc>
<projectDesc>
<p>The British Academic Written English (BAWE) corpus was created through a collaboration
between the Univeristies of Warwick, Reading and Oxford Brookes as part of the project 'An
investigatin of genres of assessed writing in British Higher Education' funded by the
Economic and Social Resaerch Council (ESRC). The aims of the research were to identify
genres of assessed writing in UK Higher Education and to investigate textual variation
across disciplines and year of study in the context of academic conventions and tutor
expectations of student writing. From the research, a database was created for use by
researchers who are investigating the nature of academic writing. </p>
</projectDesc>
<editorialDecl>
<p>Encoding format: TEI XML</p>
</editorialDecl>
<refsDecl>
<p/>
</refsDecl>
<classDecl>
<taxonomy xml:id="OTASH">
<bibl>University of Oxford Text Archive Subject Headings</bibl>
</taxonomy>
<taxonomy xml:id="LCSH">
<bibl>Library of Congress Subject Headings</bibl>
</taxonomy>
</classDecl>
</encodingDesc>
<profileDesc>
<creation>
<date from="2004-01-01" to="2007-12-31">The resource was created between 2004 and 2007 </date>
<note>Student assignments were collected from the student writers as Word documents,
anonymised and converted to PDF format. They were marked up following the Text Encoding
Initiative (TEI) guidelines. The mark-up conventions are described in the corpus
manual.</note>
</creation>
<langUsage>
<language ident="eng">English</language>
</langUsage>
<textClass>
<keywords scheme="#OTASH">
<term type="genre">Linguistic corpora</term>
</keywords>
<keywords scheme="#OTASH">
<term type="resource_type">Corpus</term>
</keywords>
<keywords scheme="#LCSH">
<term type="genre">Linguistics</term>
</keywords>
<keywords scheme="#LCSH">
<term type="genre">Linguistics analysis (Linguistics)</term>
</keywords>
<keywords scheme="#LCSH">
<term type="genre"/>
</keywords>
<keywords scheme="#LCSH">
<term type="genre"/>
</keywords>
</textClass>
<textClass>
<keywords scheme="http://www.ota.ox.ac.uk/processing">
<term type="class">corpus</term>
<term type="license">Restricted</term>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change><date>2008-10-01</date><label>cataloguer</label><name>Wynne, Martin</name>TEIXML
header composed </change>
<change><date>2009-09-03</date><label>cataloguer</label><name>Wynne, Martin</name>Size of file to be downloaded changed to 61.9 Mb </change>
</revisionDesc>
</teiHeader>
<text>
<body>
<div type="OTA-filemap" rend="visible" n="unrestricted">
<ab type="dir">
<seg type="part">1</seg>
<ptr type="resourcePart" target="documentation"/>
</ab>
<ab type="dir">
<seg type="part">2</seg>
<ptr type="resourcePart" target="CORPUS_ASCII"/>
</ab>
<ab type="dir">
<seg type="part">3</seg>
<ptr type="resourcePart" target="CORPUS_TXT"/>
</ab>
<ab type="dir">
<seg type="part">4</seg>
<ptr type="resourcePart" target="CORPUS_UTF-8"/>
</ab>
</div>
</body>
</text>
</TEI>