Tarefa
Assignment #2: Building your own corpus
Condições de conclusão
Aberto: terça-feira, 2 jul. 2013, 17:20
Assignment #2 - Using "Learning by Doing" (or by "Example") to build your own corpus
In this assignment, you will select a few scientific manuscripts that are useful for your writing and/or related to your area. You will examine them with a fine-tooth comb! Here, you will learn to build your corpus, by annotating the text, using the strategies proposed in the nine steps. For additional help and details, see presentation "ime_v2_4_corpus_creation.pdf"
What to do?
- Having selected 3 to 5 scientific manuscripts from reliable sources, your goal now is to annotate them by carefully examining the text.
- It is important that you read the manuscripts and determine whether they are well-written.
- Select and begin to annotate expressions that convey important messages to you and to your work. Consider words or sentences that may be helpful to you in writing about your work.
- What can you identify that is typical of your field?
- If you are in math, physics, computer science, engineering, can you relate to the work described?
- HOW to annotate? You can use labels as in XML, e.g.
- <introduction> ... </introduction>
- <method> ... </method>
- Using the components (proposed in “components_typical_introduction.pdf”), annotate the abstract and the introduction.
- Continue “processing” the text in your selected publications and identify the strategies that you could use.
- Go through the 9 steps, one by one until you have built a collection of sentences that (i) fit a specific component and/or (ii) can serve as examples of patterns in your research field. NOTE that these latter ones will be very general sentence patterns.
- Remember, you cannot copy text “as it is”. That would result in plagiarism. The goal here is to identify parts that can help you produce your own terms and sentences.
What to submit?
Your assignment must be submitted as a text file.
- Name the file as follows: "assignment_2_yourfirstname.txt" (or ".rtf", or ".doc"/".docx" if you use Word), where yourfirstname is indeed your first name.
- Write the title of the assignment in your file, include your name and date.
- Make sure you specify how you annotate your selected publications.
- See "sample_annotated_corpus.pdf" and "mini_sample_corpus.pdf" for examples on how your annotated corpus may look like.
Important
- Do not copy "text as is" from the web or your work will be considered plagiarism. You must use your own words.
- Throughout the course, you must cite your sources properly. Points will be taken off for failure to do so. See an example of a proper citation below:
Correct:
Stockman, Farah. "Foreign activists stay covered online." Boston Globe. Boston Globe, 30 01 2011. Web. 30 Jan 2011. <http://www.boston.com/news/world/africa/articles/2011/01/30/mass_groups_software_helps_avoid_censorship/>. Incorrect: http://www.boston.com/news/world/africa/articles/2011/01/30/mass_groups_software_helps_avoid_censorship - For help with proper citations you can usse online resources to create citations. Use, for example, citationmachine.net.