Help:Proofread (సహాయం:అచ్చుదిద్దు)

Looking at documents

Page, Index and main Namespaces

Namespaces are used to differentiate page types at Wikisource. For example, the page you are reading now is in the "Help:" namespace. See also Help:Namespaces.

  • The సూచిక: namespace holds indexes for documents, displaying all the pages of a text together. A sample index can be found at సూచిక:Andhrula Charitramu Part-1.pdf. An index page shows the publication details of the document, a link to the File, and the set of links to its Pages. The Index can be modified to show the original page numbers. The ProofreadPage extension uses this type of page to build the navigation buttons: previous page, next page, and up to index.
Index pages on this wiki are listed in వర్గం:సూచిక.
  • The పుట: namespace is used to display page text side-by-side with individual page images, and allows transcription of the original text. A sample page can be found at పుట:Andhrula Charitramu Part-1.pdf/5. You can zoom in on the page image by clicking and scrolling on the image in the right-hand pane.
Page numbers for DjVu or PDF files are indicated by adding a slash followed by the page number to the file name. For example, పుట:Sketch of Connecticut, Forty Years Since.DJVU/27 displays the 27th page of the file. The text on the left can be modified in edit mode; only the contents of the edit box are displayed on the 'main' page.


Some text and formatting may be placed outside of the edit box, in 'no-include' sections. These boxes can be opened by selecting the 'Proofread tools' in the upper left corner and clicking the button that shows or hides the header and footer. The header and footer are automatically placed inside <noinclude> tags, which prevent transclusion of their contents. An example is పుట:Andhrula Charitramu Part-1.pdf/5, which hides the page number and the title that is repeated on every page.
The tab on the page is a link to its index.
  • The 'main' namespace uses 'transclusion' to display a number of pages as a chapter or section of the work. A link at left provides access to the పుట: namespace. The original page numbering, which is provided by the Index, can be displayed at these links. The text in the header and footer of the పుట: namespace is not displayed.
A mainspace page, such as ఆంధ్రుల చరిత్రము - ప్రథమ భాగము, displays the text and illustrations of that work.

Editing pages

The following buttons appear for editing and navigating the పుట: namespace:

  • previous page
  • next page
  • the Index for the page
  • show/hide the interface for editing the header and footer
  • zoom out on scan
  • zoom in on scan
  • reset zoom

Formatting conventions

The following conventions are considered best practices for pages in the పుట: namespace (DjVu or PDF files and other files which use the ProofreadPage extension). For general article formatting conventions and guidelines see Wikisource:Style guide.

  • A scanned page's header and footer often include the page number and titles, neither of which is needed for the new page. Place this information in the 'no-include' sections, accessed by clicking the button which appears above the edit window. The {{RunningHeader}} or {{rh}} template is useful for formatting these headers, and is used as follows:
{{RunningHeader|left=|center=|right=}}
OR
{{rh|left=|center=|right=}}
  • Text in the left, center, or right parameters will appear on the same line.
  • Remove end-of-line hyphens and line breaks. To start a new paragraph, MediaWiki pages use two returns (a blank line).
  • When a word is hyphenated across two pages of the DjVu or PDF scan, use {{hws}} and {{hwe}} (or, if you wish, {{Hyphenated word start}} and {{Hyphenated word end}}). These templates make the word appear hyphenated in the పుట: namespace and remove the hyphen when the text is transcluded. Example: the first Page and the second Page each hold half of the word, and the main page shows the whole word, "pretending".
{{hws|FIRST HALF OF WORD|WHOLE WORD}}
{{hwe|LAST HALF OF WORD|WHOLE WORD}}
Note: to force a paragraph break between two pages, the {{Blank line}} template was formerly used at the top of the following page; {{nop}} is now preferred.
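
The clean-up step described in the list above (removing end-of-line hyphens and line breaks while keeping paragraphs apart) can be sketched as a small script. This is only an illustration: the function name unwrap_ocr_text is hypothetical, and it assumes that every trailing hyphen marks a word split by the typesetter, which is not true of genuinely hyphenated compounds, so the result still needs to be checked against the scan.

```python
import re


def unwrap_ocr_text(text):
    """Join hard-wrapped OCR lines into paragraphs separated by one blank line."""
    # Paragraphs in the raw text are assumed to be separated by blank lines.
    paragraphs = re.split(r"\n\s*\n", text.strip())
    cleaned = []
    for para in paragraphs:
        lines = [line.strip() for line in para.splitlines() if line.strip()]
        joined = ""
        for line in lines:
            if joined.endswith("-"):
                # Assumption: a trailing hyphen is an end-of-line word break,
                # so drop it and glue the two halves of the word together.
                joined = joined[:-1] + line
            elif joined:
                joined += " " + line
            else:
                joined = line
        cleaned.append(joined)
    # Two returns (a blank line) start a new paragraph on a MediaWiki page.
    return "\n\n".join(cleaned)
```

Calling unwrap_ocr_text on the raw text of a page returns text that can be pasted into the edit box and then proofread as usual.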

Page status

The status of a page is reflected both in the color of its block on the index page, and by the banner on the page. The ProofreadPage Extension is used to implement the status.


The validation workflow of the ProofreadPage extension has five levels:
Without text / no proofreading needed
Empty page → Not proofread → Proofread → Validated
Problematic


The three normal levels in this sequence are:

  • Not proofread (అచ్చుదిద్దబడని) is the default value. (See all pages.)
  • Proofread (అచ్చుదిద్దబడినవి) means that one editor has checked the page for errors. (See all pages.)
  • Validated (ఆమోదించబడ్డవి) means that two editors have checked the page for errors. The button for this status appears only when the page has already been checked by a different editor. (See all pages.)

In addition,

  • Without text / no proofreading needed (పాఠ్యంలేని/అచ్చుదిద్దనవసరంలేని) is for blank pages and for pages that do not need to be checked by two editors. (See all pages.)
  • Problematic (సమస్యాత్మకం) marks pages that need discussion among editors. (See all pages.)


These buttons appear below your edit box. If another editor has already marked the page as proofread, they appear as shown below:


Five buttons


If nobody has proofread the page yet, the buttons appear like this:


Four buttons


In both cases, you can change the page status by selecting the matching button and saving. If the green "Validated" button does not appear, select the yellow "Proofread" button and save; the green button will then be visible to other editors.

The same color scheme is used on the index page, where each page is listed with its status as the background color.

Transclusion

After the text of the work is populated into each side-by-side image page, "transclusion" is used to display the text from the పుట: namespace on pages in the main namespace. Transclusion displays the text of another page without having to copy it or manually update it later. The purpose of transcluding the text is to group it into logical, reasonably sized chunks, most frequently chapters or sections.

A completed example is The Wind in the Willows. Once all of the individual pages of సూచిక:Wind_in_the_Willows_(1913).djvu were typed up in the "పుట:" namespace (one page can be seen at పుట:Wind_in_the_Willows_(1913).DJVU/19), the text was transcluded into the chapters of the book in the main namespace, beginning at The Wind in the Willows/Chapter 1. The following explains how to use transclusion to display your finished proofreading project as a final product in the Wikisource main namespace.

Full-page transclusion

There are two methods of full-page transclusion: the <pages> tag, which can display a series of pages, and the {{Page}} template, which can display individual pages or sections of pages. Most of the time, the <pages> tag will be the best method of transcluding into chapters. The syntax is as follows, where file_name.djvu stands for the name of the DjVu or PDF file:

  • <pages index="file_name.djvu" from=20 to=40 />
  • from is the beginning page;
  • to is the ending page.

If you experience problems, post a question on the Scriptorium.


The Page template can transclude a single page. The syntax is as follows:

  • {{Page|Wind in the Willows (1913).DJVU/19|num=3}}
  • The page number is placed after a slash (/) following the file name.
  • num defines the physical page number, as numbered in the scanned text.

Partial transclusion

If two logical sections (for example the end of one chapter followed by the beginning of another) appear on the same page, it's necessary to transclude only the relevant part of the page.

This is accomplished using Labeled Section Transclusion (LST). The relevant parts of the page are marked with section tags, and then, when transcluding, only the relevant part of the page, rather than the entire page, is called.

To mark sections in the "పుట:" namespace, insert the following syntax into the typed proofreading text to label the end of Chapter 1 and the start of Chapter 2 (where both are on the same page):

  • <section begin=chapter1 />This is Chapter 1.<section end=chapter1 />
  • <section begin=chapter2 />This is Chapter 2.<section end=chapter2 />

To transclude these sections, the {{Page}} template is used as follows, replacing "DjVu or PDF" with the DjVu or PDF file name and "#" with the page number where the text appears:

  • {{Page|DjVu or PDF/#|section=chapter1}}
  • {{Page|DjVu or PDF/#|section=chapter2}}

Alternatively, the following format can be used, replacing "article" with the article name (excluding namespace):

  • {{#section:పుట:article|chapter1}}
  • {{#section:పుట:article|chapter2}}
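
What a labeled section selects can be illustrated with a short script. This is a simplified model rather than the parser used by the wiki software: the function name extract_section is hypothetical, and the sketch only looks for a matching pair of begin and end tags with the given label and returns the text between them.

```python
import re


def extract_section(page_text, name):
    """Return the text enclosed by <section begin=NAME /> ... <section end=NAME />."""
    label = re.escape(name)
    # The label may be written with or without double quotes.
    pattern = re.compile(
        r'<section\s+begin="?' + label + r'"?\s*/>'
        r'(.*?)'
        r'<section\s+end="?' + label + r'"?\s*/>',
        re.DOTALL,
    )
    # A label may mark several spans on one page; join them in page order.
    return "".join(match.group(1) for match in pattern.finditer(page_text))
```

Given the two marked lines shown earlier, asking for chapter1 returns only "This is Chapter 1." and asking for chapter2 returns only "This is Chapter 2.", which mirrors what the section parameter requests from a page.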

Single-section transclusion

If a single logical section begins and ends within the content of a single పుట: page, it is desirable to transclude only the relevant section from the page content.

This is also accomplished by using Labeled Section Transclusion (LST). Again, the relevant part of the page's content is marked with a beginning and an ending section tag, and then, upon transclusion, only the marked part of the page, rather than the entire page, is called.

To mark sections in the "పుట:" namespace, insert the following syntax into the typed proofreading text to label the start and end of Section A (where Section A's content is found within the content of a single పుట: page):

  • <section begin="sectionA" />This is Section A text.<section end="sectionA" />

There are three basic methods, each with a few variations, for transcluding a single section found within a single page: the <pages> tag, the {{Page}} template, and the #section parser function.

In most instances, the <pages> tag will be the best method of transcluding a single section. Since May 2012 it has also been the community's recommended method. The syntax is as follows, where file_name.djvu stands for the name of the DjVu or PDF file:

  • <pages index="file_name.djvu" from="20" to="20" onlysection="sectionA" />
  • from is the first page of the page range to transclude;
  • to is the last page of the page range to transclude
(in this case, the page range consists of a single page, page 20);
  • onlysection is the label name assigned to the section tags containing the desired content.

The {{Page}} template can also be used, as follows, replacing "DjVu or PDF" with the DjVu or PDF file name and "#" with the page number where the text appears:

  • {{Page|DjVu or PDF/#|section=sectionA}}

Alternatively, the following format can be used, replacing "article" with the article name (excluding namespace):

  • {{#section:పుట:article|sectionA}}

If you experience problems, post a question on the Scriptorium.

Combination multipage and partial transclusion

To add multiple pages, where the first and/or last page may require partial transclusion, the <pages> tag accepts two additional parameters, fromsection and tosection. Both are optional.

  • <pages index="file_name.djvu" from=20 to=40 fromsection="choice A" tosection="choice B" />

Additional parameters

In early May 2012, new optional parameters were added to the <pages> tag to extend its basic method of transclusion for multi-page works (both .djvu and .pdf files). These parameters allow for a stepped page range, the exclusion of pages or page ranges, and the inclusion of pages or page ranges as needed.

The new parameters are:

  • step
Transclude every nth page. For example, <pages from=1 to=10 step="2" /> shows the 1st, 3rd, 5th, 7th and 9th pages.
  • exclude
Leave out the listed pages. For example, <pages from=1 to=10 exclude="2-5,9" /> shows the 1st, 6th, 7th, 8th and 10th pages.
  • include
Add the listed pages. For example, <pages include="2-5,9" /> shows the 2nd, 3rd, 4th, 5th and 9th pages.

All of these attributes can be used on the same tag. For example, <pages from=1 to=10 include="31" exclude="2-4" step="2" /> shows the 1st, 5th, 7th, 9th and 31st pages.
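
How the attributes combine can be worked through with a small model. The sketch below is not the extension's code; the names parse_page_list and select_pages are hypothetical, and the order of operations is an assumption chosen to reproduce the examples above: the from/to range and the include list are merged, the exclude list is removed, and step then keeps the pages whose number leaves the same remainder as the first remaining page.

```python
def parse_page_list(spec):
    """Turn a list such as "2-5,9" into the set {2, 3, 4, 5, 9}."""
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start, end = part.split("-", 1)
            pages.update(range(int(start), int(end) + 1))
        else:
            pages.add(int(part))
    return pages


def select_pages(from_page=None, to_page=None, include="", exclude="", step=1):
    """Model which page numbers a <pages> tag with these attributes would show."""
    selected = parse_page_list(include)
    if from_page is not None and to_page is not None:
        selected.update(range(from_page, to_page + 1))
    selected -= parse_page_list(exclude)
    ordered = sorted(selected)
    if step > 1 and ordered:
        # Assumption: step is applied to page numbers, anchored on the first page.
        offset = ordered[0] % step
        ordered = [page for page in ordered if page % step == offset]
    return ordered
```

With from 1, to 10, include "31", exclude "2-4" and step 2, the model first collects pages 1 and 5 to 10 plus 31, and then keeps only the odd-numbered ones, which gives the 1st, 5th, 7th, 9th and 31st pages quoted above.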

Page with image transclusion

The preferred means of placing a picture or other image in the main pagespace is to upload the cropped image to Wikimedia Commons as a separate picture, for example as a png or jpeg file.

  • [[File:Title and number and subject.png|frameless|center]]

A temporary means of displaying an image is also available. To display the image of a page in the DjVu or PDF file, as at పుట:Personal Recollections of Joan of Arc.DJVU/9, use:

  • {{use page image|Personal Recollections of Joan of Arc.djvu|9|caption=JOAN'S VISION}}

The page image can also be displayed in the book's Wikisource mainspace, as at Personal Recollections of Joan of Arc/Book I/Chapter 2, using:

  • [[Image:Personal_Recollections_of_Joan_of_Arc.djvu|page=27|right|thumbnail|200px|THE FAIRY TREE]]

Beginning the proofreading project

See Help:DjVu or PDF files

Most proofreading projects use DjVu or PDF files to contain scans or photographs of each page of a document, together with an OCR layer representing the extracted text. To convert a .pdf to .djvu format (even "secured" .pdf files), the GPLv2 open source application pdf2djvu may be used.

Editing DjVu files

Many of the already-created DjVu files available at archive.org and other sites have the Google copyright page attached to the front of the document. Wikimedia policy, based on an analysis of the underlying law, does not accept that copyright is established on a public domain work simply by scanning or copying it or taking a two-dimensional photograph that faithfully represents its subject. See Wikimedia Commons for more information about scans, artwork and the position of the WMF.

Such copyright pages and other extraneous material can be removed with DjVuLibre, an open source program maintained by the inventors of DjVu under the GNU General Public License. Binaries are available for Windows, Mac, Linux, Solaris, and IRIX. It includes djvm.exe, which is run as a command-line utility. If you cannot figure out how to do this, you can message Mkoyle (talk), and he will do it for your file and email the file to you for upload. The command line to delete (-d) the first page (1) is as follows:

djvm -d filename.djvu 1

Producing పుట: files for side by side editing

See Help:Beginner's guide to సూచిక: files

Once a .DjVu or PDF document has been produced and uploaded to Wikimedia Commons, the index is started as a new article under the name "సూచిక:[name of Wikimedia Commons file]" using Mediawiki:Proofreadpage index template. Copy the full parameter list and fill in what you can. Set "Progress=MS" and "Pages=<pagelist />". You should include a Wikilink in the "Title" which points to the text article at Wikisource (which may have the same name as the index file, but does not contain "సూచిక:").

New files for each page in the పుట: namespace are then produced for the file by using User:ThomasBot. See Help:Match and split for this procedure.

Discussion

The side-by-side ProofreadPage extension shows a transcribed text and a scan of the original document on one page. These pages use the prefix 'పుట:', and collections of them are displayed on a page whose name begins with 'సూచిక:'. While many file types are supported by the extension, a document at Wikisource is usually a DjVu or PDF with OCR.

The ProofreadPage extension is enabled by default at Wikisource and should come up automatically when a page in the "పుట:" namespace is edited. However, for this to work the editor's browser (and extensions such as NoScript) must allow script processing. Your Special:Preferences page (section "Gadgets") allows you to control certain features, such as whether the OCR button is enabled and whether the text by default appears side by side or one above another.

Users new to proofreading can experiment with the concept and test their abilities with the simple introductory tests on the Distributed Proofreaders website. Working examples can be seen by finding a project in progress, such as Wikisource:Proofread of the Month.

Once you've found a project you want to work on, you'll want to go to the index page. In it you'll find links to many pages for the project, colored by their status. After selecting a page that needs work (not green), you'll go into the page, open up the editor, and make whatever changes (either to the document or the status) are appropriate, preview & save.

Anybody is able to proofread and correct most pages at Wikisource. However, editors must log into an account in order to change the proofread status. IP addresses cannot change this status.

When corrections and formatting are complete, the page is marked as proofread and is ready for the main namespace. Leave the page as 'not proofread' until it is done. Mark it as problematic if appropriate.

Wikisource uses the ProofreadPage extension, which allows you to render text along with its corresponding scanned image. It is intended to allow easy comparison of text to the original. It has the following advantages:

  • Credibility: it makes it possible for Wikisource to guarantee that the text corresponds to its scanned source.
  • Improved collaboration: texts can be proofread and typos can be fixed by everyone, by providing direct access to the book. This restores the wiki way of collaborating.
  • Security: text is better protected against vandalism (any falsification can be detected immediately; texts are not accessed directly, but through transclusion, which deters inexperienced vandals).
  • No limitations on rendering: a book can be rendered in two different ways, without duplicating data:
  1. As a set of pages. Each page is a column of OCR text beside a column of scanned image. This mode is meant for contributors.
  2. Broken into its logical organization (such as chapters or poems) using transclusion. This mode is meant for readers.
  • Fairness of comparisons: since book pages are not in the 'main' namespace, they are not included in the statistical count of text units. A count of pages is available [1]. This method of comparison uses the same unit of measure for all texts (the page), which puts an end to the temptation of slicing texts into arbitrarily small units in order to increase statistics.

Limitations

The poem tag does not work well because it adds a carriage return at the end of a block. It's also not possible to use <pre> formatting, since the line breaks are suppressed during transclusion. To solve this issue, add <br /> tags to the beginning of lines.

To ease proofreading images that are rotated, the Rotate Image Firefox extension can be used.

  • Category:Index: category of page scan indexes of works being proofread or validated

Plus our active proofreading work

మూస:PotM

English wikisource Guide for Distributed Proofreading

Please mention any adaptations for a specific Telugu book on that book's index talk page.