Lesson: "Text Information"

Text as a form of information presentation

Text is any verbal statement printed, written or existing in oral form.

Information presented in written form is called text information.

Amazing fact!Skipping, replacing or rearranging at least one character in the text sometimes changes its meaning:

Who came to us! (the meaning is that the person who entered was seen and recognized)

Who came to us? (is a question that emphasizes the uncertainty and uncertainty of the situation).

For thousands of years people have recorded information.

Information carriers: stone, clay, wood, papyrus, parchment, paper.

But in order to make changes to the text, it had to be rewritten. And this is a very long and laborious process. The advent of computers has changed the technology of writing. With the help of special computer programs, you can type any text, make changes to it if necessary, write the text to the computer's memory for long-term storage, print any number of copies on the printer, send the text using Email to other computers.

Text Documents

Document - any text created with text editor, along with non-textual materials included in it.

The text document can be:

Hypertext is a way of organizing a document so that it can be quickly found. necessary information . The transition from one place in the hypertext to another is carried out with the help of links.

Example: You are reading an encyclopedia about animals and you are especially interested in information about dogs. Let the word "sheepdog" be underlined - this means a link within the hypertext. If you click on this word with the mouse, you will be taken to another encyclopedia article that talks about this breed of dog.

The main objects of a text document are: character, word, line, paragraph, page, fragment.

Symbol - number, letter, punctuation mark, etc.

Word – an arbitrary sequence of characters, limited on both sides by service characters (space, brackets, comma, etc.).

Line – an arbitrary sequence of characters between the left and right borders of the document.

Paragraph – an arbitrary sequence of characters, limited by special characters for the end of a paragraph.

Fragment is an arbitrary sequence of characters. A fragment can be a single word, a line, a paragraph, a page, or even the entire text you type.

The computer is the main tool for preparing texts

Installed on any computer special programs, designed to create texts, - text editors .

To create and design stories, reports, articles for school newspaper containing inscriptions, tables, diagrams, drawings, photographs, use more powerful text editors. They are also called word processors.

The window of one of the simple word processors is shown in fig. 24 p.58.

Preparing a document on a computer consists of several stages:input (kit)text , editing , formatting andseal.

Entering text

When entering text, keep the following in mind.rules:

    Use capital letters where necessary.

    All punctuation marks, except for a dash, should be placed immediately after the last letter of the word; After any punctuation mark, press the Space key. Emphasize dashes with spaces on both sides.

    Avoid entering two or more spaces in a row; do not use a space to align paragraph boundaries.

    Don't watch the end of the line: as soon as it is reached, the cursor will automatically move to the beginning of the next line.

    To enter a new paragraph, press the keyEnter.

It will be much easier to control compliance with typing rules if you set the display mode for non-printing characters.

Text editing

Editing - this is the next stage of document preparation, which begins after the input of information, as a result of which the document is checked for its correctness and the errors found are corrected, as well as the necessary changes are made.

Using the cursor arrows, the cursor can be moved around the entire screen, bringing it to any character. To move around the entire document, special keys or key combinations are intended (see the textbook, p. 59).

When editing, you can work not only with individual characters, but also with entire text fragments. The fragment must first be selected. To do this, place the mouse pointer at the beginning of the desired fragment and, holding the mouse button down, drag the pointer to its end. The text is highlighted in a contrasting color. The selection of a fragment can be canceled by clicking anywhere in the workspace of the window.

Text formatting

Formatting - this is one of the stages of preparing a document, during which various operations are performed to draw up a document.

Paragraph alignment:

The font is full set letters of the alphabet with the general style of their image.

Font style:

To print a document on paper, a printer is connected to the computer -Printer.

Unicode- a character encoding standard that allows you to represent the signs of almost all written languages.
The standard was proposed in 1991 by the non-profit organization Unicode Consortium. The application of this standard makes it possible to encode very big number characters from different scripts: Chinese characters, mathematical symbols, letters of the Greek alphabet, Latin and Cyrillic letters can coexist in Unicode documents, while switching code pages becomes unnecessary.
The standard consists of two main sections: the universal character set and the encoding family. The universal character set specifies a one-to-one correspondence of characters to codes - elements of the code space representing non-negative integers. An encoding family defines the machine representation of a sequence of UCS codes.
Codes in the Unicode standard are divided into several areas. The area with codes U+0000 to U+007F contains ASCII characters with corresponding codes. Next are areas of characters of various scripts, punctuation marks and technical symbols. Some codes are reserved for future use. Under Cyrillic characters, areas of characters with codes from U + 0400 to U + 052F, from U + 2DE0 to U + 2DFF, from U + A640 to U + A69F are allocated (see Cyrillic in Unicode).

Coding system

The Universal Coding System (Unicode) is a set of graphic characters and a way to encode them for computer processing of text data.
Graphic symbols are characters that have a visible image. Graphic characters are contrasted with control characters and formatting characters. Graphic symbols include the following groups: letters contained in at least one of the supported alphabets; numbers; punctuation marks; special signs (mathematical, technical, ideograms, etc.); separators.
Unicode is a system for linear representation of text. Characters that have additional superscript or subscript elements can be represented as a sequence of codes built according to certain rules (composite variant, composite character) or as a single character (monolithic variant, precomposed character).

Modifier symbols
Graphic characters in Unicode are divided into extended and non-extended (widthless). Unstretched characters do not take up space on the line when displayed. These include, in particular, accent marks and other diacritics. Both extended and non-extended characters have their own codes. Extended symbols are otherwise called basic, and non-extended - modifying; and the latter cannot meet independently. For example, the character "á" can be represented as a sequence of the base character "a" (U+0061) and the modifier character "́" (U+0301) or as a monolithic character "á" (U+00C1).
A special type of modifying characters are style variant selectors. They only affect those symbols for which such variants are defined. In version 5.0, font styles are defined for a number of mathematical symbols, traditional Mongolian characters, and Mongolian square characters.

Forms of normalization
Since the same characters can be represented by different codes, which sometimes makes processing difficult, there are normalization processes designed to bring text to a certain standard form.
The Unicode standard defines 4 forms of text normalization:
The normalization form D (NFD) is the canonical decomposition. In the process of bringing the text into this form, all compound characters are recursively replaced by several compound characters, in accordance with the decomposition tables.
The normalization form C (NFC) is canonical decomposition followed by canonical composition. First, the text is reduced to form D, after which canonical composition is performed - the text is processed from beginning to end and the following rules are followed:
An S character is initial if it has modification class zero in the Unicode character base.
In any sequence of characters that starts with a start character S, the character C is blocked from S if and only if there is any character B between S and C that either starts or has the same or greater modification class than C. This is the rule applies only to strings that have undergone canonical decomposition.
A primary composite is a character that has a canonical decomposition in the Unicode character base (or a canonical decomposition for Hangul and is not on the exclusion list).
A symbol X can be primary combined with a symbol Y if and only if there exists a primary composite Z that is canonically equivalent to the sequence .
If the next symbol C is not blocked by the last encountered initial base symbol L and it can be successfully primary-matched with it, then L is replaced by an L-C composite, and C is removed.
The normalization form KD (NFKD) is a compatible decomposition. When cast to this form, all compound characters are replaced using both canonical Unicode decomposition maps and compatible decomposition maps, after which the result is put in canonical order.
The normalization form KC (NFKC) is a compatible decomposition followed by canonical composition.

The terms "composition" and "decomposition" mean, respectively, the connection or decomposition of symbols into their component parts.

Examples

Source textNFDNFCNFKDNFKC
FrancaisFranc\u0327aisFran\xe7aisFranc\u0327aisFran\xe7ais
A, Yo, Y \u0410, \u0401, \u0419\u0410, \u0415\u0308, \u0418\u0306\u0410, \u0401, \u0419
\u304b\u3099\u304c\u304b\u3099\u304c
Henry IVHenry IVHenry IVHenry IVHenry IV
Henry ⅣHenry \u2163Henry \u2163Henry IVHenry IV

Unicode includes almost all modern scripts, including:
Arabic, Armenian, Bengali, Burmese, Glagolitic, Greek, Georgian, Devanagari, Hebrew, Cyrillic, Chinese (Chinese characters are actively used in Japanese, and also quite rarely in Korean), Coptic, Khmer, Latin, Tamil, Korean (Hangul) , Cherokee, Ethiopian, Japanese (which includes, in addition to Chinese characters also the syllabary)
and others.

Many historical scripts have been added for academic purposes, including: runes, ancient Greek, Egyptian hieroglyphs, cuneiform, Mayan script, Etruscan alphabet.

Presentation methods
Unicode has several representation forms: UTF-8, UTF-16 (UTF-16BE, UTF-16LE) and UTF-32 (UTF-32BE, UTF-32LE). A form of UTF-7 representation was also developed for transmission over seven-bit channels, but due to incompatibility with ASCII, it was not widely used and was not included in the standard. On April 1, 2005, two joke representations were proposed: UTF-9 and UTF-18 (RFC 4042).
AT Microsoft Windows NT and based on it Windows systems 2000 and Windows XP mostly use the UTF-16LE form. The UNIX-like operating systems GNU/Linux, BSD, and Mac OS X adopt the UTF-8 form for files and UTF-32 or UTF-8 for handling characters in RAM.

UTF-8
UTF-8 is a Unicode representation that provides the best compatibility with older systems that used 8-bit characters. Text consisting only of characters less than 128 is converted to plain ASCII text when written in UTF-8. Conversely, in UTF-8 text, any byte with a value less than 128 represents an ASCII character with the same code. The remaining Unicode characters are represented by sequences from 2 to 6 bytes in length (in fact, only up to 4 bytes, since there are no characters in Unicode with a code greater than 10FFFF, and there are no plans to introduce them in the future), in which
the first byte is always 11xxxxxx and the rest 10xxxxxx.
The UTF-8 format was invented on September 2, 1992 by Ken Thompson and Rob Pike and implemented in Plan 9. The UTF-8 standard is now officially enshrined in RFC 3629 and ISO/IEC 10646 Annex D.
UTF-8 characters are obtained from Unicode like this:

Unicode UTF -8: 0x00000000 -xxxxxxxxxxxx10xxx 10xx 10 Byte order
In a UTF-16 data stream, the high byte can be written either before the low byte or after the low byte. Similarly, there are two variants of the four-byte encoding - UTF-32BE and UTF-32LE.
To determine the Unicode representation format, a signature is written to the beginning of the text file - the U + FEFF character (non-breaking space with zero width), also called the byte order mark (BOM). This makes it possible to distinguish between UTF-16LE and UTF-16BE, since the U+FFFE character does not exist. Also, this method is sometimes used to denote the UTF-8 format, although the concept of endianness does not apply to this format. Files following this convention begin with these byte sequences: UTF-8 EF BB BFUTF-16BE FE FFUTF-16LE FF FEUTF-32BE 00 00 FE FFUTF-32LE FF FE 00 00
Unfortunately, this method does not reliably distinguish between UTF-16LE and UTF-32LE, since the character U+0000 is allowed by Unicode (although real texts rarely begin with it).
UTF-16 and UTF-32 encoded files that do not contain a BOM must be in big-endian byte order (unicode.org).

Unicode and traditional encodings
The introduction of Unicode has led to a change in approach to traditional 8-bit encodings. If earlier the encoding was set by the font, now it is set by the correspondence table between this encoding and Unicode. In fact, 8-bit encodings have become a form of representation of some subset of Unicode. This has made it much easier to create programs that need to work with many different encodings: now, to add support for another encoding, you just need to add another Unicode conversion table.
In addition, many data formats allow you to insert any Unicode character, even if the document is written in the old 8-bit encoding. For example, in HTML, you can use ampersand codes.

Implementations
Most modern operating systems provide Unicode support to some degree.
On operating systems Windows families NT for internal representation of filenames and others system lines double-byte encoding UTF-16LE is used. System calls taking string parameters, exist in single-byte and double-byte versions. For more information, see Unicode on Microsoft Operating Systems.
UNIX-like operating systems, including GNU/Linux, BSD, Mac OS X, use UTF-8 encoding to represent Unicode. Most programs can work with UTF-8 as with traditional single-byte encodings, regardless of the fact that a character is represented as multiple consecutive bytes. To work with individual characters, strings are usually recoded into UCS-4, so that each character corresponds to a machine word.
One of the first successful commercial implementations of Unicode was the Java programming environment. It fundamentally abandoned the 8-bit representation of characters in favor of 16-bit. Most programming languages ​​now support Unicode strings, although their representation may vary by implementation.

Input Methods
Because no keyboard layout can allow all Unicode characters to be entered at the same time, operating systems and application programs are required to support alternative methods for entering arbitrary Unicode characters.

Microsoft Windows
Starting with Windows 2000, the Character Map utility (charmap.exe) shows all the characters in the OS and allows you to copy them to the clipboard. There is a similar table, for example, in Microsoft Word.
Sometimes you can type a hexadecimal code, press Alt + X, and the code will be replaced with the corresponding character, for example, in WordPad, Microsoft Word. In editors, Alt+X also performs the reverse transformation.
In many MS Windows programs, to get a Unicode character, you need to press the Alt key to type the decimal value of the character code on the numeric keypad. For example, when typing Cyrillic texts, the combinations Alt + 0171 (“) and Alt + 0187 (”) will be useful. The combinations Alt+0133 (…) and Alt+0151 (-) are also interesting.

Macintosh
For Mac OS 8.5 and above later versions an input method called "Unicode Hex Input" is supported. While holding down the Option key, you need to type the four-digit hexadecimal code of the required character. This method allows you to enter characters with codes greater than U+FFFF using pairs of surrogates; such couples operating system will be automatically replaced with single characters. This input method must be activated in the appropriate section before use. system settings and then choose as the current input method in the keyboard menu.
Starting with Mac OS X 10.2, there is also a "Character Palette" application that allows you to select characters from a table in which you can select characters from a particular block or characters supported by a particular font.

GNU/Linux
GNOME also has a "Symbol Table" utility that allows you to display the characters of a specific block or writing system and provides the ability to search by character name or description. When the code of the desired character is known, it can be entered in accordance with the ISO 14755 standard: while holding down Ctrl keys and Shift to enter a hexadecimal code (starting with some version of GTK+, entering the code must be preceded by pressing the "U" key). The hexadecimal code you enter can be up to 32 bits long, allowing you to enter any Unicode character without using surrogate pairs.
All X Window applications, including GNOME and KDE, support input using the Compose key. For keyboards that don't have a separate Compose key, any key can be assigned for this purpose, such as Caps Lock.
The GNU/Linux console also allows you to enter a Unicode character by its code - for this decimal code characters must be entered with the numbers of the extended keyboard block while holding down the Alt key. You can also enter characters by their hexadecimal code: to do this, hold down the AltGr key, and to enter digits A-F use the keys of the extended keyboard block from NumLock to Enter (clockwise). Input according to ISO 14755 is also supported. In order for these methods to work, you must enable Unicode mode in the console with unicode_start(1) and select the appropriate font with setfont(8).
Mozilla Firefox for Linux supports ISO 14755 character input.

The basis of most electronic publications, as well as printed publications, is textual material. It is on the text that the main semantic load falls. . The text material of electronic publications can be prepared in text editors or software packages layout and formatted in accordance with the requirements for printed publications. In preparing for publication, each these programs prepares text files in its format, however, subsequent printing on hard media provides the same opportunities for the perception of text and graphic information.

The situation is different in electronic publications. Here, the perception of textual information is carried out on the basis of the presentation of any publication on the monitor screen. Since such publications can be prepared in various software environments, it is this environment or another one that is programmatically and informationally compatible with it that is needed to reproduce an electronic document. Thus, all the most probable software reading electronic publications.

One way to avoid this situation is to use standard text formats. But today there are standards (legal) only for ways to encode characters. Therefore, publications and publications in electronic editions and hypertext electronic HTML documents use ASCII ASNI codes or two-byte UNICOD code. The same approach was taken in the preparation electronic documents on Help-technologies.

Adobe Systems has proposed its own approach to solving the problem of software and information compatibility of electronic publications. As a standard (de facto) text format PDF markup format selected . This is due to the fact that in almost every text editor program or layout package there are translators of the format of a document created in specific program, in PDF format. Therefore, any electronic document can be presented in a single form (in the de facto PDF standard) and viewed in Adobe Acrobat.

In a number of electronic documents text material is used as a comment to graphic or animated elements of electronic publications. Such electronic publications include hypergraphic and multimedia publications.

Source text material electronic edition can be prepared in traditional ways: using a keyboard set; scanning text blocks of the original document with subsequent character recognition in the environment of the corresponding text recognition programs; transferred to a computer using magnetic media; transmitted over the network, etc.

Text plays a dual role in electronic documents. First of all, it carries the main semantic load in most types of electronic publications. As a rule, this text material appears in the form of text blocks or sections, which will later be combined into something whole in the mode of viewing an electronic publication or navigating through it.

On the other hand textual information is a backbone element electronic publications, since it is used for the purpose of navigating through electronic edition. This textual information is entered directly in the process of generating navigation elements.

A number of special programs have been created to automate the preparation of electronic documents.

Text information Text as a form of information presentation Text documents The computer is the main tool for preparing texts Entering text Editing text Formatting text

Keywords Text document Text editor Text entry rules Editing Formatting

Text as a form of information presentation! Text is any verbal statement printed, written or existing in oral form. Information presented in the form of written text is called textual information.

Surprising fact Skipping, replacing or rearranging at least one character in the text sometimes changes its meaning:

Text Documents A document is any text created with a text editor, along with non-text materials included in it. Text document Article Invitation Announcement Report Story Poem

Text Document Objects The main objects of a text document are: q character q word q line q paragraph q page q fragment

Stages of preparing a document Preparing a document on a computer consists of several stages: entering (typing) text, editing, formatting and printing.

Entering Text When entering text, adhere to the following rules: 1. Use capital letters where necessary. 2. All punctuation marks, except for a dash, put immediately after the last letter of the word; After any punctuation mark, press the Space key. Emphasize dashes with spaces on both sides. 3. Avoid entering two or more spaces in a row; do not use a space to align paragraph boundaries. 4. Do not watch the end of the line: once it is reached, the cursor will automatically move to the beginning of the next line. 5. To move on to entering a new paragraph, press the Enter key.

Editing the text Editing is the next stage of document preparation, which begins after the input of information, as a result of which the document is checked for its correctness and errors are corrected, as well as the necessary changes are made. Basic editing operations

Text Formatting Formatting is one of the stages of document preparation, during which various operations are performed to format the document. Format Operations

Printing a Document Dot Matrix Inkjet Laser Printer To print a document on paper, a printer is connected to the computer.

Most importantly, a text is any verbal statement, printed, written or existing in oral form. Information presented in the form of written text is called textual information. Special programs - text editors - are designed for processing text information. Any text created with a text editor, along with non-text material included in it, is called a document. Preparation of a document on a computer consists of such stages as input (typing), editing and formatting.

Questions and tasks? 2. Specify elimination algorithms the following errors. Extra symbol 1. Position the cursor at the insertion point of the symbol. 2. Press the key with the required symbol. Missing character 1. Place the cursor behind the wrong character. 2. Press the Back key. space. 3. Press the key with the correct character. Invalid character 1. Place the cursor in front of the extra character. 2. Press the Delete key.

      Keywords:
      • text
      • Text Document
      • text editor
      • text entry rules
      • editing
      • formatting

Text as a form of information presentation

Every written text is a certain sequence of characters. The omission, replacement or rearrangement of at least one character in the text sometimes changes its meaning. Consider two phrases that differ from each other by a single, last character:

Who came to us!

Who came to us?

The meaning of the first sequence of symbols is that the person who entered was seen and recognized. The second sequence of characters is a question, emphasizing the uncertainty and uncertainty of the situation.

For thousands of years people have recorded information. During this time, the things on which information was recorded (stone, clay, wood, papyrus, parchment, paper) and the means by which it was done (a sharp stone, a bone stick, a bird feather, fountain pens, fountain pens, At the end of the 19th century, a typewriter began to be used to perform written work). But the main thing did not change: in order to make changes to the text, it had to be rewritten. And this is a very long and laborious process.

The advent of computers has fundamentally changed the technology of writing. With the help of special computer programs, you can type any text, make changes to it if necessary, write the text to the computer's memory for long-term storage, print any number of copies of the text on the printer without re-typing it, or send the text via e-mail to other computers.

You will find additional information about the history of the creation of text documents in the electronic appendix to the textbook.

Text Documents

A text document can be an article, report, story, poem, invitation, announcement, greeting card. When working on a network, parts of one complex document can be stored on different computers located far from each other.

Hypertext is a way of organizing a document so that you can quickly find the information you need. It is often used in the construction of online help systems and computer versions of large reference books and encyclopedias. The transition from one place in the hypertext to another is carried out with the help of links. For example, let's say you're reading an animal encyclopedia and you're especially interested in information about dogs. Let's assume that the word "bulldog" is underlined - this indicates a link within the hypertext. If you click on this word with the mouse, you will be taken to another encyclopedia article that talks about this breed of dog.

The main objects of a text document are: character, word, line, paragraph, page, fragment.

Symbol- number, letter, punctuation mark, etc.

Word- an arbitrary sequence of characters (letters, numbers, etc.), limited on both sides by service characters (such as space, comma, brackets, etc.).

Line- an arbitrary sequence of characters between the left and right borders of the document.

Paragraph- an arbitrary sequence of characters, limited by special characters for the end of a paragraph. Empty paragraphs are allowed.

Fragment- an arbitrary sequence of characters. A fragment can be a single word, a line, a paragraph, a page, or even the entire text you type.

Computer - the main tool for preparing texts

Text preparation is one of the most common applications of computers. Any computer has special programs designed for creating texts - text editors.

You have already worked with a simple text editor. With it, you can create simple texts consisting of letters, numbers, punctuation marks, and special characters, which can be entered using the keyboard.

To create and design stories, reports, articles for the school newspaper containing inscriptions, tables, diagrams, drawings, photographs, more powerful text editors are used. They are also called word processors.

The window of one of the simple word processors is shown in fig. 24.

Rice. 24

Preparation of a document on a computer consists of such stages as input (typing), editing, formatting and printing.

Entering text

Input (typing) of text, as a rule, is carried out using the keyboard. The place for entering the next character of text is indicated on the monitor screen using a flashing vertical bar- cursor.

When entering text, adhere to the following rules:

  1. Use capital letters where necessary.
  2. All punctuation marks, except for a dash, should be placed immediately after the last letter of the word; After any punctuation mark, press the Space key. Emphasize dashes with spaces on both sides.
  3. Avoid entering two or more spaces in a row; do not use a space to align paragraph boundaries.
  4. Don't watch the end of the line: as soon as it is reached, the cursor will automatically move to the beginning of the next line.
  5. Press the Enter key to enter a new paragraph (or line of a poem).

It will be much easier to control compliance with typing rules if you set the display mode for non-printing characters.

Text editing

Editing is the next step in preparing a document on a computer. When editing text, you review it to make sure everything is correct, correct any errors you find (for example, in spelling) and make the necessary changes.

If the text is large, then only part of it will be visible on the screen, and all of it will be stored in the computer's memory.

Using the cursor arrows, the cursor can be moved around the entire screen, bringing it to any character. To move around the entire document, special keys or key combinations are used:

In addition, there is a scrolling mode that allows you to quickly display parts of the text that are outside of it.

When editing, you can work not only with individual characters, but also with entire text fragments. The fragment must first be selected. To do this, place the mouse pointer at the beginning of the desired fragment and, holding the mouse button down, drag the pointer to its end.

The selection of a fragment can be canceled by clicking anywhere in the workspace of the window.

The selected fragment can be removed from the text and erased from memory, or it can be removed from the text, but placed in a special section of memory called a buffer. In this case, the deleted fragment can either be returned to its original place, or placed in another, more suitable place in the text (Fig. 25).

Rice. 25

Sometimes you have to enter texts in which individual lines, or even groups of lines, are repeatedly repeated (remember S. Marshak's poem "Baggage" or some song with a refrain). A repeating fragment is typed only once, then it is selected and copied using a special button - the fragment itself remains in its original place, and its exact copy is placed in the buffer. After that, you continue typing and, having reached the place where the repeating fragment should be, paste it from the buffer. This procedure can be repeated many times.

Word processors allow you to find a given word in the text, if necessary, automatically replace one word with another in the entire text.

Modern word processors can also be tasked with detecting and correcting spelling errors.

Text formatting

At the formatting stage, various operations are performed to format the document.

First, paragraphs of text are aligned.

When text is left-aligned, the paragraph's left border forms a straight line. In this case, all lines have the same indents from the edge of the page. This paragraph is aligned to the left.

When text is right-aligned, the right border of the paragraph forms a straight line. Each line of a paragraph ends at the same distance from the edge of the page. This paragraph is right-aligned.

Center-aligned, or centered, text is arranged like this: on both sides of each line, the width of the free space is the same. On both sides, the edges of the paragraph are uneven. This paragraph is center aligned.

Many text editors "can" automatically break text into pages and number them. They can monitor the size of the margins and adjust the spacing between lines, offer a choice of different font options.

A font is a complete set of letters of the alphabet with a common style for their representation.

Normal font does not stand out.

  • Bold font is darker and more visible.
  • Italic font is slanted.
  • Underlined font.

The font size, or point size, is the height of the font, measured from the bottom edge of the smallest letter (such as "p" or "y") to the top edge of the tallest letter (such as "b"). Font size is measured in points. One point is a very small unit, equal to 1/72 of an inch 1 (0.3 mm), i.e. a 72 point font is 1 inch high. Most books use 10-12 point font most often.

    1 inch is a unit of length in the English system of measures, equal to 2.54 cm.

Additional information about fonts can be found in the electronic appendix to the textbook.

To print a document on paper, a printer is connected to the computer. Exist different types printers.

A dot matrix printer prints using metal needles that press an ink ribbon against the paper. Hitting the tape, they leave a pattern of dots on the paper - the matrix of the letter.

An inkjet printer prints letters on paper by spraying droplets of liquid ink over it.

AT laser printer used to print characters laser ray. This allows you to get typographic print quality.

Important information about creating text documents on a computer is given in the video lecture "Techniques for working with text", posted in the Unified Collection of Digital educational resources(sc.edu.ru).

The most important

Text is any verbal statement, printed, written or existing in oral form.

Information presented in the form of written text is called textual information.

Special programs - text editors - are designed for processing text information.

Any text created with a text editor, along with non-text material included in it, is called a document.

Preparation of a document on a computer consists of such stages as input (typing), editing and formatting.

Questions and tasks

  1. Tell us about the textual form of information presentation. What other forms of information presentation do you know? Tell us about the advantages or disadvantages of presenting information in the form of text compared to the one you described.
  2. For what purpose do you create texts? Give two or three examples.
  3. Give examples of texts that differ in size, design, and purpose.
  4. What fundamental changes did the computer make to the process of creating text?
  5. How do you understand the meaning of the saying: “What is written with a pen cannot be cut down with an ax”? Do you agree with this?
  6. What is common and what is the difference of opportunities word processor and a text editor?
  7. What do you understand by text document?
  8. List the main stages of preparing a text document on a computer.
  9. What rules must be followed when typing (typing) text?
  10. How can text be transformed at the stage of its editing?
  11. What methods of "moving" through a large text document do you know?
  12. Why do you need to format text? How can the text be converted at this stage?
  13. What is the name of the device for outputting a document to paper?
  14. Which way of creating text - computer or handwritten - do you like more? Justify your choice.
  15. One of the meanings of the word "editor" is a person who corrects a manuscript with the consent of the author. Try to use this information to explain why computer programs for creating texts are called text editors.

Computer workshop

Work 5 "Enter text"
Work 6 "Editing the text"
Work 7 "We work with text fragments"
Job 8 "Format the text"