The concept of 1. 1. 1
1, algorithm concept:
Mathematically, "algorithm" in the modern sense usually refers to a kind of problem that can be solved by computer, which is a procedure or step. These plans or steps must be clear and effective and can be completed in a limited number of steps.
2. The characteristics of the algorithm:
(1) finiteness: The sequence of steps of the algorithm is finite, and it must stop after a finite number of operations, not an infinite number.
(2) Determinism: Every step in the algorithm should be deterministic, can be effectively executed and get certain results, and should not be fuzzy.
(3) Sequence and correctness: The algorithm starts from the initial step and is divided into several definite steps. Each step can only have one definite subsequent step, and the former step is the premise of the latter step. Only when the previous step is carried out can the next step be carried out, and each step is accurate to complete the problem.
(4) Uniqueness: the solution of a problem is not necessarily unique, and a problem can have different algorithms.
(5) Universality: Many specific problems can be solved by designing reasonable algorithms, such as mental calculation and calculator calculation, which must be solved through limited and pre-designed steps.
1. 1.2 program block diagram
1, the basic concept of program block diagram:
The concept of (1) program composition: program block diagram, also known as flow chart, is a graph that accurately and intuitively represents the algorithm with specified graphics, pointing lines and text descriptions.
The program block diagram includes the following parts: program blocks representing corresponding operations; Streamlines with arrows; Necessary text description outside the program box.
(2) Graphical symbols and their functions that constitute the program box.
Program box name function
The start-stop box indicates the beginning and end of an algorithm, which is indispensable for any flow chart.
I/O box represents the input and output information of the algorithm, which can be used in any position where input and output are needed in the algorithm.
In the algorithm, formulas such as box assignment, calculation and calculation formula required for data processing are written in different processing boxes for data processing.
The judgment box judges whether a certain condition is established, and marks "Yes" or "Y" at the exit when it is established; If not, please mark "No" or "No".
When learning this part of knowledge, we should master the shape, function and usage rules of each graphic. The rules for drawing program block diagram are as follows:
1. Use standard graphic symbols. 2. The block diagram is generally drawn from top to bottom and from left to right. 3. Except for the judgment box, most flow chart symbols have only one entry point and one exit point. The decision box has a unique symbol and multiple exit points. 4. The judgment box is divided into two categories, one is the judgment of "yes" and "no", and there are only two results; The other is multi-branch judgment, which has several different results. 5. The language described in graphic symbols should be very concise and clear.
(3) Three basic logical structures of the algorithm: sequence structure, conditional structure and cyclic structure.
1, sequence structure: sequence structure is the simplest algorithm structure. Reports and boxes are made from top to bottom. It consists of several processing steps that are executed in turn. It is the basic algorithm structure that any algorithm can't do without.
Sequence structure's embodiment in the program block diagram is to make the program block from top to bottom by pipeline.
Connect the fields and perform the algorithm steps in sequence. Such as boxes a and b in the schematic diagram.
These boxes are executed in sequence, and can only be executed after the operation specified in box A is executed.
The operation specified in the box in line b.
2. Conditional structure:
Conditional structure refers to the judgment of conditions in the algorithm.
According to whether the conditions are true or not, the algorithm structures with different flow directions are selected.
Whether the condition P is true or not, choose to execute box A or box B. No matter whether the condition P is true or not, only one of box A or box B can be executed. It is impossible to execute box a and box b at the same time, and it is impossible to execute both. A judgment structure can have multiple judgment boxes.
3. Circular structure: In some algorithms, it is often that a processing step is repeatedly executed from a certain place according to certain conditions. This is the cycle structure, and the repeated processing steps are the cycle body. Obviously, the loop structure must contain the conditional structure. Circular structures, also known as repetitive structures, can be subdivided into two categories:
(1), one is the current circulation structure, as shown in the left figure below. Its function is to execute block a when the given condition p holds. After the execution of block A is completed, it will be judged whether condition P is established. If it is still true, block A is executed again, and so on until condition P is not true once. At this point, block A will not be executed and the loop structure will be left.
(2) The other is the until cycle structure, as shown in the right figure below. Its function is to execute first, and then judge whether the given condition P is true or not. If p still does not hold, continue to execute block A until the given condition P holds, then stop executing block A and leave the loop structure.
When the loop structure reaches the loop structure.
Note: the loop structure of 1 will terminate the loop under certain conditions, which requires conditional structure to judge. Therefore, the loop structure must contain conditional structure, but "infinite loop" is not allowed. There is a count variable and an accumulation variable in the loop structure. The counting variable is used to record the number of cycles, and the cumulative variable is used to output the results. Counting variables and accumulating variables are generally executed synchronously, accumulating once and counting once.
1.2. 1 input, output and assignment statements
1, enter the statement
(1) General format of input statement
(2) The function of input sentence is to realize the information input function of the algorithm; (3) "Prompt content" prompts the user what information to input, and variables refer to variables whose values can be changed when the program is running; (4) The input statement requires that the input value can only be a specific constant, not a function, variable or expression; (5) Use semicolon ";" Between the prompt content and the variable. Separate. If multiple variables are entered, the variables are separated by commas.
2. Output statement
General format of (1) output statement
(2) The function of the output statement is to realize the output result function of the algorithm; (3) "Prompt content" prompts the user what information to input, and the expression refers to the data to be output by the program; (4) The output statement can output the values of constants, variables or expressions and characters.
3. Assignment statement
General format of (1) assignment statement
(2) The function of the assignment statement is to assign the value expressed by the expression to the variable; (3) The "=" in the assignment statement is called the assignment number, which is different from the equivalent number in mathematics. The left and right sides of the assignment number cannot be interchanged, it assigns the value of the expression on the right side of the assignment number to the variable on the left side of the assignment number; (4) The left side of the assignment statement can only be a variable name, not an expression, and the expression on the right side can be data, constant or formula; (5) A variable can be assigned multiple times.
Note: ① The left side of the assignment number can only be a variable name, not an expression. 2=X is wrong. ② The left and right distribution numbers cannot be interchanged. For example, "A = B" and "B = A" have different meanings. ③ Algebraic calculus cannot use assignment statements. (such as simplification, factorization, solving equations, etc.). (4) The assignment symbol "=" has different meanings from the equal sign in mathematics.
1.2.2 conditional statement
Conditional statements generally have two formats: (1) if-then-else statements; (2) If-then statement. 2.If-then-else statement
The general format of if-then-else statement is figure 1, and the corresponding program block diagram is figure 2.
Figure 1 Figure 2
Analysis: In the if-then-else statement, "condition" means the condition of judgment, and "statement 1" means the content of operation to be executed when the condition is met; "Statement 2" indicates the content of the operation to be performed when the conditions are not met; END IF indicates the end of the conditional statement. When the computer executes, the condition after IF is judged first, and if the condition is met, the statement after THEN is executed1; If the condition is not met, statement 2 is executed after ELSE.
3.If-then statement
The general format of IF—THEN statement is shown in Figure 3, and the corresponding program block diagram is shown in Figure 4.
Note: "condition" refers to the condition of judgment; "Statement" refers to the operation content to be executed when the conditions are met, and the program is terminated when the conditions are not met; END IF indicates the end of the conditional statement. When the computer executes, the condition if is judged first. IF the condition is met, the statement after the condition if is executed. If the condition is not met, the conditional statement is directly ended and other statements are executed.
1.2.3 loop statement
The loop structure is realized by loop statements. Corresponding to the two loop structures in the program block diagram, there are also two statement structures in the general programming language: WHILE type and UNTIL type. That is, WHILE statement and UNTIL statement.
1, WHILE statement
The general format of (1)WHILE statement is the corresponding program block diagram.
(2) When the computer encounters a WHILE statement, it first judges whether the condition is true, and if the condition is met, it executes the loop between WHILE and WEND; Then check the above conditions. If the condition is still met, the loop is executed again and the process is repeated until the condition is not met once. At this time, the computer will not execute the loop body, jump directly after the WEND statement, and then execute the statement after WEND. Therefore, the period is sometimes called "pretest" period.
2.UNTIL statement
The general format of (1)UNTIL statement is the corresponding program block diagram.
(2) Untill cycle is also called "post-test cycle". From the analysis of Untill loop structure, when the computer executes this statement, it first executes the loop body and then judges the conditions. If the condition is not met, continue to return to the loop body, and then judge the condition. Repeat this process until a certain condition is met, the loop body is no longer executed, and other statements are executed after the loop until statement.
Analysis: the difference between when-type cycle and until-type cycle: (discuss by students first, then summarize)
(1) When the loop is judged first and then executed until the loop is judged first;
In the WHILE statement, the loop body is executed when the condition is met, and in the UNTIL statement, the loop is executed when the condition is not met.
1.3. 1 shift division and phase subtraction
1, toss division. Also known as Euclid algorithm, the steps of finding the greatest common divisor by alternating division are as follows:
(1): divide the larger number m by the smaller number n to get a quotient and a remainder; (2): If = 0, then n is the greatest common divisor of m and n; If ≠0, divide the divisor n by the remainder to get a quotient and a remainder; (3): If = 0, it is the greatest common divisor of m and n; If ≠0, divide the divisor by the remainder to get a quotient and a remainder; ..... in turn, until = 0, at this time is the greatest common divisor.
2. More phase subtraction
China also had an algorithm for finding the greatest common divisor in the early days, that is, the subtraction technique. The steps to find the greatest common divisor with more subtraction skills in "Nine Chapters Arithmetic": What is half, what is not half, and what is the denominator? The fewer the number of children, the more they decrease and the more they lose, and so on, the number is about equal.
Translated as: (1): Give two positive numbers at will; Determine whether they are all even numbers. If so, reduce by 2; If not, proceed to the second step. (2): Subtract the smaller number from the larger number, then compare the smaller number with the difference, and subtract the number from the larger number. Continue this operation until the numbers obtained are equal, then this number (equal number) is the greatest common divisor.
Example 2 Find the greatest common divisor of 98 and 63 by polyphase subtraction.
Analysis: (omitted)
3, the difference between division and subtraction:
(1) are all methods to find the greatest common divisor. In calculation, division is the main method and subtraction is the main method. The number of calculations is relatively small, especially when the size of the two numbers is very different.
(2) From the form of the result, the result reflected by division is obtained by dividing the remainder by 0, while the result obtained by subtraction is equal to the difference.
1.3.2 Qin algorithm and ranking
1, Qin algorithm concept:
Evaluation of f (x) = anxn+an-1xn-1+...+a1x+A0.
f(x)= anxn+an- 1xn- 1+…。 +a 1x+A0 =(anxn- 1+an- 1xn-2+…。 +a 1)x+A0 =((anxn-2+an- 1xn-3+…。 +a2)x+a 1)x+a0
=......=(...(anx+an- 1)x+an-2)x+...+a 1)x+a0
To require the value of polynomial, first calculate the value of the sequence polynomial in the innermost bracket, that is, v 1=anx+an- 1.
Then the value of the polynomial is calculated layer by layer from the inside out, that is
v2=v 1x+an-2 v3=v2x+an-3......vn=vn- 1x+a0
In this way, the evaluation problem of polynomial of degree n is transformed into the problem of finding the value of polynomial of degree n.
2. Two sorting methods: direct insertion sorting and bubble sorting.
1, insert sorting directly.
Basic idea: the idea of inserting sorting is to read one and arrange one. Put the number 1 into the number 1 element of the array, and compare the number read later with the number stored in the array to determine its position in the ascending order. Move this position and subsequent elements back one position, and fill in the blank position with new numbers. (Because the algorithm is simple, you can give an example. )
2, bubble sorting
Basic idea: compare two adjacent numbers in turn, with the big one in front and the small one in the back. That is to say, first compare the number 1 with the second number, with the big one in front and the small one in the back. Then compare the second number with the third number ... until the last two numbers are compared. The first trip down, the smallest must sink to the end. Repeat the above process, still starting from 1
1.3.3 decimal system
1. Concept: The carry system is a counting method, which uses a finite number of digits to represent different numerical values in different positions. The number of digital symbols that can be used is called radix, and when radix is n, it can be called N-ary, which is abbreviated as N-ary. Decimal system is most commonly used now, and it is usually counted by 10 Arabic numerals 0-9. For any number, we can use different carry systems to represent it. For example, the decimal number 57, binary can be expressed as11001,octal can be expressed as 7 1, and hexadecimal can be expressed as 39, and their values are all the same.
Generally speaking, if k is an integer greater than 1, then the K-based system can be expressed as:
,
Numbers representing various carry digits are generally represented by adding a note to the lower right of the digits, such as111001(2) for binary digits and 34(5) for binary digits.
Chapter II Statistics
2. 1. 1 simple random sampling
1. Population and sample
In statistics, the whole research object is called population.
Call each research object an individual.
The total number of individuals in a group is called the total capacity.
In order to study the related properties of the population, a part of the population is generally randomly selected:,,,
Research, we call it a sample. The number of individuals is called sample size.
2. Simple random sampling, also called pure random sampling. That is, as a whole, it does not go through any grouping, classification, queuing, etc. , completely with.
Machine-based measurement unit extraction. The characteristics are: the probability of each sample unit being extracted is the same (the probability is equal), and each unit of the sample is completely independent, and there is no certain correlation and exclusion between them. Simple random sampling is the basis of other sampling forms. This method is usually only used when the difference between the whole units is small and the number is small.
3. Common methods of simple random sampling:
(1) draw lots; (2) Random number table method; ⑶ Computer simulation method; ⑷ Direct extraction with statistical software.
In the sample size design of simple random sampling, the main considerations are: ① population variation; ② Allowable error range; ③ Degree of probability assurance.
4. Draw lots:
(1) Number each object in the investigation team;
(2) Prepare the lottery tool and implement the lottery.
(3) Measure or investigate each individual in the sample.
Please investigate the favorite sports activities of students in your school.
5. Random number table method:
Example: Select 10 students from the class to participate in an activity by using a random number table.
2. 1.2 systematic sampling
1. System sampling (equidistant sampling or mechanical sampling):
Sort the units of the population, then calculate the sampling distance, and then sample according to this fixed sampling distance. The first sample was selected by simple random sampling.
K (sampling distance) =N (population size) /n (sample size)
Prerequisite: For the variables studied, the arrangement of individuals in the group should be random, that is, there is no regular distribution related to the variables studied. You can start sampling from different samples and compare the characteristics of several samples under the conditions allowed by the investigation. If there are obvious differences, it shows that the distribution of samples in the population follows a certain cycle law, and this cycle coincides with the sampling distance.
2. Systematic sampling, namely equidistant sampling, is one of the most commonly used sampling methods in practice. Because it has low requirements for sampling frames and simple implementation. More importantly, if there are some auxiliary variables related to the survey indicators available, and the whole unit is queued according to the size of the auxiliary variables, systematic sampling can greatly improve the estimation accuracy.
2. 1.3 stratified sampling
1. stratified sampling (type sampling):
First of all, according to some characteristics or signs (gender, age, etc.), all units in the group are divided into several types or levels. ), and then extract a sub-sample from each type or level by simple random sampling or systematic sampling. Finally, these sub-samples are combined to form a total sample.
Two methods:
1. Firstly, the population is divided into several layers by stratification variables, and then extracted from each layer according to the proportion of each layer in the population.
2. Firstly, the population is divided into several layers by stratification variables, and then the elements in each layer are arranged neatly in hierarchical order. Finally, samples are extracted by systematic sampling.
2. Stratified sampling is to divide people with strong heterogeneity into sub-populations with strong homogeneity, and then draw samples from different sub-populations to represent sub-populations, and all samples represent people again.
Stratification standard:
(1) Take the main variables or related variables to be analyzed and studied in the investigation as the standard of stratification.
(2) Ensure that the variables with strong homogeneity in each layer, strong interlayer heterogeneity and outstanding overall internal structure are used as stratified variables.
(3) Take those variables with obvious stratification as stratified variables.
3. The proportion of stratification:
(1) Proportional stratified sampling: a method to extract sub-samples according to the proportion of units of various types or levels to the total units.
(2) Non-proportional stratified sampling: If the proportion of some levels in the population is too small, the sample size will be very small. At this time, this method is mainly used to facilitate special research or comparison of different levels of subpopulations. If we want to infer the population from the sample data, we need to first weight the data of each layer, adjust the proportion of each layer in the sample, and restore the data to the actual proportion structure of each layer in the population.
2.2.2 Use the digital features of the sample to estimate the digital features of the population.
1, average:
2. Standard deviation of samples:
3. When estimating the population with samples, if the sampling method is reasonable, then the samples can reflect the information of the population, but the information obtained from the samples will be biased. In random sampling, this deviation is inevitable.
Although the distribution, mean and standard deviation we get from the sample data are not the real distribution, mean and standard deviation of the population, but only an estimate, this estimate is reasonable, especially when the sample size is large, and they do reflect the information of the population.
4.( 1) If the same constant is added or subtracted from each data in a set of data, the standard deviation remains unchanged.
(2) If each data in a set of data is multiplied by a constant k, the standard deviation becomes k times the original value.
(3) The influence of the maximum and minimum values in a set of data on the standard deviation, and the application of intervals;
The scientific truth of "removing a highest score and removing a lowest score"
2.3.2 Linear correlation of two variables
1, concept:
(1) regression linear equation
(2) Regression coefficient
2. Least square method
3. Application of linear regression equation
(1) describes the dependency between two variables; Linear regression equation can be used to quantitatively describe the quantitative relationship between two variables.
(2) Using regression equation to forecast; Substituting the predictor (independent variable x) into the regression equation to estimate the predictor (dependent variable y), the allowable interval of individual y value can be obtained.
(3) Use regression equation for statistical control, specify the change of Y value, and achieve the purpose of statistical control by controlling the range of X. If the regression equation between NO2 concentration in the air and traffic flow is obtained, the NO2 concentration in the air can be controlled by controlling traffic flow.
4. Matters needing attention in the application of linear regression
(1) regression analysis should have practical significance;
(2) Before regression analysis, it is best to make a scatter plot;
(3) Don't extend the tropic of cancer.
Chapter III Probability
3.1.1-3.1.2 Probability of random events and its significance.
1, basic concept:
(1) inevitable event: the event that will happen under condition S is called the inevitable event relative to condition S;
(2) Impossible events: events that will not happen under condition S are called impossible events relative to condition S;
(3) Deterministic events: inevitable events and impossible events are collectively referred to as deterministic events relative to condition S;
(4) Random events: events that may or may not occur under condition S are called random events relative to condition S;
(5) Frequency and frequency: repeat the test for n times under the same condition S, and observe whether there is an event A, and call the frequency nA of the event A in the n tests as the frequency of the event A; The ratio fn(A) of event A = the probability of event A: for a given random event A, if the frequency fn (a) of event A is stable at a certain constant with the increase of test times, the constant is recorded as P(A) and called as the probability of event A. ..
(6) Difference and connection between frequency and probability: The frequency of a random event refers to the ratio of the number of times nA of the event to the total number of times n of testing, which has certain stability and always swings around a certain constant, and with the increase of testing times, the swing amplitude becomes smaller and smaller. We call this constant the probability of random events, which quantitatively reflects the probability of random events. Frequency can be approximated as the probability of the event under the premise of a large number of repeated experiments.
3. Basic Properties of1.3 Probability
1, basic concept:
The inclusion, union, intersection and equality of (1) events.
(2) If A∩B is an impossible event, that is, A ∩ B = Ф, then event A and event B are mutually exclusive;
(3) If A∩B is an impossible event and A∪B is an inevitable event, then event A and event B are mutually opposite events;
(4) When events A and B are mutually exclusive, the addition formula is satisfied: p (a ∪ b) = p (a)+p (b); If events A and B are opposite events, then A∪B is an inevitable event, so P(A∪B)= P(A)+ P(B)= 1, so there is P (A) = 1-P (B).
2, the basic nature of probability:
1) The probability of inevitable events is 1, and the probability of impossible events is 0, so 0 ≤ p (a) ≤1;
2) When events A and B are mutually exclusive, the addition formula is satisfied: p (a ∪ b) = p (a)+p (b);
3) If events A and B are opposite events, then A∪B is inevitable, so P(A∪B)= P(A)+ P(B)= 1, so there is P (A) =1-P (B);
4) The difference and connection between mutually exclusive events and opposing events, mutually exclusive events means that in an experiment, event A and event B will not happen at the same time, including three different situations: (1) Event A happens and event B doesn't happen; (2) Event A does not occur, but Event B does; (3) Event A and Event B do not occur at the same time, while the opposite event means that there is only one event A and Event B, including two situations; (1) Event A occurs, but event B does not; (2) Event B happens and Event A doesn't, which is a special case of mutually exclusive events.
3.2. 1 —3.2.2 Generation of Classical Probability and Random Numbers
The use conditions of classical probability 1 and (1): the finiteness of test results and the equal possibility of all results.
(2) the solution steps of classical probability;
① Find the total number of basic events;
② Find the number of basic events contained in event A, and then use the formula P(A)= 1
3.3. 1—3.3.2 Generation of Geometric Probability and Uniform Random Numbers
1, basic concept:
(1) geometric probability model: If the probability of each event is only proportional to the length (area or volume) of the event area, such a probability model is called geometric probability model;
(2) Probability formula of geometric probability:
p(A)=;
(3) Characteristics of geometric probability: 1) There are infinitely many possible results (basic events) in the experiment; 2) The possibility of each basic event is equal.