MindMap Gallery Basics of Information Management Ma Feicheng 3rd Edition
This book is divided into four parts: basic theory of information management, information management technical methods, information organization and business management, and information management discipline theory. The basic theory of information management includes four chapters: information resource theory, information user theory, information flow theory, and information management theory; information management technical methods include five chapters: information collection, information organization, information analysis, information provision, and information management system; information institutions and undertakings Management includes two chapters: information organization management and information industry management; information management discipline theory includes two chapters: basic theory of information management and the emergence and development of information management. This book has rich content, complete system, clear viewpoints, easy-to-understand narration, and connects theory with practice. Each chapter is equipped with exercises. This book can be used as a teaching material or teaching reference book for information management and information systems, information resource management, library science, archives science and other related majors in colleges and universities. It can also be used as a reference for business personnel and managers in the field of information management.
Edited at 2023-04-15 20:51:53El cáncer de pulmón es un tumor maligno que se origina en la mucosa bronquial o las glándulas de los pulmones. Es uno de los tumores malignos con mayor morbilidad y mortalidad y mayor amenaza para la salud y la vida humana.
La diabetes es una enfermedad crónica con hiperglucemia como signo principal. Es causada principalmente por una disminución en la secreción de insulina causada por una disfunción de las células de los islotes pancreáticos, o porque el cuerpo es insensible a la acción de la insulina (es decir, resistencia a la insulina), o ambas cosas. la glucosa en la sangre es ineficaz para ser utilizada y almacenada.
El sistema digestivo es uno de los nueve sistemas principales del cuerpo humano y es el principal responsable de la ingesta, digestión, absorción y excreción de los alimentos. Consta de dos partes principales: el tracto digestivo y las glándulas digestivas.
El cáncer de pulmón es un tumor maligno que se origina en la mucosa bronquial o las glándulas de los pulmones. Es uno de los tumores malignos con mayor morbilidad y mortalidad y mayor amenaza para la salud y la vida humana.
La diabetes es una enfermedad crónica con hiperglucemia como signo principal. Es causada principalmente por una disminución en la secreción de insulina causada por una disfunción de las células de los islotes pancreáticos, o porque el cuerpo es insensible a la acción de la insulina (es decir, resistencia a la insulina), o ambas cosas. la glucosa en la sangre es ineficaz para ser utilizada y almacenada.
El sistema digestivo es uno de los nueve sistemas principales del cuerpo humano y es el principal responsable de la ingesta, digestión, absorción y excreción de los alimentos. Consta de dos partes principales: el tracto digestivo y las glándulas digestivas.
Basics of Information Management
Chapter 1 Information and Information Management
1. Information
(1) The concept of information
1. Ontological level
Information is the expression of the way things exist and the state of movement.
2. Epistemological level
Information is the way things exist and the state of movement that the subject perceives or expresses.
3. Full information
Information at the epistemological level that simultaneously takes into account the external form (grammatical information), intrinsic meaning (semantic information) and utility value (pragmatic information) of the way things exist and their state of motion is called complete information.
4. Social information
That is, it is generated, transmitted, communicated and applied to human social practical activities for specific purposes, including all data, messages, experiences and knowledge expressed and recorded by language, symbols and other material carriers created by humans.
(2) Characteristics of information
1. Universality and objectivity of information existence
⑴Information is the expression of the way things exist and the state of movement. The existence and movement of things are always and everywhere, so information follows them like a shadow, always and everywhere.
⑵This kind of ubiquitous information is also absolute and objective. Absoluteness means that the existence of information does not depend on the subject; objectivity means that information is not an illusory thing, and its existence can be perceived, acquired, stored, processed, transmitted and applied by people.
2. The extension and infinity of information generation
⑴Everything in the universe and space has its own way of existence and state of motion, and is constantly generating information; and the things in the universe and space are infinitely rich, vast in space, and infinitely changing in time.
⑵Therefore, the generation of information is infinite, and the distribution is also infinite. Even in limited space and time, things are infinitely diverse, and information is naturally infinite.
3. Transmittance of information in time and space
⑴Information is generated from the existence and movement of things, but information can be relatively independent of its source and can be carried by other material carriers and transmitted in time or space. The transmission in time is the storage of information, and in space The transmission is communication.
⑵The nature of information transmission in time and space is very important. It not only enables human society to carry out effective information exchange and communication, but also enables the accumulation and dissemination of knowledge and information.
4. The independence of information from material carriers
⑴Information represents the existence and movement of things, but information is not the thing itself. This "representation" can be expressed through various symbols, codes, and languages created by humans, recorded and stored through various material carriers, and loaded and transmitted through energy such as light, sound, and electricity. Without these material carriers, information cannot exist. This shows that information is dependent on material carriers.
⑵ However, the specific material carrier through which the information is expressed, recorded and loaded will not change the nature and meaning of the information, which shows that the information is independent of the material carrier. The transformation of material carriers carrying information does not change the way things exist or the expression of their motion states. This property makes it possible for people to perform various processing and transformation of information.
5. The relativity of information to the cognitive subject
⑴Because people have different observation abilities, cognitive abilities, understanding abilities and purposes, the amount of information they obtain from the same thing is also different.
⑵ Even if their abilities and purposes are exactly the same, when they observe things, they choose different angles and different sides, and the amount of information they obtain about the same thing must also be different. This nature of information shows that the actual amount of information obtained varies from person to person.
6. Sharability of information to users
⑴Because information can be separated from its source or independent of its material carrier, and is not consumed during use, it can be used by many users at the same time or at different times. This is the sharing of information.
⑵The sharing of information is a natural attribute (essential attribute) of information, which has both positive and negative aspects. The positive aspect is that information can be shared to the maximum extent in time and space, improving information utilization efficiency and saving production costs; the negative aspect is that sharing brings great difficulty to the arrangement and control of information property rights in modern information management.
⑺The non-transformability and non-combinability of information
Once information is generated, it expresses a certain meaning. It is not a simple arithmetic sum of various elements contained in the information. Therefore, it is impossible to arrange these elements in any order and combine them in different combinations without damaging the information. meaning. Likewise, the elements that make up information cannot be arbitrarily divided.
⑻Timeliness of information generation and utilization
⑴From the perspective of information generation, information represents the way things exist and the state of movement at a specific moment. Since all things are constantly changing, after this moment, the way things exist and the state of movement will inevitably change. The information in this way and state will also change accordingly, which is the so-called change of times.
⑵From the perspective of information utilization, information can only exert its effectiveness at a specific moment. A timely piece of information may be worth a fortune, but an outdated piece of information may be worth nothing. The main emphasis here is on "specific moments". Users must be good at seizing the opportunity. Only when the time is right can the information be fully effective.
(3) Classification of information
1. Divide according to the objects described by the information
Natural information, biological information, machine information, social information
2. Classification according to the nature of information
Grammatical information, semantic information, pragmatic information
3. Divide from the perspective of user observation and process
Real information, a priori information, actual information
4. Divide according to the direction of information transmission
Vertical information, horizontal information, mesh information
5. Divide by content of information
Economic information, scientific and technological information, political information, cultural information, policy and regulatory information, entertainment information
6. Divide according to the role of information
Useful information, useless information, interference information
7. Divide according to the operating status of the information
Continuous information, interval information, routine information, sudden information
8. Classification according to information circulation channels
formal information, informal information
9. Divide by how information is recorded
Voice information, image information, text information, digital information, computing information
10. Divide by source of information
internal information, external information
(4) Data, information and knowledge in the information chain
The information chain consists of five elements: facts - data - information - knowledge - intelligence. In the information chain, data and knowledge are closely related to information.
1.Data
⑴Data: It is a physical symbol that loads or records information and is arranged and combined according to certain rules. It can be numbers, text, images, sounds or computer code.
⑵ People's reception of information begins with the reception of data, and information can only be obtained through the interpretation of data background and rules. "data context = information"
2.Information
Information: It is the content of the data load. For the same information, its data representation can be in various forms. The message tells us: “What does this set of symbols represent?”
3.Knowledge
⑴Knowledge: It is the correct conclusion obtained by the information recipient through the refining and reasoning of the information. It is the understanding and mastery of nature, human society, thinking methods and movement laws through information. It is reassembled by the human brain through thinking. A systematic collection of information. Knowledge tells us: “What does this set of data mean?”
⑵ Knowledge transmission mode: transmitter’s knowledge → data → information → receiver’s knowledge. It can be seen that the key to converting information into knowledge lies in the recipient's ability to understand the information. The ability to understand the information depends on the recipient's information and knowledge preparation. Information can only be transformed into knowledge if it is combined with the recipient's personal experience, information and knowledge preparation, that is, it is integrated with the recipient's personal background. "Information experience = knowledge"
⑶ It can be seen that knowledge can only be acquired through learning and experience (practice), and human cognitive and understanding abilities provide the necessary conditions for the transformation of data into information, the transformation of information into knowledge, and the effective integration of knowledge and the creation of new knowledge. An indispensable condition.
4.Smart
⑴Intelligence: It is an ability and strategy to solve problems. It is the ability to effectively obtain information for specific problems and purposes in a certain environment, process information to form knowledge and strategies, and use strategies to solve problems, thereby successfully achieving the purpose. .
⑵Intelligence is knowledge activated by purpose and the way knowledge moves under certain conditions.
relation
Starting from the relevant concepts in the above information chain, it can be considered that:
Data is the raw material of information, and information is the raw material of knowledge. The trend of knowledge is to become an intelligent method for people to make decisions. Among them, data has the widest coverage, followed by information, and knowledge is the smallest.
2. Informatization and information society
(1) Information, matter and energy
⑴Material, energy and information are the three basic elements that govern the development of human society, and are also called the three pillar resources of contemporary society. Matter provides materials, energy provides power, and information provides knowledge.
⑵ In human society, matter and energy constantly flow from producers to users. This objective flow of matter and energy is reflected in human understanding as information about the movement of this matter and energy. The movement of material and energy flow also manifests as the flow of various other forms of material and energy, that is, the movement of corresponding documents, data and signals, the sum of which constitutes the information flow. Information can be considered to be the materialized flow in the field of social production.
⑶ Information flow reflects the movement of material energy flow. Society uses information to control the movement of material energy flow, influence their movement direction, conduct scientific distribution, and implement effective management.
(2) Informatization
1.Informatization concept
Informatization: Due to the irreplaceable role of information and information technology in today's social and economic development, both the government and all walks of life are making full use of information technology, fully developing information resources, and improving their own effectiveness and efficiency. People regard this as This phenomenon is called informatization. Informatization is an inevitable product of social development to a certain stage, and its rise has profound social, economic and technological backgrounds.
2. Elements included in the connotation of informatization
⑴Information resources. As a resource, information itself must reach a certain level in terms of accumulation, and it exists relatively commonly in various fields of human society;
⑵Information technology, the popularity and utilization of modern information technology and network technology continue to increase;
⑶Information concept, people’s understanding of the importance and value of information has been strengthened, and people’s education level and core literacy have been improved;
⑷Resource sharing, through rapid and effective information exchange, the full sharing of information resources within the whole society is achieved, generating social and economic benefits.
3. Information society
⑴ Concept
Information society: It is a new form of human society after agricultural society and industrial society. In the information society, information has become a more important resource than material and energy. Information economic activities aimed at developing and utilizing information resources have expanded rapidly, gradually replacing industrial production activities and becoming the main content of national economic activities.
⑴Characteristics of the information society
①Information, knowledge and intelligence are the decisive forces for social development;
②Information technology, information industry, and information economy have become the dominant factors in technological, economic and social development;
③Information workers and intellectuals will play a greater role;
④ Due to the widespread application of information technology, it has had a huge and profound impact on economic and social development, fundamentally changing people's lifestyles, behaviors and values.
(Supplement: Social informatization)
1. Concept
Social informatization: refers to a specific phenomenon in the development process of human society. When this phenomenon occurs, humans are increasingly dependent on information and relatively low on material and energy. (It is a new technological revolution based on and marked by the widespread application of computer information processing technology and transmission means, and a process that affects and transforms social lifestyles and management methods.)
2. Three levels of social informatization
⑴ Informatization of production tools: realized through automatic control and intensive knowledge.
⑵ Informatization of social production systems: realized through automated control of production industries, departments and even the entire national economy.
⑶Informatization of social life: realized through communication systems, consulting industries and other facilities.
3. The development stage of social informatization
The stage of popularizing the information industry, the stage of establishing and developing advanced communication systems, the stage of enterprise informatization, and the comprehensive informatization of social life.
3. Overview of information management
(1) Concepts and objects of information management
1. The concept of information management
⑴Narrow sense: Information management is the management of information itself, that is, using various technical methods and means to organize, control, store, retrieve and plan information, and guide it to predetermined goals.
⑵Broad sense: Information management is not only the management of information, but also the reasonable organization and control of various elements involved in information activities (information, people, machines, institutions, etc.) to achieve the reasonable allocation of information and related resources. Thus effectively meeting the information requirements of society.
⑶This book: A broad understanding of information management. It is believed that the essence of information management is that humans comprehensively use technical, economic, policy, legal, and humanistic methods and means to control the flow of information (including information flow in informal channels and formal channels) to improve information utilization. Efficiency is an activity aimed at maximizing the utility value of information.
2. Objects of information management
⑴Information resources
①The narrow concept of information resources equates information resources with knowledge, data and information, that is, it only refers to the collection of information itself.
②The broad concept of information resources refers to the organic collection of information, information technology and information personnel. It is a multi-element concept involving the entire information labor process such as information production, processing, dissemination, and utilization. These include the object of information labor - information, the tools of information labor - information technology means such as computers and communication technology, and information workers - information professionals. The above three elements are interconnected and interact with each other to form an organic whole with unified functions - an information system.
⑵Information activities
①The process of information from generation and dissemination to collection, processing, absorption and utilization is a complete "information life cycle". Information activities are the process of development and utilization of information resources.
②The information activities of human society include three basic levels: personal, organizational and social information activities.
Personal information activities are manifested in the individual's development and utilization of information resources. Its efficiency is related to the individual's information awareness and information capabilities, and is affected by the individual's information environment;
Organizational information activities often appear in the form of various information systems. The degree of development of information systems can reflect the level of information resource development and utilization of organizations at all levels.
As the development scale of information activities continues to expand, the development and utilization of information resources has an increasingly significant impact on the progress of human society, which is reflected in the formation and development of an emerging industry in society as a whole - the information industry.
(2) Goals and tasks of information management
1. Goals of information management
⑴Overall goal
Ensure that social information flow flows in an orderly manner through different channels. The development and utilization of information are coordinated and carried out in an orderly manner under leadership and organized unified planning and management, so that all types of information can be more efficient, effective and more effective. Low cost plays a full role in the country's social progress, economic development, and the improvement of people's material and cultural standards.
⑵Sub-goals
① Information production and development sub-goals: Mainly to rationally organize and plan the production and development of information based on social and economic development to ensure that relevant potential information can be transformed into actual information resources in a timely and economical manner for people to use.
② Information utilization sub-goal: mainly to rationally organize the circulation and distribution of information in accordance with the principles of socialization, specialization and industrialization to ensure that information can be fully and effectively utilized.
③ Information management mechanism sub-goals: establish a scientific and reasonable information management mechanism in accordance with the characteristics and laws of social information processes, and improve the guarantee system for information development and utilization.
2. Information management tasks
⑴Macro level
① Formulate information development strategies, strategies, plans, guidelines and policies so that information development activities can be carried out in an orderly manner under the unified guidance and management of the state, so that information development is not only low-cost and cheap, but also can well meet the national economy and the overall needs of social development.
② Formulate laws, rules and regulations for information management, and establish a supervision and guarantee system for information management, so that information management can truly have laws and regulations to follow, so that the information produced and developed can be fully, timely and effectively used.
③ Comprehensive use of economic, legal and administrative means to coordinate the relationship between departments, regions and enterprises, clarify the boundaries of responsibilities, rights and interests of information development and utilization agencies at all levels, so that information development and utilization can be maximized on the basis of equality and mutual benefit Realize resource sharing.
④ Strengthen the construction of national information infrastructure and networks so that the production, development, utilization and management of information have good hardware environment support.
⑵Micro level
① Investigate and understand the information needs of various personnel within the organization or institution, and formulate a compromise plan to meet different needs to maximize the satisfaction of different needs.
② Understand the information sources and information acquisition channels inside and outside the organization or institution, so as to obtain the required information or transmit the information externally when needed.
③Select applicable information technology, build internal information systems and networks within the organization or institution, determine information processing, storage, retrieval and transmission methods, and establish an efficient information assurance system within the organization or institution.
④Evaluate the performance of information management to provide a basis for improving information management.
(3) Perspective of information management
1.Technical perspective
Information managers study the use of manual and computer methods to collect, process and store information in an orderly manner so that it can be quickly retrieved and delivered to specific users. This has always been the focus and goal of information management research.
2. Economic perspective
Mainly studies the formation, development, characteristics and operation model of the information market, information industry, and information economy based on the production, circulation and utilization of information, the optimal allocation of information resources, the evaluation and selection of information technology, and the evaluation of information economic benefits. question. Information economics is a research field at the intersection of information management and economics from this perspective.
3. Administrative and legal perspectives
Based on government functions, use administrative means and legal means to regulate and control information activities, focus on coordinating and resolving new contradictions, new conflicts, and new interests that arise in the process of social informatization, and promote society to make fuller use of information. Better utilize the social functions of information to serve certain development goals. Information policy and information law are typical research fields from this perspective.
4.Humanistic perspective
Based on the study of the influence and construction of moral, ethical and cultural factors in information flow control. Especially in the network environment, it is difficult to effectively restrict people's information behavior by relying solely on administrative and legal means. People's behavior must be regulated and information activities must be managed through the cultivation of humanistic spirit. From this perspective, the research on information ethics and network ethics has attracted increasing attention.
As people's understanding of the rapid development of information technology and information management and their relevance to human society continues to deepen, a dynamic interdisciplinary research is gradually emerging - social informatics. (Social Informatics: is the interdisciplinary study of the design, use, and co-operation of information technology from the perspective of its interaction with organizational structures and cultural backgrounds. It is a new field that studies computerized social aspects, including information technology. work in social and organizational change, and the impact of social organizations and practices on the development of information technology).
(4) Information management process: Information life cycle management (ILM)
1.The meaning of information life cycle management
Information Lifecycle Management (ILM): It is an information management model that manages information throughout its entire life cycle, from creation to use to archiving and processing. It is a process strategy for active management of information. Its purpose is to Protect the continuity of information dissemination.
2. Stages of information life cycle management
⑴Information creation (generation/release) stage
The creation phase is the initial stage of the information life cycle and the starting point for information life cycle management. In addition to the originally regulated information producers such as information agencies, a considerable part of the information is currently generated by users in the network environment. Therefore, the release of information has a great degree of freedom and arbitrariness, which requires the regulation of the information creation stage. Sex makes higher demands. Document formats, specifications, and metadata descriptions need to be consistent during the information creation phase.
⑵Information collection stage
Collection is the basis and starting point for the development and utilization of information resources. First, it is necessary to formulate an information collection policy, clarify the content and scope of information collection, and then adopt corresponding collection methods. There are usually two methods of information collection: manual collection and automatic acquisition. The advantage of manual collection is that information must be manually checked for consistency before collection, which can avoid errors that cannot be recognized by machines. The automatic acquisition method can use corresponding network programs to capture information resources on the network, especially for dynamically updated information. This method is more efficient, but it also has the disadvantage that the value of the information cannot be judged.
⑶Information organization stage
The task of this stage is to provide an orderly structure for the information collection to form an organic whole to facilitate the access and utilization of the information. Specifically, it is the process of using certain methods to filter, analyze, index, describe, sort, and optimize the collected large amounts of scattered and messy information to form a system that is convenient for users to effectively utilize.
⑷Information storage stage
Information storage is the basis for realizing information value. The main task of this stage is to rely on relevant storage application technology to change the information stored on the corresponding carrier and media from an unavailable state to an available state, and the available state to a usable state. The horizontal usage status becomes a high-level usage status, so that information management, sharing, protection, backup, recovery, replication and other functions can be realized in an automated way; a service level hierarchy is established in the storage network and the information is transferred to the corresponding service In the hierarchical level.
⑸Information utilization stage
Information utilization is the purpose of information life cycle management, and information utilization is the process in which users effectively use the information provided. The main goal of information lifecycle management is to ensure that information can support business decisions and provide long-term value to enterprises and users. Information must therefore be easily accessible and usable. The main methods used in the information utilization stage include: establishing professional databases; providing professional information navigation; and developing information value-added services.
⑹Information cleaning (destruction/recycling) stage
As information ages and loses value, there will always be a time when much information is no longer worth keeping. At this time, relevant policies need to be formulated to clean up or destroy information that is not retained or necessary. Cleaned or destroyed information will be purged from active and inactive systems, as well as systems such as data warehouses. Some information that cannot be destroyed lightly needs to be migrated. Therefore, the main work at this stage is to establish scientific and clear rules for data recycling (destruction, cleaning, migration).
3. The difference between information life cycle management based on management perspective and value perspective
⑴The two methods of dividing the information life cycle stages are different.
① The management perspective is more from a management perspective, refining the specific process of tracking the actual processing of a type of product or the existing business process of processing, and decomposing the life cycle process of information.
②The value perspective mainly uses or attempts to use quantitative methods to objectively describe and simulate the life evolution process of information based on certain measurement indicators, and then divide the stages according to the life curve, or set some quantitative identification indicators to automatically identify life cycle stage.
⑵The information life described by the two has different starting points.
①The value perspective takes the research starting point after the information is generated, such as literature publication, monograph publication, etc.
②The management perspective involves how information is collected, organized, stored and utilized after it is generated.
4. History and Development of Information Management
(1) Traditional management stage
This stage takes information source management as the core, symbolized by the library, and also includes archive management and other document management. Although humans have been preserving and managing knowledge and information for a long time, information management as a specialized work and career only emerged and developed after the emergence of libraries.
⑴The library is the product of the development of human social life to a certain stage. Collecting and organizing documents and records became the way of existence and activities of early libraries. With the development of social economy, science and technology, and culture, the types of document records have increased significantly. As the original document collection institution, the library has gradually separated from the archives and become a social information exchange center institution for the collection, collection, and utilization of knowledge and documents. That is how it was formed. library in an independent sense.
⑵The library collects documents for the purpose of document utilization. In order to effectively resolve the contradiction between "collection" and "use", the library has introduced the concept of management. In addition to the management of the documents themselves, it also includes the management of institutions, people, document collection and provision processes, so it has a fairly comprehensive and modern awareness of information resource management.
⑶ Since the use of document information is very complex in time and space, libraries do not know when and where the "collection" here and now can be "used". This forces many libraries to have to focus on the source of document information. Collect them all in case of unforeseen events in the future. This is the fundamental reason why libraries attach great importance to the management of "sources".
⑷After the 1940s, the intelligence crisis began to emerge, and a new type of full-time information service organization emerged in the field of science and technology—scientific and technological information agencies. In essence, there is no special difference between scientific and technological information institutions and libraries. Especially in the context of libraries moving from the world of books to the world of information when modern information technology is widely adopted, library and information services are becoming more integrated. After the 1960s, both of them paid more attention to "management" and transitioned to the development of "information management". The traditional management stage with "information source" as the core, literature as the main carrier, and public welfare services as the main goal is developing into the information management stage, transitioning from management that emphasizes "source" to control that focuses on "flow".
(2) Technical management stage
This stage takes the control of information flow as the core, uses computers as tools, and takes automated information processing and information system construction as the main work content. Obviously, this is an emerging information management model developed under the background of the high development and widespread application of computer technology and related information technology.
① Computers have scientific computing functions and information processing functions. The first computer was successfully developed in the 1940s. Less than ten years after the invention of the computer, it was applied to the processing and management of document information in libraries. The purpose was to improve the efficiency of document information processing and search, and to realize the flow of document information. control.
② With the development of computer technology, computer processing functions are getting stronger and stronger, making it possible for people to process documents from the macro level to the micro level, and from partial information of the document to full-text information, which greatly improves human understanding of document information. The processing and management capabilities have improved the library and information center's automated control over the flow of document information.
③The technical management stage focuses on computer technology to process information and control information flow. Technical factors dominate and technical experts play the leading role. This stage created many information processing methods and system design and development theories around computer applications. The excessive pursuit of the application of the most advanced technology completely ignores the role of other factors in information management.
(3) Resource management stage
The concept of information resource management is proposed based on two backgrounds: on the one hand, it is the technical management stage, and pure technical means cannot achieve effective control and utilization of information; on the other hand, and the more important reason is that contemporary social and economic development has made Information has become an important resource, and there is an urgent need to think about the problem from an economic perspective and optimize the allocation and management of this resource.
⑴After the 1970s, people began to care about how people respond to changes caused by technology and the impact of people on the application of technology, and began to use administrative, legal, and economic means to coordinate society from a combination of micro and macro. Various contradictions, conflicts and interest relationships in the process of informatization, and proper handling of the complex relationship between people and things in information management, thus gradually forming the ideas and concepts of information resource management.
⑵After the 1990s, the global information highway based on the Internet has completely changed the way human information activities are conducted. However, the high-speed information network has not brought a truly efficient and orderly information space through high-level technological development. On the contrary, the rapid expansion of the network has brought about information pollution, information chaos, information crime, and information infringement, which far exceed the traditional meaning. intelligence crisis.
⑶The current research on network digital information resource management can be mainly divided into the following aspects: ① Network resources ② Library ③ Information network ④ Information retrieval ⑤ Information organization ⑥ Metadata ⑦ User service ⑧ Evaluation ⑨ Information resource sharing ⑩ Information resources Develop and utilize information environment
(4) Knowledge management stage
Knowledge management was developed on the basis of overcoming the inherent shortcomings of information management. It is an information management activity that emphasizes dealing with people. Its essence is to connect structured and unstructured information with the rules for people to use this information. . The premise of knowledge management is the accuracy, timeliness, availability of information and information systems, and the ability to provide information in an easy-to-use manner. The main goal of knowledge management is to improve the ability of an organization or institution to acquire, share and utilize knowledge.
⑴The realization of knowledge management must be conditioned by corresponding information technology, and its results can most easily be expressed in the form of technical achievements. After the 1980s, the development of artificial intelligence and expert system technologies played a greater role in the emergence of knowledge management. Due to the development and application of these technologies, knowledge acquisition, knowledge expression, knowledge base systems, knowledge engineering and other fields were proposed. Fundamental concepts are widely used in knowledge management. In the mid-1990s, with the popularization and application of the Internet, knowledge management was rapidly promoted.
⑵ At present, knowledge management systems, knowledge sharing, knowledge transfer, and knowledge management strategies are the key topics of knowledge management. From a technical point of view, knowledge management can be divided into three stages: ① the knowledge management stage centered on the knowledge base; ② the development stage centered on the small-scale knowledge community; ③ represented by social computing, dynamic knowledge and its performance A new stage of knowledge management. Currently Web2.0 establishes a connection between knowledge management, social networks and semantics, which will certainly promote the development of knowledge management based on the Semantic Web.
Chapter 2 Information Exchange
1. Basic concepts of information exchange
(1) Classification of information exchange
1. Natural information exchange
Natural information exchange is entirely natural, including information exchange and interaction between non-living things, between living things, and between humans, non-living things, and living things. This process of action or influence should completely follow the laws and laws of nature and has nothing to do with human subjective consciousness. It is purely natural information exchange, that is, information exchange at the ontological level.
2. Information exchange between man and nature
The information exchange between man and nature is a semi-natural, semi-social information exchange, and a semi-conscious information exchange. In addition to following the rules of nature, this communication method also changes with people's understanding. (Understanding and transforming the world)
3. Information exchange between people
The process of social information exchange between people is the interaction of people's subjective consciousness. This communication method is also the most complex and advanced form of information exchange. It relies on the consciousness of both communicators and the behavior of both parties. Communication behavior is not only affected by psychological factors, but also follows certain social rules. At the same time, it is closely related to other aspects of society. All aspects are connected to form a large system for the operation of the entire society.
(2) The meaning and characteristics of information exchange
1.Meaning
Information exchange: refers to the transmission and exchange of knowledge, messages, data, facts and other information between cognitive subjects (people or institutions and organizations composed of people) at different times or in different spaces with the help of corresponding symbol systems. the process of.
(Synchronic/horizontal information exchange: realized on the same time plane, that is, communication here and now or here and now. The main function is to overcome spatial barriers to communication and achieve timely information sharing. Communication means include: Internet, fax, Telephone, telegraph, radio, television, spoken language, physical objects, etc.
Diachronic/vertical information exchange: Longitudinal examination of information connections in human society on different time planes. The main function is to eliminate the time barrier of communication, fill the gap between the past and the present, connect ancient times with modern times, and provide conditions for concentration and development. Communication means include: engraving CDs, copying disks, audio recordings, video recordings, photography, paintings, documents, archives, monuments, cultural relics, etc. )
2.Characteristics
⑴No matter what form of information exchange process there must be an information transmitter S, who is the initial source of information and is actually the information producer; an information receiver R, who is actually the final user of information.
⑵The information exchange process can be direct, such as simultaneous face-to-face communication, or indirect, such as non-simultaneous and non-face-to-face communication. For indirect communication, there should be at least one "human participation" link. We call this information transmission an information chain. The nodes of the information chain can be people, documents or other forms generated and transmitted by people, or a mixture of several forms.
⑶Information exchange is essentially a one-way transmission. The flow of information is always from S to R, which is a time-irreversible process.
⑷The source of information must be information provided by other people's minds S, rather than information obtained directly from natural or social observations.
⑸ Information exchange must also be the purposeful behavior of both parties, that is, both S and R intend to transmit and receive information. R has information needs and S provides it with purpose.
(3) Information exchange behavior
1. Human behavior
⑴Purpose behavior
Information behavior (speech, writing, body posture, etc.)
Non-informational behavior (manual labor, machine operation, etc.)
⑵Non-purpose behavior
Subconscious behavior
2. Information exchange behavior
⑴Output behavior (S behavior)
Writing, speech, physical samples, paintings, symbolic features, etc.
⑵Receiving behavior (R behavior)
Reading, art appreciation, phone calls, TV listening, etc.
(4) Conditions and elements for information exchange
1. Information sender
Also called information transmitter or information producer. It is the initial source of information and the initial link in the information transmission chain. It generally does not refer to information institutions, libraries, documentation centers and other relay stations for information transmission.
2. Information recipient
Or the recipient. It is the final recipient or user of information.
3. Communication channel
That is, the channel through which information reaches the recipient. For S and R, the basic channels through which they send and receive information are still the sensory systems of both parties, including vision, hearing, taste, smell, touch, etc. Other channels or technical systems are only expansions, extensions or extensions of these sensory channels. Transform.
4. Symbol system
It is the carrier on which information is transmitted and communicated. Including language, text, gestures, expressions, tokens, computer language, etc., it also includes the ways and rules of interconnection and organization of various symbolic elements within these symbolic systems.
5.Knowledge information base
It is the general term for human brain knowledge and information, including all kinds of knowledge and information created by people. It is the most fundamental source and ultimate destination of information exchange.
6.Support conditions
It is the guarantee for information exchange to be realized. Including: ① natural conditions, such as the role of light, electricity, and air in transmitting information; ② technical conditions: including various communication technologies, storage, processing technologies, etc.; ③ social conditions: the social security system for information exchange, such as laws , policies, economic conditions, the establishment of information agencies and related organizations.
2. Information exchange model
(1) Shannon-Weaver communication model
This model describes information exchange as a one-way linear process, and the entire process consists of five links. ⑴ The source is responsible for sending out the information to be transmitted. ⑵ This information will be encoded by the encoder and reach the decoder in the form of a signal suitable for the channel it passes through. ⑶ The function of the decoder is opposite to that of the encoder. ⑷ It will receive the signal Reduced to information and sent to the destination of communication, i.e. the destination. ⑸Noise refers to any interference to normal information transmission beyond the intention of the communicator, which may cause information distortion.
The communication model is oriented to signal transmission between machines. It reflects a primitive and simple idea of information dissemination and laid the foundation for many subsequent communication and dissemination process models. But at the same time, this model also has some limitations and is not completely suitable for the information exchange and dissemination process of human society.
(2) Lasswell’s “5W” model
The 5W model uses the answers to five questions to describe communication behavior, namely who (who), what (say what), through which channel (in which channel), to whom (to whom), and what effect was achieved (with what effect). The 5w model defines the research scope and basic content of communication studies. The impact is extremely far-reaching.
While emphasizing the sender, receiver and communication channels of information, this model begins to focus on the content of the information itself, as well as the effect of communication and dissemination, and is especially suitable for analyzing political communication and propaganda.
(3) Schramm model
Three models of information exchange are proposed. The first model is very similar to the Shannon-Weaver model of communication. In the second model, Schramm advanced the simple model of communication into a more complex model. He pointed out that two individuals trying to communicate must accumulate considerable common experience, that is, between the source and the destination, only between them Only within the scope of common experience can there be real communication. In the third model, Schramm's model is further developed into a model indicating the interaction between two individuals in human communication. This model is also called Schramm's "circulation model". In addition, a "mass communication process model" that can fully reflect the characteristics of mass communication is proposed. It is believed that the two parties that constitute the communication process are mass communication and the audience, and there is a transmission and feedback relationship between the two.
The characteristic of Schramm's model is that it introduces the concept of feedback, links the feedback process with the interaction process of the communicator, and understands information exchange as an interactive cyclic process. This model is more in line with the actual situation of human information exchange.
(4) Vickery’s S-C-R model
S represents the information source, C represents the communication channel, and R represents the information receiver. Information exchange is the process in which information starts from the information source, passes through various channels and media, and then is transmitted to the information receiver. From a formal point of view, the three elements of the S-C-R model correspond to the information source-channel-information sink in the communication model. Compared with the machine signal transmission of the communication model, the S-C-R model emphasizes human-oriented social communication. Like the information source and information receiver, information channels are also affected by society. Therefore, the examination of information exchange and transmission is placed in society. It is conducted against a background and involves many humanistic factors, such as the impact of social status and cognition on individuals’ absorption of information.
Weekley believes that information exchange and transmission itself is often interactive and two-way, so the connection between various elements in the S-C-R model should be two-way.
(5) A.И. Mikhailov’s generalized scientific communication system model
American sociologist H. Menzel proposed the famous "formal process" and "informal process" communication models. Under this model, information exchange in society is divided into formal communication and informal communication. This theory was compiled by Soviet intelligence scientist and educator A.И. Mikhailov into a broad scientific communication system model. This model divides scientific communication into formal communication and informal communication.
1. Formal communication: It is the exchange of information through the control of a scientific literature information system or a "third party". Advantages: ⑴ The information obtained is highly reliable; ⑵ Detailed and comprehensive scientific information on a certain topic can be found from a large number of documents; ⑶ There is no need to meet the information producer in person. Disadvantages: ⑴ Information transmission is not timely; ⑵ Finding scientific information through literature requires certain methods and skills.
2. Informal communication: refers to the exchange of information between scientists and researchers through personal contact, such as direct conversations with each other about their research, visits to peer and science and technology exhibitions, oral speeches to various audiences, exchanges Letters etc. Advantages: ⑴ short information interval; ⑵ strong selectivity and pertinence; ⑶ rapid feedback when transmitting information; ⑷ easy to understand the information obtained, and can give appropriate evaluation. Disadvantages: ⑴ The reliability and accuracy of information are difficult to test; ⑵ Only a few people often have the opportunity to participate in direct communication; ⑶ It is impossible to accumulate intelligence for future processing.
3. Information exchange mechanism
(1) The formation of social information flow
Social information flow is a ubiquitous social phenomenon. It is the flow process in which information is transferred from the producer R to the user R, through or without other social links, and finally to the user R. S and R are connected through social information flow.
(2) Stack in information exchange and dissemination
1. Information stack (W): It is the link through which information flows from S to R. It must be a human or artificial system, such as a publishing agency, information agency, broadcasting system, library, etc., whose function is to receive, process, store and deliver information.
2. Stack communication: The social information dissemination process with the participation of information stack is called "stack communication".
3. Zero-stack communication: The direct communication between S and R is called "zero-stack communication".
(3) Direct communication and social agent communication
According to whether there is an information stack involved in the communication process, information exchange can be divided into direct communication (zero-stack communication) and indirect communication (stack communication).
1. Social agents (intermediaries) in information exchange
It means that during the information exchange process, the two parties (S or R) cannot directly transmit the information to the other party (R or S), so they transfer the information exchange behavior to the information stack in society, and the information stack acts as the agent for S or R is used to transmit communication information to complete information exchange. The information stack is called the agent (A), and the stack communication is called social agent communication.
2. The difference between direct communication and social agent communication
⑴ Direct communication is the direct connection between S-R. Its media system is purely natural and objective, without the intervention of social factors. Social agent communication is communication that is completed with the intervention of the social system. Direct communication is vivid, intuitive, rapid feedback, and short time intervals, while social agent communication does not have these characteristics.
⑵ In direct communication, the source information (that is, the information provided by S) or the demand information (the information proposed by R) is directly communicated and is a direct connection between S or R, and its information content will not be interfered and controlled by the social system. The source information and demand information communicated by social agents need to be represented by A and affected by A. On the other hand, many full-time social agency information agencies have the function of supervising and evaluating the dissemination of information and are relatively rigorous, while direct communication is more arbitrary.
⑶ When S-R communicates directly, it is all synchronic communication, while in social agent communication, there is both synchronic communication and diachronic communication.
⑷ Direct communication is an S-R relationship, and social agency is an S-A-R relationship, but the S-A-R relationship can be converted into an S-R relationship because A is the agent.
⑸ When A has multi-level agents, according to its transitivity characteristics, the one agent directly contacted with it can be regarded as all agents.
(4) Information transmission mode
1. Multi-directional active transmission
In this way, S or A actively transmits the information it produces or collects to an undetermined recipient R in response to the needs of the entire society. Various catalogs, indexes, abstracts and other tools provided on the Internet belong to the multi-directional active delivery mode.
2. One-way active transmission
This method is that S or A transfers information to a predetermined recipient R based on investigation and understanding of user needs. Its main form is topic setting service. This is an advanced stage in professional information services. The information delivered is targeted and timely, and its effectiveness can be fully exerted. This method generally means that there is a fixed and relatively close cooperative relationship between S or A and R. S and A understand R's information needs, so that they can provide information accurately and timely.
3. Multi-directional passive transmission
This method does not have a determined recipient in advance. It is an information service developed by A for the entire society, such as library, information center reading and lending services, website information services, etc. Here, users who come to the information center or library and click on website information are active, while the information transfer itself is passive, so it is also called undirected passive transfer.
4. One-way passive transmission
This method mainly refers to the information consulting services carried out by S or A. S or A often do not understand who R is and what information they need, but accept R's consultation based on the knowledge, experience and information resources they have. Consultation is a question raised by R, which is passive to S or A, and the object of information transmission is a specific consultant R, so this kind of information transmission is called one-way passive transmission, also called directed passive transmission. .
Comparison of characteristics of the above delivery modes
⑴ Multi-directional active transmission is the most basic and important information transmission among all kinds of information transmission carried out by professional information services. It forms the core of all aspects of information service and information management work, and takes it as the central link to form the work flow of information management.
⑵ Multi-way transmission is generally easier than one-way transmission, because one-way transmission has specific goals and requires A to spend a lot of energy and time to understand the needs of specific users, which naturally increases the difficulty. Active delivery is easier than passive delivery because there is more time to prepare.
⑶ Directed delivery is the ideal goal of information services. Only directed delivery can information be delivered to specific users, thereby maximizing the effectiveness of the information. The largest number of directed transmissions is directed passive transmission. Directed passive transmission is based on multiple active transmissions. In turn, problems and shortcomings in multi-directional active transmission can be discovered, so as to improve them and make the information transmission process All aspects of the process are continuously improved and improved.
4. Network information exchange
(1) The rise of online information exchange
As an emerging information carrier, the network has many incomparable advantages compared with traditional carriers, mainly in:
⑴The network spreads and updates quickly, providing the fastest and most convenient way of communication;
⑵ The amount of information is large and the content is rich. The amount and type of information are unimaginable for traditional printed publications;
⑶Easy to retrieve, people can get the information they need through search engines, e-mails, forums, etc., which is easy to obtain and save;
⑷The combination of hypertext and multimedia, the data contains links to other multimedia forms of information, enriching the network information content and increasing network affinity;
⑸ Strong interactivity, which is one of the most unique features of online media. Through the interactivity of the network, users can obtain information in a timely manner as well as responses and feedback from other users, forming two-way communication.
(2) Models and characteristics of network information exchange
1. Network information exchange model
⑴In this model, information users are divided into four types, namely:
① The producers of network information are mainly producers of original information. They can be individuals or groups, such as governments, enterprises, etc.
②Network information publishers publish original information on the Internet through information technology means, allowing users to obtain the required information through browsing pages and information executors.
③Network information executor, as an intermediate link, executes the information needs of information users, and then transmits the executed information to users who request information.
④Network information users refer to individuals or groups who receive and use information.
Among the main communication channels, information produced by information producers is released to the Internet by information publishers. The Internet includes not only private networks and public networks, but also various other networks. Information users can use information through various channels. Since information exchange is a two-way process, information producers can also be information users, and information publishers can also be information performers. The same user can play multiple roles under network conditions. The producers and users of network information can overlap.
⑵According to the network information exchange model, the network information exchange process can be divided into three stages:
①Information generation stage. Information from information producers is published on the Internet through information publishers. Original information comes from a wide range of sources. On the one hand, publishers comprehensively establish and provide digital journals, provide remote, virtual digital collections, and retrieval, delivery and long-term preservation services to websites, forming a digital information resource system that can be widely used. On the other hand, information producers can produce information directly on the website. In this way, information producers also perform the duties of information publishers.
②Information exchange stage. Network information exchange activities between information producers and information users through the Internet or Web sites with the help of computers or terminals. From the perspective of information flow, the information exchange process of this model implies the following most basic information flow processes: a) Information producer-website-information producer, which means information communication and feedback between information producers and websites. b) Information producer-website-information user, which means that the information producer transmits the information it produces to one or more information users through the website. c) Information user-website-information user means that the information user communicates and provides feedback with the website. d) Information user-website-information producer, which means that the information user transmits his or her required information to multiple information producers through the website. These four most basic information flow processes can also be combined into different communication processes.
③Information acquisition process. The process in which information users directly browse web pages or make requests to the network through search engines, and obtain the required information through information executors.
2. Formal and informal communication in the online environment
⑴Traditional information exchange: ①Informal communication: refers to the direct transmission of information between information sources and information users, such as discussions, discussions, reports, exhibitions, demonstrations, visits, etc., with obvious individuality. ②Formal communication: refers to the indirect transmission of information to information users by the information department with the help of the document information system and documents as the medium, such as document collection, arrangement, storage, retrieval, synthesis, research, analysis, provision, etc. The basic The performance is the exchange of documents that have been officially published and distributed, and has an obvious social nature. The main difference between the two is whether the information source and information users transmit information through the document information system and use documents as the medium.
⑵ Information exchange in the network environment can be divided into: formal information exchange and informal information exchange according to whether there is the participation of full-time information personnel. ① In the process of information storage and transmission from information sources to information users, communication with the participation of full-time information personnel is formal communication; ② On the contrary, spontaneous individual communication without the participation of full-time information personnel is informal information exchange. The essence of this division method is based on whether the communication information is filtered and controlled. Strict review and control of the communication information and transmitting it through the information system is the formal communication process, and vice versa is the informal communication process.
⑶ Informal communication in the network environment has some obvious advantages over traditional informal communication: ① Informal communication on the Internet can not be restricted by time and space, is more convenient, and greatly improves the efficiency of communication. ② The interval of informal communication on the Internet is shorter, communication is faster and faster, and it is very convenient to discuss and communicate with peers in real-time. ③The cost of informal communication on the Internet is lower.
3. Stack in network information exchange
We collectively refer to the carriers, tools and platforms required for network information exchange as intermediaries for network information exchange. According to the degree of control that intermediaries have over the content of information exchanges, they can be divided into purely technical intermediaries, content selection control intermediaries and content complete control intermediaries. Correspondingly, network information communication can be divided into zero-stack communication, quasi-stack communication and stacked communication.
⑴①Purely technical intermediary: refers to a technical support system that only provides network information exchange and does not participate in the control of communication content. ②Zero-stack communication: refers to the participation of purely technical intermediaries in the process of network information exchange, which only provides physical and technical support and does not intervene or control the communication content, such as sending E-mail, QQ communication, and WeChat communication , mobile phone text messages, etc.
⑵① Content selection control intermediary: It refers to the intermediary that can filter and control communication content, but only deletes or blocks it without organizing the content, such as BBS, personal websites, blogs, news groups, etc. ② Quasi-stack communication: refers to the communication in which the content selection control intermediary participates. If the intermediary does not filter the communication content, it is equivalent to zero-stack communication for the user; if the intermediary deletes or blocks the communication content Operations are equivalent to stack communication for users.
⑶① Full content control intermediary: refers to an intermediary that collects, controls, organizes, stores, releases and manages network information. ② Stacked communication: refers to network information exchange involving an intermediary with complete content control. Such as major journal database providers, major websites, digital libraries and other institutions. They have a set of rigorous operating specifications for information reception, processing and transmission. They perform quality control, sequencing and other processing on information. They have the strongest control over information content and play a very full role as an information stack in the information exchange process.
(3) Mobile information exchange
Mobile information exchange is an emerging information exchange method with the rise and development of mobile networks. Mobile web refers to Internet-connected application or browser-based access to the Internet from a mobile device that can connect to a wireless network.
The characteristics of mobile networks are: ① continuous network connectivity; ② location awareness; ③ interactive functions.
(4) Scientific information exchange under network management
1. The meaning and characteristics of scientific information exchange
⑴Meaning
①Scientific information exchange process: refers to the information workflow in all links from the publisher of scientific information to the information user. It is an information exchange process specifically targeted at scientific information. Generally, it can be divided into several basic links such as the release, transmission, organization, storage, retrieval and service of scientific information.
②Formal communication: The process of communication through the scientific literature system and using scientific literature carriers is regarded as formal communication, also known as the indirect communication process.
③Informal communication: It mainly relies on personal contact between scientists and experts. It is called informal communication, also known as the direct communication process.
⑵Features
①The information process is condensed and integrated, functional role elements are downplayed, and the entire communication activity is dominated by information flow.
②The speed of information flow is accelerated and the traffic volume is increased.
③Virtualization of information organizations.
④The number of information exchange methods increases, and the status of informal information exchange increases.
2. Tools and applications for scientific information exchange
ResearchGate, Mendely, Academia, Science Network, academic public accounts, etc.
Replenish
The manifestations of information distortion and its main causes
⑴Main forms: ①Physical distortion; ②Semantic change; ③Pragmatic attenuation
⑵Main reasons: ①Technical or channel obstacles; ②Excessive information stack; ③Social factors; ④Natural factors
Chapter 3 Information Distribution
1. The Matthew Effect in Information Generation and Distribution
(1) The manifestation and function of the Matthew effect
1.Matthew effect
In the process of generation, transmission and utilization of social information flow, we find that information and related factors often show obvious core trends and centralized orientations. For example, a few publishing houses will become the authoritative publishing institutions of certain types of books, and a few authors will write the most because they write the most. The one with the highest citation rate becomes the core author in a certain field, a few websites gather a large number of users, etc. This is the Matthew effect in the generation and distribution of information.
The Matthew Effect is ubiquitous in human society and has been widely extended and applied. The Matthew Effect truly summarizes the inertia in human social life and describes the accumulation process of advantages and disadvantages: once there is an advantage, this advantageous situation will continue to strengthen; conversely, if there is a disadvantage, the disadvantageous conditions will continue. intensified.
2. Performance of Matthew Effect in Document Information Distribution
The information-enriched distribution caused by the Matthew effect manifests as core trends and concentrated orientations.
⑴ Core trends: For example, the formation of a group of highly productive authors, the increase in information density in journals, and the establishment of high-frequency vocabulary are all the result of the active expectations and practical steps taken by information production entities.
⑵ Concentrated orientation: For example, a paper has been cited many times, a website has been clicked by many users, and certain books have been borrowed frequently. This enrichment is the result of social selection and influence, and information producers are basically in a passive state.
(2) The influence of Matthew effect
1. The positive impact of the Matthew Effect
⑴In actual information management work: it can help us highlight key points, abandon averages, provide a basis for the selection, acquisition, evaluation and utilization of information sources, and provide guidance and methods for reducing information management costs and improving information utilization benefits.
⑵ In theory: The advantages described by the Matthew Effect can help us understand the characteristics, trends and laws of information concentration and dispersion, and discover the basic laws of information management.
2. Negative impact of Matthew effect
⑴The Matthew Effect describes the excessive accumulation of advantages and disadvantages of information objects, which easily leads information workers to select, evaluate, disseminate and utilize information based on simple advantages, and goes to extremes, causing information workers to stick to the old ways, not thinking about progress, and not seeking for more. Change and innovate, and engage in complex and ever-changing information management work based on experience and simple rules.
⑵The information distribution enrichment formed by the Matthew effect is sometimes only superficial and external. For example, the number of citations for a paper does not necessarily mean that it has a high value. Some papers with wrong views or controversial views may also have a high citation rate. Some advantage accumulation processes are sudden and affected by statistical factors, which distorts the value of information.
⑶The core information source is the result of the accumulation of Matthew Effect advantages, but if you focus too much on the core information source, you will ignore the valuable information distributed in other information sources. The core information sources (such as core journals) themselves have a development process. If they are not controlled and their advantages in a certain aspect are allowed to accumulate excessively and become highly specialized, the information they contain will become more and more monotonous, which will be detrimental to some users. It may be beneficial, but it will lose more users and is not conducive to information management and information services.
⑷The growth and number of information producers have a great impact on the generation and distribution of information. The Matthew effect favors celebrities and rejects the habitual force of newcomers, which is not conducive to the growth of newcomers and limits the generation and dissemination of new ideas, new knowledge and new information. Taking the field of science and technology as an example, the emergence and growth of so-called outstanding newcomers are very difficult due to the Matthew effect, unless they make epoch-making results that are astonishing. However, step-by-step is the universal law of talent growth, and a blockbuster also requires long-term accumulation, social support, and continuous breakthroughs. It has been proven that the best creative age range for humans is between 25 and 45 years old. However, due to the parallax of the Matthew effect, newcomers in this age group and their achievements have been ignored, and they have not been effectively integrated into the social information flow. .
From the above analysis, it can be seen that the accumulation of the Matthew effect must be moderate and cannot be allowed to develop. In order to ensure that the generation and distribution of information are more scientific and reasonable, appropriate intervention is necessary.
2. Discrete distribution rules of information content
(1) Bradford’s Law
Bradford, a famous British literature and information scientist, found that the distribution of relevant papers in a certain subject field in journals is uneven and has obvious concentration and dispersion patterns. Based on long-term observations and statistics, he proposed the famous "Bradford's Law of Dispersion", referred to as Bradford's Law or Boucher's Law: "If scientific journals are measured by the number of papers they publish on a certain subject, Arranged in descending order, it is possible to distinguish among all these journals a 'core' section with the highest number of articles and subsequent sections containing an equal number of papers as the core section. The journals contained in the core section and subsequent sections can then be distinguished The relationship between counting to 1:a:a² (a>1)
(2) Zip's law
If each word in a long article (approximately 5,000 words or more) is arranged in descending order of its frequency of occurrence (high-frequency words first, low-frequency words last), use natural numbers to rank these words with the highest frequency. The first is level 1, followed by level 2...and so on until level D. If f is used to represent the frequency of a word appearing in an article, and r is used to represent the grade number of a word, then fr=c, and c is a constant. The above equation is called Zip's law. (Revealing the distribution rules of information through word frequency distribution)
(3) Content distribution of network information
Under network conditions, the distribution of information resources still satisfies the laws of concentration and dispersion, but the information publishing environment under network conditions is very different from that of traditional journal publishing, so the degree of concentration and dispersion has shown changes.
3. Distribution rules of information producers
(1) Lotka’s law
In 1926, the American statistician Lotka published a paper titled "The Frequency Distribution of Scientific Productivity" after extensive statistics and research, aiming to ascertain the productivity and productivity of scientific and technological workers through statistics of published works. Contributions to scientific and technological progress and social development. It was not until 1949 that this research result attracted academic attention and was known as Lotka's law. According to Lotka's analysis, there is the following relationship between the number of papers x and the number of authors yx: xⁿyx=c, which can be further written as: f(yx)=c/x², which is the famous inverse square distribution.
(2) Price’s Law
Inspired by Lotka's law, Price proposed the "square root law" of the distribution of core producers, that is, in a specific field, half of all papers are written by those who have the square root of all authors in the field. "Those people" are undoubtedly the core or prolific authors. This law can be expressed as: ∑ (m 1~1) n (x) = √N, which succinctly describes the distribution of information according to the capabilities of producers.
(3) Production and distribution of network information
Information production under network conditions shows different laws. The distribution of producers is closely related to the degree of specialization of information content. The higher the degree of information specialization, the more obvious the concentration of producers; conversely, the degree of dispersion of producers The more obvious it becomes.
4. The distribution of information over time
(1) Index growth rate
1. Price Curve: If we take the amount of literature as the vertical axis and the historical era as the horizontal axis, draw the amount of literature in different eras point by point on the coordinate chart, and then connect the points with a smooth curve, then we can A very approximate representation of the growth pattern of literature over time is the famous Price Curve. Through curve analysis, Price was the first to notice that literature growth is an exponential function of time.
2. Growth curve: Price pointed out that taking into account material, economic, intellectual and time influences and limitations, the growth of literature information is closer to the biological growth curve, that is, the initial growth or reproduction is very fast, and over time As time went by, its growth rate became slower and slower, so that it almost stopped increasing.
(2) Gradual obsolescence rate
1. Document half-life: refers to the period of time during which half of all the documents currently used in a certain subject field were published. "Half-life" is roughly the same as the time it takes for half of the literature in a subject area to expire.
2. Price Index: It is the ratio of the number of citations to documents published less than 5 years ago to the total number of citations in a certain subject field. Generally speaking, the larger the Price index, the smaller the half-life and the faster the literature ages.
3. Factors affecting the aging of literature information
⑴ Literature growth: The aging of literature is first of all linked to the growth of literature. The large increase in documents shows that the growth rate of scientific knowledge is accelerating. New theories, new methods, and new designs are constantly being produced and continuously improved. The original knowledge content was not comprehensive and the incomplete old documents were gradually forgotten by people, and the frequency of citations decreased. If you use the concept of half-life, you will find that the faster the literature grows, the shorter the half-life of the literature is, which accelerates the aging of the literature.
⑵Discipline differences: The aging rates of literature in different disciplines vary greatly. In some disciplines, such as electronics, medicine, chemistry and chemical engineering, due to active research work, new methods and new experiments are constantly needed. More manpower and material resources are invested, and knowledge is updated quickly, so the half-life of documents is short. The development of other disciplines, such as animal and plant taxonomy, geography, etc., is mainly the accumulation of knowledge, not revision, so it is relatively more stable.
⑶Differences in the development stages of disciplines: Even in the same discipline, the half-life of documents is not exactly the same in different periods or stages. The subject is in the early stages of birth and development, and the amount of literature is growing exponentially. The aging of the literature is in line with a negative exponential function, and the corresponding curve is a negative exponential curve. With the deepening of subject research and entering a relatively mature period, the growth of scientific literature no longer continues to maintain the original exponential rate increase. The growth rate of the literature becomes smaller, and its corresponding aging curve also becomes gentle, and the half-life is lengthened.
⑷Information environment and needs: Different information users have different needs for documents. The backbone of scientific research work needs to understand the world's cutting-edge in the subject field and is therefore interested in new documents. Those who have just started working need to understand the background. material. In addition, the age of use of documents is not exactly the same between countries and regions. Scientifically developed countries are interested in the latest, while less developed countries need to learn from the existing results and experiences of other countries.
(3) Time distribution of network information
1. The growth pattern of network information
⑴The growth pattern of the number of web pages on a single website. In a single website that is in the mature stage and only has a formal communication process, the cumulative number of web pages has a quadratic curve growth pattern.
⑵The growth pattern of the number of web pages in a specific subject/topic area. The exponential growth model has strong applicability in studying the growth patterns of online academic information in the subject area of knowledge management in a network environment.
2. Aging rules of network information
The aging of network information means that the effective value of intelligence in network information gradually decays with the passage of time, and the utilization rate gradually decreases. The main factors causing network information aging include:
⑴The growth of network information. Due to the rapid increase in the total amount of network information, the utilization rate of unit amount of network information decreases, the utility value decreases, and it shows the characteristics of aging.
⑵Update of network information. The emergence of new network information causes the aging of old network information to accelerate.
⑶Disappearance of network information. It means that network information is deleted from the system and can no longer be accessed and used, which means the end of the network information life cycle.
⑷The pragmatic attenuation of network information. Pragmatics refers to the utility value of network information to the recipient. As the receiver's needs are realized, although the objective value of network information does not change, the utility value of the receiver decreases because he has accepted similar information. In addition, the value of network information is limited by time.
⑸Others. Internal factors such as the type, nature and development stage of network information; external factors such as technological progress, changes in user needs and changes in the social environment, etc.
3. Half-life of network information
⑴ Refer to the definitions of cited half-life and citation half-life in literature half-life
① Network information citation half-life: refers to how recently the newer half of all the information cited or linked to a certain site in a certain period of time was published.
② Network information citation half-life: refers to how recently the newer half of all information cited or linked to by all information on a certain site within a certain period of time was published.
⑵ Define the half-life of network information based on the characteristics of network information research objects
① The half-life of macro network information resources: refers to how long it takes for the newer half of the resources contained in the chain of links contained in network information resources in a certain subject area to be generated.
②Microscopic network information resource half-life: refers to how long it takes for 1/2 of all other network information resources pointing to this network information resource to appear after release.
4. Network information life cycle
It refers to the various stages and the entire process that network information goes through from generation to loss of utility value.
Chapter 4 Information Acquisition
1. Concept, attributes and characteristics of information sources
(1) The concept of information source
Information sources: the results and various original records produced by people in scientific research activities, production and business activities, cultural activities and all other activities, as well as the finished products obtained by processing and sorting these results and original records (such as databases, abstract index magazines, etc.) They are all sources through which information can be obtained, referred to as information sources.
(2) Attributes and characteristics of information sources
1. Basic attributes of information sources
⑴Objectivity
The information contained in the information source is a reflection of all human thinking and social activities. The information source is the material carrier of human knowledge and memory and is objective. First, it is an objective object; second, it is a reflection of the existence of knowledge contained in the information source.
⑵Transferability
The information source is the first element in the information dissemination process. It can only play its role if it reaches the receiver through dissemination and is utilized.
⑶Activatability
Information can be perceived, identified and utilized. The process of the human brain perceiving, thinking, analyzing, synthesizing and associating the information connotation of the information source is to activate the information in the information source so that it is always in a cycle of continuous dissemination and use. During the circulation process, information can be continuously adjusted, supplemented, adapted and reorganized to make the content it contains more targeted.
2. Characteristics of information sources
⑴The accumulation of information sources
Since information records human knowledge on material carriers, we can use material means to collect, organize, and accumulate it, so that the knowledge, culture, and technology created by humans can continue to be continued, inherited, and developed.
⑵Complexity of information sources
Since information is universal in all matter, the types of information sources and the forms of carriers are complex. It is huge in quantity, rich in content, and diverse in form. It is growing rapidly with the development of human society and the advancement of science and technology.
⑶Reproducibility of information sources
Information sources are different from other material wealth. They will not be consumed after use, but will produce "value-added phenomena". At the same time, the information source itself can also be regenerated, producing secondary and tertiary information sources from the original information source.
⑷Sharing of information sources
The recipient of the information source is not an exclusive user. The information source can be spread to different recipients for simultaneous use and is shareable.
2. Classification and characteristics of information sources
(1) Recorded information sources (documentary information sources)
Recorded information sources: including knowledge information recorded and stored by traditional media and various modern media, such as various books, journals, databases, websites, etc. The characteristics are: dissemination of information system, easy to save, easy to accumulate, easy to use. It is the basic form of information existence and the main object of information management.
1. According to different material carriers
⑴Printing type: including lead printing, mimeograph, offset printing, woodblock printing, etc. This is a traditional form with a long history. Due to the convenience of reading and use, it is still widely circulated and has become the main form of disseminating information. Its disadvantages are that it is large in size, heavy in weight, takes up a lot of space for collection, and is difficult to manage.
⑵Microfilm: Generally refers to documents and materials that use photosensitive materials as carriers and use optical recording technology to reduce printed documents in proportion, including microfilm, microfilm, microfilm cards and other microforms.
⑶Computer reading type: It converts text and images into binary digital codes and records them on carriers such as tapes, disks, or optical disks. When reading, the computer outputs it and converts it into text or images. It can store large amounts of information and retrieve the required information very quickly.
⑷Network type: Network information source: It is a new type of digital information resource. It is the sum of various information resources that can be utilized through the network. It refers to all information in various forms such as text, images, sounds, animations, etc. in the form of electronic data. Resources stored in optical, magnetic and other non-paper media carriers and reproduced through network communications, computers or terminals. It is directly generated, published, stored and disseminated online, such as various online books and periodicals, online news, website information, etc., and can be downloaded and stored on other carriers.
2. According to different contents, properties and processing conditions
⑴ Primary information: All original information produced in scientific research, production, operation, culture and other various activities is called primary information, such as journal articles, research reports, market survey reports, patent specifications and various network information.
⑵Secondary information: It is a type of information generated after processing and sorting primary information, such as bibliography, bibliography, introduction, abstract and other forms of retrieval tools. The important role of secondary information is not only in reporting, but more importantly, in providing clues for finding primary information.
⑶ Tertiary information: It is information generated through analysis and synthesis based on primary and secondary information. People often refer to this type of information as the results of information analysis research, such as reviews, reviews, annual subject summaries, literature guides, bibliographic books, etc.
3. According to the different characteristics of editing and publishing forms
Books, journals, conference materials, specialized reports, patent materials, government publications, dissertations, product samples, archives, standards, news newspapers, etc.
(2) Physical information sources
Physical information sources exist in nature and artificial artifacts. They are knowledge information carried and stored by physical objects. People can communicate and spread them through practice, experiment, collection, visit, etc., such as samples of certain organisms, product prototypes, handicrafts, etc. The physical information source is intuitive and vivid, contains rich information, and is easy to understand and absorb. This type of information source cannot enter the information system directly. To manage it, it must first be converted into recorded information.
(3) Intellectual information sources
This type of information source mainly refers to the knowledge information stored by the human brain, including all kinds of knowledge, know-how, skills and experiences that people have. Some of them can be clearly expressed and recorded in language and words, while others are difficult to express and record clearly, so Also known as tacit knowledge. This type of information is carried by human activities, and various consulting services are provided according to social needs to help users solve problems. Such information sources are mainly organized, coordinated and managed through policies and regulations.
(4) Zero information source
Zero-time information: It is information that people obtain through direct communication. It is the result of the content of the information object directly acting on people's senses (including hearing, seeing, smelling, and touching), unlike recorded information and physical information that are passed through a certain The basic form of a material carrier acts. Therefore, zero-time information has typical characteristics such as directness, timeliness, novelty, randomness, and non-storage.
This type of information source refers to information spread by people orally through various channels. The existence form and communication channels of zero-time information are highly random and difficult to store and systematically accumulate, which brings great difficulties to the management of this type of information. Special methods need to be used to collect, record, organize and store it.
3. Evaluation of information quality
The significance of information quality evaluation
Information quality evaluation is oriented to user needs and satisfaction, and evaluates the quality of information from multiple angles and levels. Its purpose is to provide people with high-quality information resources to better meet the needs of information users. It is very important. important theoretical and practical significance.
First, information quality evaluation is conducive to improving the utilization rate of information sources; second, it can promote the improvement of the quality of information sources; third, it can objectively reflect the social influence or academic influence of information sources; fourth, it can improve the quality of information sources. The efficiency of resource management; fifth, the ability to purify the information environment.
(1) General dimensions and indicators of information quality evaluation
Information quality is a comprehensive concept that includes four dimensions: accuracy, completeness, consistency and timeliness.
1. Evaluation dimensions and indicators based on information content
The evaluation of information content is the judgment and estimation of the inherent quality of information resources. It is the most fundamental and widely applicable evaluation dimension and mainly involves the four indicators of correctness, completeness, relevance, and novelty.
2. Evaluation dimensions and indicators based on information expression form
Information is mainly expressed and transmitted through symbols, and information users can only understand and utilize information through the form of information expression. Therefore, the expression form of information is an important dimension to evaluate the quality of information resources, including the four basic evaluation indicators of accuracy, understandability, simplicity and standardization of information expression.
3. Evaluation dimensions and indicators based on information resource systems
The information resource system is composed of various information resource activity elements. The performance of the system is a direct reflection of the quality of the information. The evaluation dimensions based on information resource systems mainly involve three indicators: information accessibility, rapid response and reliability.
4. Evaluation dimensions and indicators based on the effectiveness of information resources
This is an evaluation of information quality from the perspective of information users. This evaluation method can identify target users and find the direct gap between the content of information resources and user expectations. It mainly includes three important indicators: availability, appropriateness and value-addedness of information.
(2) Methods and implementation of information quality evaluation
1. Evaluation and selection of information sources
⑴ The evaluation of information sources must achieve two goals: ① Determine which information carrier and information source the information is obtained from; ② Determine the intention and reliability of the information.
⑵After collecting various information, first sort them according to the type of information source. Then check whether the information carried by various information sources is correct and reliable, whether there are certain restrictions attached, etc. Organizing information according to information sources not only allows a good grasp of its classification categories, but also enables systematic testing and evaluation of the meaning and value of the information.
⑶In addition, various types of information sources can also be divided according to the purpose of using the information sources. For information sources that are comprehensively utilized, the forms of comprehensive utilization should also be separated by category according to their different purposes. This can order the information sources and conduct regular information tracking and monitoring on those with higher usage rates.
2. Basic methods of information quality evaluation
⑴Qualitative evaluation method
Qualitative evaluation method: Based on the purpose of information evaluation and the needs of the service objects, and based on certain criteria, the evaluation standards are first established, the relevant index system is established, and then the evaluation objects are evaluated.
Common qualitative evaluation methods of information quality include: questionnaire method, interview method, observation method, comparison method, simulation method, and peer review method.
⑵Quantitative evaluation method
Quantitative evaluation method: It is to objectively express the indicators reflecting the quality of information with specific numbers and formulas, mainly using statistical, mathematical or economic model methods.
Commonly used quantitative evaluation methods include: information entropy evaluation method, informetrics evaluation method (bibliometrics, webometrics), statistical evaluation method
⑶ Comprehensive evaluation method
Comprehensive evaluation method: It is an evaluation method that combines qualitative and quantitative methods.
Commonly used comprehensive evaluation methods include: analytic hierarchy process, fuzzy comprehensive evaluation method, correlation matrix method, evaluative metadata method, information construction method, etc. They are important methods for conducting information evaluation, especially the evaluation of network information resources.
3. Implementation of information quality evaluation
⑴Clear evaluation goals
The evaluation of information quality involves many aspects such as information, information services, and information users. At the beginning of the evaluation, the specific objects of evaluation should be clearly defined and the main goals of the evaluation should be determined.
⑵Analysis, argumentation and evaluation objects
Collect relevant data and information from various channels according to the evaluation objectives, conduct a comprehensive analysis of various elements and performance characteristics of the evaluation objects, demonstrate the necessity and feasibility of the evaluation objects, establish an evaluation index system, and formulate evaluation criteria.
⑶ List and evaluate plans
Program evaluation is the central link in information resource quality evaluation activities. Each program to be evaluated should have clear evaluation elements, evaluation methods, etc.
⑷Choose the best solution
The best solution is usually chosen based on the principles of economy, convenience, and usefulness.
⑸Evaluate
4. Ways to obtain information
(1) Efficiency indicators of information acquisition
These indicators are generally determined by the degree of relevance of information content to information needs, that is, the concept of relevance or relevance.
⑴Collection rate
The acquisition rate is used to measure the completeness of the collection of relevant information. It refers to the proportion of all relevant information contained in an information system to all relevant information inside and outside the system at that time. The information collection rate can be expressed as: P=r/R
⑵Accuracy rate
The accuracy rate is used to measure the pertinence of information collection. It refers to the proportion of all relevant information contained in an information system among all the information in the system at that time. E=r/Q
⑶Timely rate
Timeliness is used to measure the speed of information collection, that is, the ability to complete the information collection process in the shortest time. It is calculated by the total time spent on each step of the collection process, from the generation of information to its entry into the information base. T=∑(i=1,n)t
⑷Expense rate
The expense ratio is used to measure the financial efficiency of information collection and refers to the minimum ability to spend per unit of information in the information database. It depends on the organization of the collection process, the technical equipment of each link and other factors. C=F/G
⑸Labor consumption rate
The labor consumption rate of information collection refers to the minimum amount of labor consumed per unit of information collected by the information system, and can be calculated by the total labor consumption of all links in the collection process. L=∑(i=1,n)li
Among the above efficiency indicators of information collection, when the collection rate and accuracy rate reach a certain level, there will be a reverse and reciprocal relationship between the two, that is, under the given time rate, expense rate and labor consumption rate, In this case, the higher the collection rate, the lower the collection accuracy rate; conversely, the higher the collection accuracy rate, the lower the collection rate will be.
(2) Principles of information collection
1. Targeted
⑴The purpose of any information service agency collecting information is to provide it for use. Therefore, the scope and focus of information collection must be determined in a targeted manner based on the nature, tasks and service objects of the agency.
⑵In order to collect information accurately, information needs and information sources must be carefully investigated. ① First, it is necessary to clarify the scope of the service objects of the information institution, as well as their economic, technological, cultural and other history, current situation, trends and their demand and utilization of information; ② Secondly, it is necessary to clarify the distribution of the information they need Situation; ③Finally figure out through what channels and what carriers to obtain this information. In this way, we can be well-informed when collecting information, strengthen pertinence, and avoid blindness.
2. Systematic
⑴In order for the information service system to effectively carry out information services and meet the information needs of users, it needs to track the development process of science, technology and national economy, understand and master the dynamic changes of information sources, and systematically and continuously collect and accumulate relevant information. The continuity and systematicity of information are the prerequisites for information to exert its effectiveness, which can be divided into vertical systems and horizontal systems. The vertical system accumulates according to products, processes, disciplines, majors or topics; the horizontal system accumulates according to the carrier or type of information.
⑵ Only by continuously and systematically collecting and accumulating information can the effectiveness of information services be fully exerted. In particular, the collection of key information sources should strive to remain systematic, coherent and complete. If the information collected and accumulated by an information service system lacks continuity, systematicness and integrity, it will not be able to meet the needs of users when needed, thus causing the information service to lose credibility.
3. Foresight
⑴ As the growth and aging of information continue to accelerate, when collecting information, we must not only pay full attention to existing information sources and information channels, but also focus on the future and foresee possible new information sources and information channels. Only in this way can we not only meet the current information needs, but also adapt to the information needs brought about by future development.
⑵ To this end, we should pay close attention to the development level, trends and trends of science, technology and economy, keep an eye on the emergence and development of new information sources and information channels, predict the concentration and dispersion, growth and aging trends of information sources, and fully estimate the future needs of users. information needs, collect information in a predictive manner, and improve the initiative of information services.
4.Scientific
Contemporary information sources are huge in number, in various forms, with repetitive and scattered content, and in various varieties, which brings great difficulties to the selection and collection of information. Therefore, it is necessary to frequently use scientific methods to study the distribution patterns of information sources, and select and determine information sources with high information density and large information content.
5. Planning
⑴ Any information organization that wants to use limited manpower, material resources and funds to obtain the most effective information sources must formulate a relatively thorough and detailed information collection plan in advance so that information can be collected in a purposeful and step-by-step manner according to the plan.
⑵Information collection plans can be divided into long-term, medium-term and short-term plans or annual and quarterly plans from the perspective of time; from the perspective of content, they can be divided into comprehensive plans, special plans and supplementary plans. The items of the collection plan generally include: the content scope of the collected information, the division and proportion of key information and general information, procurement standards, budget, main measures and guarantees for completing the plan, etc.
(3) Methods and approaches for information collection
1. Procurement
Procurement is a regular, stable, systematic and effective way to collect and obtain information, and is very suitable for both groups and individuals. Procurement includes specific methods such as ordering, cash purchase, mail order, entrusted purchase, etc. You can choose according to the purchase scale and your own needs.
2.Investigation
The survey method can not only obtain recorded information sources, but also non-recorded information sources, especially zero-time information. The investigation includes both interviews and on-site inspection and collection of physical objects and sites, such as on-site surveys, interviews, questionnaire distribution, sample collection, etc.
3.Exchange
Use the information sources owned by this system to exchange with other units or systems, adjust and complement each other, and expand information sources, including traditional document information exchange and network information or data exchange.
4. Request
For unpublished information or information sources that are not circulated through formal channels, as well as information sources that have been published or publicly circulated but are not detailed, complete and comprehensive enough, we can obtain them through correspondence or direct contact as needed and possible. A way to become a request.
5. Search
That is, querying the required information from various databases and information systems manually or by computer. Manual search mainly obtains clues to information sources through various search tools. If original information is needed, a secondary search is also required. Computer retrieval can not only obtain clues to the information source, but also directly obtain the original information.
6. Web crawling
Use a web robot or web spider to regularly go to designated websites to automatically crawl part or all of the web information.
5. Acquisition and Utilization of Network Information
(1) Network information sources and their characteristics
Network information source: It is a new type of digital information resource, which refers to the sum of various information resources that can be utilized through the network. Specifically, it refers to all forms of information such as text, images, sounds, animations, etc. that are stored in the form of electronic data in optical, magnetic and other non-paper media carriers, and reproduced through network communications, computers or terminals. H. Compared with traditional information sources, the main characteristics of online information sources are:
1. Large amount of information and rich content
The Internet is an open data transmission platform with a huge number of information resources of various types, such as academic, business, government, personal, entertainment, news information, etc. On the one hand, it provides users with a large space for information selection; on the other hand, a large amount of worthless and redundant information also brings a lot of trouble to users.
2. Information is updated in a timely manner and changes are accelerated
Due to the development of network technology, compared with traditional information sources, network information sources change more rapidly and novelly, and the amount of data is constantly increasing.
3. Diversified forms of information expression
The Internet has information resources with rich forms of expression, such as sounds, images, text, videos, animations, etc. While expressed in multimedia forms, the interactivity between users and information has been greatly enhanced.
4. Non-linear arrangement of information and enhanced disorder
Network information sources use hyperlinks to form a three-dimensional network information chain, linking information from different countries, different regions, different contents, and different formats through nodes, thereby enhancing the correlation between information. But at the same time, the state of disorder is also increasingly prominent.
(2) Obtaining network information
1.Search engine
As an important way to organize network information and an important tool for network information retrieval, search engines build their own databases by extracting information from various websites on the Internet and provide query services to users. It generally consists of three parts: information searcher , indexer, retriever.
(Search engine: refers to a retrieval tool that accepts user questions, searches the database, and feeds back information objects matching the user questions to the user. Broadly speaking, a search engine not only refers to the retrieval program itself, but also refers to the retrieval interface, related the portals, programs, and index databases and services that support it).
2.Digital library
Digital library: A library whose collections are stored in digital format and can be accessed using computers. Digital content can be stored locally or accessed remotely via a computer network. A digital library can be said to be an information retrieval system.
(Digital library: A physical or virtual information institution or group of information institutions that preserves electronic documents stored in digital format, transmits the stored digital information through computers and networks, and provides virtual links to online information and provides services.)
The main resource types of digital libraries include: ① Full-text resources, including electronic journals, electronic books, electronic newspapers, open works retrieval, publication and archiving databases, electronic PhD and master's theses, electronic archives, etc.; ② Secondary literature and bibliographic information, Including online search catalogs, abstract index databases, etc.; ③ Multimedia resources, including independent static images, dynamic images, audio recordings, animations, etc.
3. Subject information portal
Subject information portal: It is a single entrance or channel for users to access the resources and services of a certain subject. It is a network service that is used to complete the high-level organization and integration of network resource content in this discipline and the aggregation of network applications, and integrate these resources and applications into a customizable and personalized interface to meet the needs of each end user. need. From a user's perspective, it is the starting site or entrance for users of a certain discipline to access the network resources and services of that discipline.
The core features of the subject information portal: ① Integration of information and applications; ② One-stop cross-system retrieval; ③ Simple unified interface; ④ Customizable.
4. Open data platform
In the context of big data, government agencies and alliances have formulated countermeasures to deal with big data strategies, with open data as an important component or even core. In the field of scientific research, open data is becoming a key player in promoting digital scholarly communication and scientific research. In line with the development trend of open data and driven by policies from all parties, more and more research data warehousing platforms have been established and provide public access to data.
5. Social question and answer platform
Social question and answer platform: also known as question and answer community or question and answer platform, etc., it is a community-based platform that encourages user participation to achieve online interaction. Its essence is to realize user interaction through information sharing and dissemination and the disclosure of the question and answer process and results. collaboration among them.
Chapter 5 Information Organization
1. Basic principles of information organization
(1) Concept and content of information organization
1. The concept of information organization
Information organization: also known as information sorting, is the use of certain rules, methods and technologies to reveal and describe the external characteristics and content characteristics of information, and arrange them according to given parameters and sequence formulas, so that information can be transformed from an unordered set into The process of ordered collection.
2. Basic objects of information organization
⑴External characteristics of information
It refers to the information object directly reflected by the physical carrier of information, which constitutes the external and formal characteristics of the information, such as the physical form of the information carrier, title, author, publication or release date, circulation or dissemination mark, etc.
⑵Content characteristics of information
It is the content contained in the information, which can be expressed by keywords, subject headings or other knowledge units.
3. Basic content of information organization
⑴Information selection
The purpose of information selection is to identify useful information and eliminate useless information from the collected, disordered information flow. It is the first step in the entire information organization process.
⑵Information analysis
Information analysis is an information activity that refines, mines, processes, organizes and categorizes the internal and external characteristics of selected information from the semantic, pragmatic and grammatical perspectives according to certain logical relationships. It is the premise and basis for information description and disclosure, and directly affects the quality of information organization.
⑶Information description and disclosure
Information description, also known as information resource description, refers to the activity of selecting, analyzing and recording the subject content, formal characteristics, material form, etc. of information resources based on the needs of information organization and retrieval. Information description and disclosure are mainly divided into two types: one is description, which mainly describes the formal characteristics of documentary information; the second is indexing, which mainly reveals the content characteristics of documentary information.
⑷Information storage
Information storage is an information activity that stores processed and sequenced information in a specific carrier in a certain format and order. The purpose of information storage is to facilitate information managers and information users to quickly and accurately identify, locate and retrieve information. The storage of information in various retrieval tools means the end of the information organization process, which also means the beginning of information retrieval.
(2) Type of information organization (classification based on information or information sources)
1. Divide according to the form of information expression
① Text information organization ② Image information organization ③ Sound information organization ④ Video information organization
2. According to the degree of information processing
① Primary information organization ② Secondary information organization ③ Tertiary information organization
3. Divided according to the communication carrier of information
① Document information source organization ② Network information source organization
(3) Principles of information organization (four basic principles)
The objects, methods and processes of information organization are not simple and uniform. This complexity and variability require us: whether it is the macro information organization of social information flow or the micro information organization of branches of social information flow, it must be done to a certain extent. Under the guidance of scientific principles, the randomness, unplannedness, blindness and other phenomena of information organization work should be effectively avoided, so that information organization can truly play the role of organizing information, scientific diversion, promoting selection and ensuring utilization.
1. Principle of objectivity
⑴The principle of objectivity takes the lead in determining that the data source for information description and disclosure must be the objectively existing information itself. At the same time, the principle of objectivity also puts forward corresponding requirements for our specific information organization work:
⑵In information organization, we cannot damage the original function of information, do not distort the information itself, and do not dismember the information itself. You cannot add some inaccurate thoughts and opinions artificially without any basis, and you must reflect the objective characteristics of the information completely, comprehensively and accurately.
⑶The principle of objectivity also requires us to continuously track the development and changes of information sources and information organization technology, so that information organization remains objective and consistent with changes in conditions and changes in the environment.
2. Systematic principles
⑴ Without systematic information organization work, it is impossible to achieve its overall goal. In order to achieve systematic information organization, we must grasp four relationships:
①The relationship between macro information organization and micro information organization. Information organizations must not only base themselves on their own actual conditions, but also consider the market share and division of labor of information organizations within a larger scope, thereby forming a complete macro-information organization work system.
②The relationship between the information organization department and other departments. It is necessary to pay attention to the collaborative relationship between various departments within the information organization, especially the relationship with the two types of departments engaged in information collection and information dissemination, as well as the administrative department that conducts daily management of the organization, so as to ensure that information can be ensured in an environment where the information organization operates well. The one-stop workflow of collection - information organization - information retrieval - information dissemination is smooth.
③The relationship between various aspects of information organization work. The information organization work itself also has multiple links. Maintaining the normal connection of each link will help the information organization work to be closely linked. Among them, special attention should be paid to the basic position of information analysis and description, which is the key to information disclosure and storage. necessary preparations.
④The relationship between different information processing methods. Due to the particularity of information itself, when organizing a certain type of information, we must be fully satisfied with this particularity, but we must also fully grasp the great similarities of various information processing methods, and use unified and standardized processing as much as possible method.
⑵ Using systematic perspectives and methods to coordinate and manage information organization work will help bring into play the overall advantages of information organization and also help realize the overall function of information organization.
3. Purpose principle
⑴ Information organization has a clear purpose and must work around the information needs of users, pay attention to the demand status and changing characteristics of the target market of the information organization, and meet the principle of cost-benefit symmetry.
⑵ Therefore, information organization work must actively carry out user research, fully understand user needs, and improve information organization methods so that information organization results can greatly facilitate user selection and utilization.
⑶ In addition, in order to achieve the goals of information organization, attention must also be paid to the planning and long-term nature of information work, as well as the adaptability to the nature, characteristics and capabilities of the information organization itself.
4. Principles of modernization
⑴The principle of modernization of information organization includes two aspects: modernization of ideological concepts and modernization of technical means.
⑵The modernization of ideological concepts of information organization is concentrated in the standardization of information organization. The standardization of information organization is mainly reflected in the unity of information organization work, the standardization of information organization methods, the compatibility of information organization systems and the universality of information organization results. The standardization of information organization mainly includes: ① basic terminology standards; ② relevant information technology standards; ③ information organization technology standards, and ④ other related standards.
⑶ The modernization of technical means of information organization has fully proved that the widespread promotion and application of modern information technology in information organization has formed an irreversible direction of automation of information organization. Automatic indexing, automatic generation of secondary information and database construction have become relatively mature at present. Information organization automation results. The modernization of technical means of information organization has changed the traditional manual methods, greatly improved work efficiency and work quality, better met the diverse information needs of users, and is in line with the general trend of information management development.
(4) Theoretical basis of information organization
1. Linguistics
Language is the most important communication symbol system for human beings and the carrier of information. To organize complex and scattered information into an orderly and optimized whole, a symbol system must be established. With this symbol system, the orderly characteristics of the information system can be reflected. Although the forms of various information organization symbol systems are different, they all share the same characteristics as natural language: words, vocabulary, and grammar. This is the linguistic basis of information organization.
2. Logic
Logic is the science of the laws of thinking. Thinking has two forms: image thinking and abstract thinking. Information organization belongs to the scope of abstract thinking and is carried out on the basis of various concepts. Therefore, it must follow scientific thinking methods, such as deductive reasoning and inductive reasoning, comparison, analysis and synthesis, etc. Only when the behavior of information organization conforms to the laws of logical thinking can the optimization and ordering of information organization be ensured.
3. Knowledge taxonomy
Any in-depth information resource organization system requires organizing and revealing the relationships between subject contents. This requires based on the results of existing knowledge classification. Knowledge classification system is a discipline that studies the structure of knowledge systems and is an important basis for processing relationships between topics. Information organization is based on knowledge classification, which can better reveal the relationships and regularities between knowledge in various fields and reflect the progress of knowledge fields in a timely manner.
4.System theory
Systems theory believes that the various elements within the system are not a simple combination, but are interconnected and organically combined. In information organization, people need to organize a large amount of scattered and messy information into a system and establish internal correlations so that the overall function of the information system is greater than the sum of the functions of each information unit. This will give full play to the value and role of information resources.
5. Dissipative structure theory
Dissipative structure theory believes that the internal non-equilibrium of the system is the source of order, and open systems increase and maintain order by exchanging matter and energy with the outside world. Since the information system has the basic characteristics of openness and non-equilibrium such as input, output, multiple cycles and feedback, we believe that the information system is a dissipative structure system. It exchanges information with the outside world, collects information, Information is also output, and dissipative structure theory can be used to guide the development, organization, and utilization of information systems.
6. Synergy Theory
An information system is an open system composed of multiple subsystems. How to establish a synergy mechanism between subsystems to transform the information system from disorder to order is an important content in the research of information organization. Therefore, synergy theory is also an important part of information organization. One of the theoretical foundations.
7. Self-organization theory
Self-organization means that the system can organize, create and evolve on its own without external instructions, that is, it can move from disorder to order autonomously. Any system that can achieve the transition from disorder to order without resorting to external control and maintain a stable and ordered state is called a self-organizing system. The rapid development of information technology has made information systems significantly equipped with the conditions for self-organization. In particular, network information has the openness of a self-organizing system and is far away from equilibrium and non-linear correlation. Therefore, studying information self-organization theory is of great significance to network information organization. Very important theoretical and practical significance.
2. Thoughts and methods of information organization
(1) Classification
1. The concept of taxonomy
Classification: It is a common method for building an orderly information system. It is a method of classifying and systematically organizing and revealing resources according to their content attributes and other characteristics. It uses classification numbers to express various concepts, and classifies and systematically arranges various concepts according to the nature of the subject.
2. Characteristics of information resource classification
⑴It is organized according to the interrelationship of information content characteristics.
⑵ It organizes and reveals information resources from a certain perspective.
⑶ Use certain marking symbols as a sorting tool.
3. Main classification methods at home and abroad
"Dewey Decimal Classification" (DDC), "International Decimal Classification" (UDC), (Chinese Library Classification) ("CLC"), the United States "Library of Congress Classification" (LCC), "Colon Classification" Law (CC)
(2) Subject method
1. The concept of subject law
Theme method: refers to a method of indexing and retrieving information that directly uses words that represent the subject of information resources, provides a word order search method, and mainly uses a reference system to reveal the relationship between words. It is another method of indexing and retrieving information resources from a content perspective in addition to taxonomies.
2. Characteristics of the subject method
⑴Concentrate information resources around specific things, issues, and objects, that is, with the theme as the center.
⑵ Use words directly as information organization and retrieval identifiers.
⑶ Use word order as the main search method.
3. Types of subject law
1) Title method
⑴Basic concepts
①Title method: It is a topic method that uses title words as subject identifiers and indexes and retrieves in a predetermined combination of terms.
②Title words: Also known as titles, they refer to labels used to express the subject of a document through lexical control. Most of them are standardized expressions of the names and characteristics of things discussed in the document content.
③ (Title language: The theme identification system composed of many titles is the title language.)
④Title table: The compilation of title words is called the title table, which consists of three parts: preparation instructions, main table and appendix. The Library of Congress Title List is widely used.
⑵Types of title method
Single-level titles, single-level titles with captions, multi-level titles, inverted titles, etc. (The heading method uses a reference system of headings to show the interrelationships of the concepts they express.)
⑶Advantages of the title method
① Because the sequential relationship between titles is pre-assembled, the pre-assembled titles are used directly during indexing and retrieval, and are less likely to be confused.
②The title method focuses on a thing to gather documents related to the thing. It is suitable for retrieval based on the topic and is easy to find documents about an event.
③Intuitive, easy to master, fast retrieval, easy to reflect new things and small-scale problems, and easy to make additions and modifications.
⑷Defects of the title method
①It is difficult to find all the literature on a subject or something with a certain attribute.
② It is inflexible to use and cannot satisfy multi-channel retrieval.
2) Unit lexicon
⑴Basic concepts
①Unit words: refer to the most basic words that cannot be divided literally, used to index the subject of information resources. The mutual combination or combination of several unit words can form a specific identifier to accurately express the document theme or search topic.
②Unit word method: The unit word is used as the subject identifier, and the information resource theme is expressed through literal combination. Post-group identification is an important feature of this method.
⑵Advantages of unit lexicon
Principles of assembly. It has great flexibility and can provide multiple search methods. It has now further evolved into thesaurus.
3) Thesaurus
⑴Basic concepts
① Descriptors: Also known as subject headings in China, they refer to words and phrases that have been standardized and based on basic concepts to express the theme of the document.
② Thesaurus method: refers to the type of thematic method that selects strictly processed words from natural language as document theme identifiers and expresses the document theme through concept combination.
③Thesaurus: The tool used in the thesaurus method is thesaurus. The most widely used thesaurus in my country is the "Chinese Thesaurus". (Structure: various arrangement tables, indexes, word relationship diagrams)
⑵Characteristics of descriptors
①Intuition: Directly using standardized natural language descriptors as identification symbols, it is highly intuitive.
② Specificity: Select directly from the specific objects and issues of literature discussion and research, and use descriptor combinations to describe the theme. It can be directly selected as a descriptor or expressed through combination according to needs.
③ Strong adaptability: The thesaurus can be added, deleted and modified at any time for new things, new subjects, new concepts and new research topics that are constantly emerging.
④Quick and accurate: Adopt word order arrangement, easy to search, quick and accurate.
⑤ Post-assembly type: Mainly adopts the post-assembly concept assembly method, which has greater flexibility.
⑥Multi-dimensional retrieval: Also called multi-channel retrieval, that is, documents on the same topic can be retrieved from multiple channels.
⑦Network structure: It is mainly reflected in several aspects such as the reference system of the word order index of thesaurus, the category index, the word family index, and the word family diagram of thesaurus in the thesaurus. The compilation and establishment of this network structure reveals the intrinsic connections between things and their concepts in various subject areas and aspects, and strengthens the systematic and ethnic retrieval function of thesaurus.
4) Keyword method
⑴Basic concepts
① Keywords: refer to those words that appear in the title, abstract, and text of a document and have substantial significance for describing the subject content of the document.
②Keyword method: This method extracts substantive keywords used in the original documents that can describe the concept of the document's theme as subject identifiers, without standardization or only a very small amount of standardization, and reveals the documents through rotation and other methods. Theme of.
⑵Types of keyword method
Off-topic keyword index, in-topic keyword index, simple keyword index (double~, word pair type~, simple~)
(3) Classification – Theme Integration
1. Integration of classification themes
The classification method is highly systematic, indexing and organizing information from the perspective of ethnic characteristics, and is in line with people's thinking and retrieval habits; the topic method is relatively intuitive, indexing information from the perspective of characteristics, and providing a direct object-oriented approach. , facts, and concepts information organization methods. The two form an organic combination, learn from each other's strengths and complement each other's weaknesses, and can better meet the different needs of information organization and retrieval.
2. Integrated vocabulary list of classified topics
The integrated thesaurus of classification topics established on the basis of classification tables and thesauruses includes a classification system that arranges and organizes topic concepts according to subject systems and logical systems, and also includes a system of arranging topic concepts in alphabetical order and using a reference system. An organized subject law system, they are two cooperating and independent subsystems in a unified system. (Used to meet the needs of both classification indexing and subject indexing) "Chinese Classification Subject Thesaurus" is a large-scale, comprehensive thesaurus compiled by my country on the basis of absorbing advanced classification subject integration theories and technologies from abroad. Vocabulary list.
3. Information description and disclosure
(1) Information indexing
1. Basic concepts of indexing
Information indexing: refers to the process of analyzing the content attributes and related appearance attributes of information objects, and expressing the analyzed attributes or characteristics in a specific language as the basis for information storage and retrieval. It is developed based on the analysis of document information objects, so it is also called document indexing. The information indexing process generally includes two links: one is topic analysis, and the other is conversion identification.
2. Indexing language
1) The concept of indexing language
Indexing language: It is a concept identification system that expresses the conciseness, unity and relevance of document subject concepts and retrieval requirements. It is an artificial language compiled according to indexing and retrieval needs. In terms of purpose, slogan language is also called search language and index language. Compared with natural language, index language is also called controlled language and specification language.
2) Classification of indexing languages
⑴According to the thinking method of information organization
① Classification language: An indexing language that uses classification numbers to express subject concepts, organizes and arranges subject concepts into a category system based on knowledge classification, and mainly uses the structure of the category system to show the relationship between concepts. Systematization centered on disciplines and majors is its main feature. It can be subdivided into system classification language, combination classification language, and system-combination classification language.
②Theme language: It is an indexing language that directly expresses the theme concepts by controlled natural language words, arranges the theme concepts according to the word order, and mainly uses reference to display the relationship between concepts. Thing-centered immediacy is its main feature. It can be further divided into title language, unit word language, descriptor language, and keyword language.
⑵According to the order of combination time of logos
①Pre-assembled language: refers to an indexing language in which the marks have been assembled during tabulation (before indexing), and there is no need or less need for assembly during indexing and retrieval. For example, the title method in the subject method falls into this category. type.
② Post-grouping language: It refers to an indexing language in which the marks are mainly used for configuration, and the marks are not grouped together during indexing, but are grouped together during retrieval, such as unit lexical method and descriptor method.
③Scattered group language: It refers to an indexing language in which marks are mainly used for combination, and when indexing, several marks that express the theme concept must be grouped together.
⑶According to the specification and degree of control of the label
①Controlled language: refers to the identification vocabulary of information organization and the index vocabulary of information retrieval that have been optimized and standardized before use, and the entire language is often under the management of an authoritative organization or retrieval system. This language is also called a normalized language, as opposed to a natural language. For example, the title language, unit word language, thesaurus language, system classification language, etc. in the subject method are all controlled languages.
②Natural language: refers to the index vocabulary and search vocabulary that come directly from the document itself and have not been optimized and standardized before use, as opposed to controlled language. Such as the keyword language in the subject method.
3. Types of information indexing
⑴The type of information indexing depends on the type of indexing language used to express the document content. Therefore, information indexing is mainly divided into two types: classification indexing and subject indexing.
① Classification indexing: refers to the process of subject analysis of documents, expressing the analyzed topics in a specific classification language, and assigning classification retrieval identifiers (classification numbers) to the documents. Traditionally, classification indexing is called classification, because classification indexing is essentially the classification of documents. (Document classification: refers to the process and method of systematically revealing and organizing documents into categories based on the content of the document and other related attributes, using classification language/taxonomy as a tool.)
② Subject indexing: refers to the process of subject analysis of documents, expressing the analyzed topics in subject language (topic method), and assigning subject search identifiers (title words, descriptors, etc.) to the documents.
⑵According to the order of time when indexing marks are combined, information indexing can also be divided into first group indexing, last group indexing and scattered group indexing; according to the degree of control of indexing marks, it can also be divided into controlled indexing. Indexing and natural language indexing.
(2) Information description rules
Information description: It is the process of describing and recording the characteristics of information entities according to certain rules.
1. Machine Readable Catalog (MARC)
⑴ Concept
MARC: is the abbreviation of machine readable directory. It is a directory that can be recognized and read by computers. (It is a standardized computer-readable format established based on the characteristics of documents and the needs of information exchange between document institutions.)
⑵Features
It has strong scalability and wide adaptability, and has the following main features: ① It includes a total of 001~999 fields, of which 999 fields are user-defined fields; ② The field contents are described in detail, and there are subfields and repeated fields under the fields; ③ Fields The function is enhanced and there are many searchable fields; ④ Each MARC record can be divided into header area, directory area and data area.
⑶Several typical MARC standards
①USMARC (United States Library of Congress Machine Readable Catalog Format)
The predecessor of USMARC is the MARC II format, which is completely consistent with the ISO2709 regulations and only differs in its field identification and content items. Each USMARC record consists of three parts: header area, directory area, and data area. The main fields include: catalog control area, main items, book title publication items, subject additional items, other additional items, etc. Widely used in bibliographic record data.
②UNIMARC (International Machine Readable Catalog Communication Format)
This standard was developed by IFLA in 1976 on the basis of USMARC to unify the machine-readable catalog formats of various countries. It realizes the integration of machine-readable catalog formats of documents in different languages and different carriers, creates conditions for the exchange of machine-readable catalogs between bibliographic agencies in different countries, and promotes the development and revision of machine-readable catalog formats in various countries.
③CNMARC (Chinese Machine Readable Catalog Format)
CNMARC: is the abbreviation of Chinese Machine Readable Catalog and is my country’s common machine catalog communication format. It is used to exchange bibliographic information in a standard computer-readable form between China's national bibliographic agency and other national bibliographic agencies as well as domestic libraries and information departments. It is compiled based on some special circumstances and new changes in rules of UNIMARC and Chinese publications.
2. Functional requirements for bibliographic records (FRBR)
FRBR studies the relationship between the functional requirements of bibliographic records, diverse user needs, and diverse data types, and uses the entity-relationship model (E-R) to build a conceptual model that reveals the bibliographic structure and relationships.
3. Resource Description and Retrieval (RDA)
RDA is a standard for description cataloging. RDA is based on the Functional Requirements for Bibliographic Records (FRBR) and the Functional Requirements for Authority Data (FRAD) published by IFLA. As the successor of the Anglo-American Cataloging Rules (Second Edition) (AACRⅡ), RDA provides a set of Guidelines and instructions on how to develop data to support resource discovery, covering all types of content and media.
4. Bibliographic Framework (BIBFRAME)
BIBFRAME is the abbreviation of Bibliographic Framework. It is a bibliographic description data model developed by the Library of Congress (LC) based on associated data rules to replace the MARC format. 2.0 includes three core categories: creative works, instances, and single items, as well as key concepts such as agents, themes, and events.
(3) Metadata
1. Metadata
⑴The concept of metadata
Metadata: is data about data, information about information, or data that describes data. It is specially used to describe the characteristics and attributes of data. It is also a language and tool for describing and organizing information resources and discovering information resources. Metadata is information that organizes data, various data domains, and their interrelationships.
Types of metadata include: element name, element description, element representation, element code, element semantics and element classification.
⑵The role of metadata
⑴Description. The most basic function of metadata is to describe the content, characteristics and location of information objects, thereby laying the necessary foundation for the access and utilization of information objects.
⑵ Positioning. Metadata contains information about the location of network information resources, which can determine the location of resources, promote the discovery and retrieval of information objects in the network environment, and improve the accuracy and recall rate of information resources.
⑶Search. In the process of description, important contents in information objects are extracted and organized, given semantics, established connections between data, and pointed out the addresses and access methods of relevant data, thereby helping users identify the value of resources and discover what they really need. information resource.
⑷Evaluation. Metadata provides basic attributes such as the name, age, format, and producer of the information object, allowing users to have a basic understanding of the information object and judge the use value of the information resource without browsing the information object itself. As a reference for access and utilization.
⑸Select. Based on the description information provided by the metadata, referring to the corresponding evaluation standards, and considering the use environment, the user makes a decision on the information object and selects the resources suitable for the user.
2. Dublin Core Set (DC)
⑴ Concept
Berlin Core Set (Dublin Core for short). It is a universal and simple metadata. In March 1995, the First Metadata Workshop was hosted by the Online Computer Library Center (OCLC) and the National Center for Supercomputing Applications (NCSA) in Dublin, Ohio, USA. The conference finalized an element set containing 13 data units, the Dublin Core Element Set. Later it was increased to 15 units.
⑵Category
① Resource content description elements: title, theme, description, source, language, association, coverage.
② Resource intellectual property description elements: creators, publishers, other responsible persons, and authority management.
③Resource external attribute description elements: date, type, format, resource identifier.
⑶Advantages of Dublin Core
① Simple and easy to understand: 15 mark elements are easy to understand.
②Flexible use: It can be used in both standardized resource description fields and non-professional fields.
③Internationally applicable: There are versions in more than 20 different languages, including German, Japanese, Portuguese, and Spanish.
④ Modifiable: All its elements can be reused, which solves the problem of describing repeated elements such as multiple authors and multiple publications. For materials that require detailed description, DC modifiers are introduced.
⑤Extensibility: Not only the 15 mark elements and sub-elements can be expanded, DC can also be connected with other metadata elements.
3. Other metadata specifications
VRA core categories, PICS, Web Collections, CDF channel definition format, MCF metadata framework, RDF resource description framework
(4) Markup languages and related standards
1. Markup language
⑴General Markup Language Standard (SGML)
SGML: It is a meta-language, a language used to describe markup language, and is suitable for electronic document exchange, document management and document publishing. SGML describes documents from two levels: structure and content. Its core is the document type definition DTD.
⑵Hypertext Markup Language (HTML)
HTML: It is a specific application implementation of simplified SGML DTD. HTML provides a text structure and format that enables it to be rendered on a browser to users who visit it. It is a common language on the Web. It can easily create web pages, establish links, and transform data information from linear organization to network organization.
⑶Extensible Markup Language (XML)
XML: It is also a simplified subset of SGML. It combines the rich functions of SGML with the ease of use of HTML into Web applications. It defines the data structure in an open self-describing way, while describing the data content. It can highlight the description of the structure to reflect the relationship between data.
2. Resource Description Framework (RDF)
RDF: Using Extensible Markup Language (XML), a subset of the Standard General Markup Language (SGML), it is a description framework that can be understood by both humans and machines. It provides a powerful mechanism for expressing, exchanging and utilizing metadata, enabling interoperability between various metadata systems.
3. Simple Knowledge Organization System (SKOS)
It is a recommended standard for the conceptual framework representation of knowledge organization systems published by the World Wide Web Consortium (W3C). Provides a simple and feasible standard description scheme for describing knowledge organization systems. It has the characteristics of simplicity, versatility, and easy expansion. SKOS is mainly used to represent various relatively simple network knowledge organization systems, such as thesaurus, taxonomy, subject thesaurus, glossary and other types of conceptual frameworks.
4. Network Information Organization
(1) Characteristics of network information organization
In the information network environment, with the development of information volume, information type, information form and communication speed, new challenges have been brought to the orderly organization of information, and network information organization also shows new characteristics:
1. The accelerating speed of massive network information and information production and dissemination poses challenges to traditional information processing methods. Information organization technology has evolved from manual to semi-automatic, automated, and intelligent.
2. The types of information are increasing, the components of the database are becoming more complex, and the information organization methods that were mainly applicable to text information in the past have become less applicable. In addition to text information, network information also contains a large amount of non-text information such as pictures, sounds, videos, etc., and the proportion of full-text information is also gradually increasing.
3. Different from previous information users, the information behavior of network information users has special needs for information organization. They require information organization methods to be transparent and easy to use. They hope that information dissemination is fast and convenient, and the organization method is simple and practical.
4. The non-linear, interactive and dynamic nature of network information requires timely and flexible information organization methods. Multimedia and hyperlinks have made network information develop from linear static resources to non-linear dynamic resources, requiring new information organization methods to adapt to this change.
(5. Network information resource sharing requires standardization and compatibility of information organization)
(2) Classification organization of network information
Classification organization is a relatively mature information organization method. Currently, information classification and organization on the Internet are mainly carried out in two ways: one is to follow the traditional document classification method, mainly focusing on academic resources; the other is to use a self-compiled classification system, which is more commonly used on portal websites.
1. Application of traditional document classification methods
⑴DDC
DDC's strict hierarchical system has very good practicality. It can be used to organize network information, and its hierarchical structure, conceptual system and category relationships can also be used for automatic indexing of network information. DDC, LCC, UDC, etc. are all used to classify and organize network information to varying degrees.
⑵UDC
It consists of more than 60,000 categories and provides a variety of compound tables. It has unique advantages in meeting the subdivision of literature materials. It has some characteristics of the faceted combination classification method. Combinations of concepts can be obtained through symbol combination. Therefore, this classification method is more suitable for machine inspection.
2. Application of self-compiled classification system
⑴ Due to the complexity of network information, the classification and organization model of network information is more diverse. Some websites adopt a multi-dimensional classification model to meet the needs of network information organization and retrieval, specifically represented by website classification directories (indexes) and web page classification indexes.
⑵Characteristics of the network’s self-compiled classification system (Compared to traditional document classification)
① Pay attention to setting categories centered on things.
Different from the traditional document classification system, although the network classification system does not completely give up organizing network resources from the perspective of disciplines, most of them set categories directly based on the things being retrieved based on the convenience of users. The categories set from the perspective of disciplines are very few.
②The category collection range is broad.
When the category system is expanded, its lower categories not only include the subordinate categories of the category, but also include various related categories, which greatly increases the number of lower categories of some categories, which is much higher than the same level categories in the traditional document classification method. quantity.
③Multiple column categories and repeated reflections.
In order to facilitate users' information search, this type of network classification system generally attaches great importance to combining network display characteristics and uses multiple column categories to repeatedly reflect network information so that it can be used from different angles.
④The arrangement is simple.
There are two main ways to arrange homogeneous classes. The first is the alphabetical order. This arrangement cannot reveal the content connection between the same categories, but it can facilitate users to search for specific categories, and it can also help simplify the addition of new categories. Second, the search frequency determines the order of homogeneous categories. High-frequency categories are listed first among homogeneous categories to highlight popular topics, which is used by most users.
⑤Organize information directly with words.
These systems abandon the traditional classification method of using mark symbols to express document themes, directly express the category system with words, and use link technology to connect online documents, which is more convenient, intuitive, and easier to understand than using classification tags.
⑥Update quickly.
Since the network classification system is connected to network documents through links, the adjustment of categories is less constrained by the original system. New categories can be added in a timely manner or relevant categories can be repeatedly reflected according to the development and changes of network resources, which often has greater dynamism.
(3) Thematic organization of network information
The topic method is another main way to organize network information. It can be divided into two main forms: one is to use existing vocabulary (such as thesaurus, that is, subject vocabulary), which is mainly used in subject-specific databases; The second is to use the keyword method, which is more commonly used in search engines and other websites.
1. Application of thesaurus
⑴Thesaurus, also known as thesaurus, is a vocabulary management tool for thesaurus method. It is an index vocabulary that provides standardized thesaurus and its semantic relationship information for users who use thesaurus method to index and retrieve information. Online display of networked thesauruses is one of the main forms of application of thesauruses in the network environment, including the digitization of original printed thesauruses and digitized thesauruses compiled and produced in a digital environment.
⑵According to function and usefulness, thesauri in the network environment can be divided into two types: ① independent thesaurus; ② thesauri integrated in a database or information retrieval system.
2. Application of keyword method
1. Due to the huge number of network resources, the use of standardized languages will greatly increase the cost of information organization and reduce the efficiency of text processing. The automatic word processing technology of network information directly extracts keywords from the text title or body as information organization identifiers. Compared with thesaurus, keywords are an uncontrolled or less controlled standard language. Therefore, keyword technology is widely used in information organization in Internet search engines and other information systems.
2. Automatic indexing software in Internet search engines such as Robot and Spider all use keyword technology. Search engines such as Google, Baidu, and Bing that we are familiar with use the keyword method to organize network information. In addition, some literature databases on the Internet also widely use word tag systems, allowing users to browse literature by category while also browsing from Search entries such as title, author, article title, etc. use keywords as search identifiers to search.
(4) Network information organization method
Information organization method is a specific form of organizing information using various information technologies and information organization methods. Currently, the most commonly used network information organization methods include file method, database method, topic tree method, hypermedia method, etc.
1.File method
⑴ Concept
A file is a collection of orderly organized data and is the basic unit for computers to save processing results. The computer has a complete set of file processing technologies and methods that can realize "access by file name". The file management program can automatically complete the data transfer operation according to the file name given by the user. The function of the file transfer protocol FTP that we are familiar with is to transfer various types of text and non-text files to users through the network.
⑵Advantages:
①Simple and convenient. Computers have a complete set of file processing technologies and methods that can be easily utilized when organizing network information resources.
②It is a natural unit for storing unstructured information. Unstructured information such as graphics, images, audio, and video can be easily managed using the file system.
⑶Disadvantages:
① It is difficult to balance the relationship between network load and the number of files.
②Appear weak to structured information organization.
③With the rapid increase of information resources saved and managed in the form of files, the files themselves can be regarded as information units and also need to be managed as objects. Therefore, the file method is often used as an auxiliary form for network information resource management and as a supplement to other information organization methods.
2.Database method
⑴ Concept
The database organization method is to store all obtained network information resources in a fixed record format. By querying keywords and their combinations, users can find the required information clues (that is, related site links), and directly use the information clues to Connect to appropriate network information resources.
⑵Advantages
① This organizational method uses data models to standardize information, which greatly improves the flexibility of data operations and reduces the load of network transmission.
②The development of multimedia databases and semi-structured databases provides a good way to organize unstructured information and facilitate the full sharing and effective dissemination of Internet information resources.
⑶Disadvantages
① It is more difficult to process unstructured information. This method is difficult to organize the increasing amount of unstructured information such as multimedia information in the network environment.
② Unable to provide knowledge correlation between data information. The database organization method is based on pragmatic organization, and it is difficult to display the semantic association between the knowledge content of information units.
③Higher requirements are put forward for both users and database servers. Users are required to master certain retrieval skills, including the selection of keywords and their combination methods. At the same time, on the database server, there are also higher requirements on how to automatically expand the database and how to provide good interpersonal interaction.
3. Topic tree method
⑴ Concept
The topic tree organization method is to organize information resources layer by layer according to a certain predetermined conceptual system structure. Users can select layer by layer through browsing and traverse layer by layer until they find the required information clues (i.e. related information). site link), and directly find the corresponding network information resources through information clues. Some well-known Internet search tools, such as Yahoo! InfoSeek and others organize information resources in this way.
⑵Advantages
① Simple and easy to use, it shields users from the complexity of the network information resource system, and provides a simple and easy-to-use network information retrieval and utilization page based on tree browsing.
②Information retrieval is based on a certain category classification system, viewed one by one, and searched according to pictures. For users, it is highly targeted and has a high accuracy rate.
③Use a tree directory structure to organize information resources, which has good scalability and strict systematization.
⑶Disadvantages
① A complete category system must be established in advance. However, this architecture is difficult to include all network information resources, and also artificially increases the user's intellectual burden.
② To ensure the structural clarity and resource availability of the topic tree, the structure of the topic category system cannot be too complex, which reduces the number of network information resources it can accommodate. Therefore, this organizational method is not suitable for building a comprehensive large-scale network. resource system.
③Theme directory tree is a manual resource organization method. Therefore, it cannot keep up with the growth rate of network information in terms of management and maintenance. Its collection scope is not comprehensive enough, and its novelty and timeliness are relatively poor.
4.Hypermedia method
⑴ Concept
Hypermedia technology is a combination of hypertext technology and multimedia technology. It organizes text, charts, sounds, images, videos and other multimedia information in a hypertext manner, allowing people to freely browse various information libraries through highly linked network structures. Navigate and find the information you need in any media.
⑵Advantages
① It organizes information in a non-linear way that conforms to people's leaping thinking habits, and has good inclusiveness and scalability.
② It transcends the restrictions of media type on information organization and retrieval, realizes the search method of link browsing, and avoids the complexity of retrieval language.
⑶Disadvantages
Using browsing to search for information, when the hypermedia network is too large, it is difficult to locate quickly and accurately, and it is difficult to save all historical records during the browsing process, making it difficult to avoid the so-called "wandering" phenomenon.
(5) Network information organization based on crowdsourcing
⑴Definition of crowdsourcing
The practice of a company or organization outsourcing work tasks formerly performed by employees on a free and voluntary basis to an unspecified (and often large) mass network. Crowdsourcing integrates computers and the unknown masses on the Internet to complete tasks that are difficult for machines to handle alone.
⑵Technology based on crowdsourcing ideas
①Wiki: It is a hypertext system that is open on the Internet and allows for collaborative creation by many people.
②Tag: Tag is also called tag technology, which is used to describe the classification identification of information content. Generally, a tag is a natural language vocabulary. Each label can be created and modified voluntarily by the user, and the label is shareable, and users can access the information marked by the label. (With the application of information self-organizing technologies such as tags, Folksnomy began to appear and gradually became a new network information classification and organization method that cannot be ignored.)
5. Knowledge Organization
Knowledge organization is an advanced form of information organization and the development direction of information organization.
(1) The meaning and characteristics of knowledge organization
Knowledge organization: It is the process or behavior of revealing knowledge units (including explicit knowledge factors and tacit knowledge factors), mining knowledge associations, and providing users with effective knowledge or information most quickly.
(2) Knowledge representation
Knowledge representation: refers to expressing the knowledge factors and knowledge associations in the knowledge object so that people can identify and understand the knowledge. Knowledge representation is the foundation and premise of knowledge organization, and any knowledge organization method must be based on knowledge representation. There are two types of knowledge representation: subjective knowledge representation and objective knowledge representation.
⑴Representation of subjective knowledge
Knowledge representation methods of expert systems: logical representation, production rule representation, semantic network representation, framework representation, and object-oriented knowledge representation.
⑵Representation of objective knowledge
Representation of literature knowledge: classification indexing method and subject indexing method.
(3) Knowledge organization from a semantic perspective (Techniques and methods of knowledge organization)
1. Semantic Web
⑴Meaning
Semantic Web: It is the vision of the second stage of the World Wide Web. It is not another independent World Wide Web, but an extension of the World Wide Web. It is a distributed World Wide Web composed of a large amount of machine-understandable data. In this semantic network, the relationship between data is expressed through some terms, which in turn form a complex network connection. The computer can obtain the meaning of the data through these terms, and can apply logic to this connection. Perform reasoning to complete some tasks that cannot be completed directly.
⑵The core composition of the information organization mechanism of the Semantic Web
①The information description module uses XML and RDF as means to manage information and has some semantics;
②The semantic annotation module using ontology as a means to provide "meaning" and realize semantic interoperability between machines.
2.Ontology
⑴Meaning
Ontology: It is a clear formal specification of a shared conceptual model. Specifically, in the field of network information organization, ontology is a set of words and terms that express knowledge in a certain field. The compiler organizes these words and terms into hierarchical categories according to the structure of the knowledge field and stipulates the categories. characteristics and the relationships between them.
⑵Characteristics of ontology-based network information organization
① Directly embody semantics. ②Distributed sharing. ③Multi-dimensional and networked information organization method. ④Support for reasoning.
3. Linked data
Linked data: It is a method of publishing data on the World Wide Web proposed by the inventor of the World Wide Web, Tim Berners-Lee, and is an implementation method of the Semantic Web. It generally requires the adoption of the RDF data model, the use of Uniform Resource Identifiers (URIs) to name data entities, and the publication and deployment of sample data and class data so that these data can be exposed and obtained through the HTTP protocol.
4. Knowledge graph
⑴Meaning
Knowledge graph: It is a graph-based data structure composed of nodes and edges between nodes. A knowledge graph is essentially a semantic network in which nodes represent entities or concepts and edges represent various semantic relationships between entities or concepts.
⑵Function
As a new type of knowledge storage structure, knowledge graph can display complex knowledge fields through data mining, information processing, knowledge measurement and graphics rendering. The visual structure and network relationships can reveal the dynamic development rules of the knowledge field. Provide practical and valuable reference for the subject to be studied.
⑶Classification
Linguistic knowledge graph, factual knowledge graph, domain knowledge graph, and knowledge graph automatically constructed by machines.
Chapter 6 Information Retrieval
1. Basic concepts and principles of information retrieval
(1) The meaning of information retrieval
Information retrieval: refers to the entire process of quickly and accurately finding the information required by users from numerous information collections. Information retrieval in a broad sense is also called information storage and retrieval, which includes two aspects: one is the storage of information, that is, organizing and storing information in a certain way; the other is information retrieval, that is, finding out relevant information according to the needs of the user. information process. Information retrieval in a narrow sense only refers to the second half of the process, that is, the entire process of finding the required information from an information collection.
(2) Types and characteristics of information retrieval
1. Types of information retrieval
⑴Divided according to the content of the search object
①Literature search: refers to a search with the goal of finding relevant literature on a certain topic. Document retrieval is a correlation retrieval rather than a deterministic retrieval, and its search objects are various types of documents containing specific information. Literature retrieval includes: full-text information retrieval and secondary literature information retrieval such as catalogs, bibliographies, indexes, and abstracts.
②Data retrieval: refers to retrieval targeting specific numerical data. Data retrieval is a deterministic retrieval, that is, the retrieval system directly provides the exact data required by the user, and the retrieval results are generally deterministic. Fact retrieval refers to retrieval targeting specific facts.
③Fact retrieval: It is also a deterministic retrieval. This retrieval includes not only the retrieval, operation, and derivation of numerical data, but also the retrieval, comparison, and logical judgment of facts, concepts, etc.
⑵Divided by search method
① Manual information retrieval: refers to the use of printed retrieval tools to conduct information retrieval manually. Its advantages are: intuitive, flexible, and easy to control the accuracy of retrieval; disadvantages are: the search process is complex, the retrieval speed is slow, and the search workload is large.
②Computer retrieval: It is the process of converting information and its retrieval identifiers into a binary encoding form that can be read and processed by computers, and storing them in a database system. The computer searches and outputs the digitized information according to the designed program: computer retrieval It greatly improves the retrieval efficiency and comprehensiveness, broadens the field of information retrieval, and enriches the research content of information retrieval. It can be further divided into offline retrieval, online retrieval, CD retrieval and network retrieval.
⑶Divided according to search requirements
①Feature retrieval: Also called strong correlation retrieval, it emphasizes providing users with highly relevant information. This type of retrieval emphasizes the accuracy of the retrieval, as long as the retrieved literature information can meet the needs of the user, and usually there is no requirement for the number of retrieval results.
② Family retrieval: Also called weak correlation retrieval, it emphasizes providing users with complete system information. This type of search focuses on the comprehensiveness of the search and requires retrieving all information on a specific topic within a period of time. In order to avoid missing relevant information as much as possible, the accuracy of the retrieval is relatively low.
⑷Divided by search time span
① Set-topic retrieval (SDI): Based on the content of the user's search topic and retrieval needs, the retrieval question form is formulated and the formulated retrieval question form is stored in the retrieval system in advance, and the information in the retrieval system is queried regularly . The characteristics of fixed-topic retrieval are: only the latest information is retrieved, and the retrieval time span is small. This retrieval mode is very suitable for information tracking and makes it easy to keep abreast of the latest developments in relevant subject areas.
②Retrospective search (RS): Also called retrospective search, it is a search for information about a specific topic in the past period of time, and the search results are provided to the user at one time, so that the user can fully understand a certain topic in a certain period with a single search. development over time. The characteristics of retroactive retrieval are: it can not only search for specific subject information in a certain period of time in the past, but also can find recent specific subject information. Unlike fixed-title searches, each backtracking search is generally run only once.
⑸Divided according to the information expression form of the search object
① Text retrieval: It is a search for text documents containing specific information. The search results reflect the documents with specific information in text form. This is a traditional type of information retrieval that still occupies a dominant position in information retrieval today.
(Full-text retrieval: For the entire article or even the entire book stored in the retrieval system, you can obtain relevant chapters, paragraphs, sentences, sections and other information according to your own needs, and you can also perform various frequency statistics and content analysis.)
② Multimedia retrieval: It is the process of organizing and storing text, sound, graphics, images and other media information according to the user's needs, so as to identify, find and obtain relevant information.
③Hypertext retrieval: Hypertext is a non-linear text structure formed by connecting many text information through hyperlinks. Hypertext retrieval emphasizes the semantic connection structure between central nodes, relying on the complex tools provided by the system for graphical navigation and node display, and providing browsing queries.
④Hypermedia retrieval: It is a supplement to hypertext retrieval. Its storage objects exceed the scope of text and incorporate a variety of media information such as graphics, images, and sounds. The information storage structure has developed from single-dimensional to multi-dimensional, and the scope of storage space has also continued to expand. .
2. Characteristics of information retrieval
⑴Relevance of information retrieval
The so-called relevance refers to the degree of consistency between a text specified in information retrieval and another text representing an information question. Domestic information circles believe that correlation indicates whether the user thinks a document is consistent with a question. Relevance problems widely exist in information retrieval.
⑵Uncertainty of information retrieval
Uncertainty exists both in indexing and in the selection of search terms. ①The uncertainty of indexing means that different indexers will use different indexing words when indexing the same information object, that is, the inconsistency in the selection of indexing words. ② The uncertainty in the selection of search terms means that there is not just one set of candidate search terms, but multiple ones, and the search process is heuristic. The above two uncertainties act on the retrieval system, making information retrieval uncertain.
⑶The logic of information retrieval
Information retrieval, as the core of information management, has a very strong logic. ① In terms of retrieval language, the retrieval word list serves as the reference text of the retrieval language, and its own arrangement has strong logic. ②The logic is more obvious in the research of retrieval strategies. The so-called retrieval strategy refers to the scientific arrangement of the logic and search steps for processing information retrieval questions. Correct search strategies optimize the search process, help achieve the best search results, and obtain highly relevant documents.
(3) Basic principles of information retrieval
The basic principles of information retrieval can be summarized as: matching and selection of information resource collections and information demand collections.
⑴Information resource collection
Information resource collection refers to a collection of information related to a certain field that has been selectively collected, organized and processed. In order to ensure the speed and efficiency of information retrieval, it is necessary to perform some formal processing on the collection of information resources to form their characteristic representation, that is, to analyze and index them so that the original hidden and difficult-to-identify features are displayed. Personalize it and obtain corresponding identification (such as classification number, subject heading, etc.). Storing these analyzed and extracted features and their identifiers forms an index database, which becomes the basis and standard for organizing and searching for information resources.
⑵Information needs collection
The collection of information needs of many users in different forms forms a collection of information needs. The information requirements put forward by users also need to be characterized, that is, the content of the requirements is analyzed, the subject concepts or other attributes are extracted, and the same identification system (i.e., retrieval language) as the information resource collection is used to represent the concepts contained in the requirements. and attributes, so as to obtain the characteristic representation of user needs - question-formation.
⑶ Matching and selection
In order to quickly obtain the information and knowledge required by users from a collection of information resources, information retrieval is required to provide a "matching" mechanism. The main function of this mechanism is to compare and judge the information demand set and the information resource set based on a certain similarity standard, and then select information that meets the user's needs. On the basis of the characteristic representation of the information resource collection and the combination of information requirements, the matching between them is simplified to the matching between the question and the established ordered index library.
(4) Information retrieval model
⑴Boolean logic retrieval model
The Boolean retrieval model uses Boolean algebra and set theory methods, uses Boolean expressions to express user questions, and retrieves documents through logical operations on document identifiers and question formulas. The most commonly used logical operators are Boolean logical operators. The main logical operators are logical "AND", logical "OR", and logical "NOT". They are AND (or *), OR (or), NOT (-) respectively. express.
Advantages: There are fewer logical operators, and the question structure is simple and easy to modify; Disadvantages: There is no weight difference between keywords in the search, the search results are not ranked by importance, the recall rate is difficult to control, and the user's semantic extraction ability is required to be high.
⑵Vector space model
The vector space retrieval model is a new type of retrieval model constructed using linear algebra theory and methods. Its basic premise is to regard both retrieval documents and retrieval questions as a set of numerical vectors. These values form a space vector graph, thus converting the document and question matching process in information retrieval into document vectors and questions in vector space. Vector similarity calculation problem. The relevance of a certain document to a certain question is determined by retrieving the similarity between the vector pairs.
The probabilistic model is an information retrieval model that is simple to implement and has good effects. Its basic idea is: given a user asking a question, there is an ideal result set in the retrieval system that only contains documents related to the question, denoted as R. If we can know the characteristics and description of the set R, we can find all relevant documents and exclude all irrelevant documents.
⑷Fuzzy retrieval model
Fuzzy retrieval treats documents as questions related to a certain extent. For each indexing word, there is a fuzzy collection of documents related to it. For a given indexing word, each document is represented by a membership function. The degree of correlation with the word, that is, the membership degree, takes a value on [0, 1], 0 means not relevant, 1 means completely relevant, the larger the value, the higher the correlation.
2. The development history of information retrieval
(1) Manual information retrieval stage (1876-1945)
1. The origin and development of manual information retrieval
⑴Information retrieval originated from document indexing and library reference work. The symbol of the emergence of reference work was the first conference of the American Library Association held in 1876. By the 1940s, "information retrieval" developed into an independent user service work, and gradually developed from simple experience work to professionalization.
⑵ During this stage of development, manual information retrieval activities have achieved fruitful results and laid a solid foundation for the development of computerized information retrieval in the next period. These achievements are mainly reflected in: ① A large number of high-quality abstract retrieval tools have been compiled and put into use. For example, "Engineering Index", "Science Abstracts", "Chemical Abstracts", etc.; ② The creation and testing activities of search languages are very active, and classification languages such as DDC and UDC have appeared; in terms of topics, grouped title languages first, and then grouped Search languages such as formula unit language, descriptor language, and citation language have emerged one after another.
2.Techniques and methods of manual information retrieval
⑴Search tools: catalogs, indexes, abstracts, yearbooks, manuals, encyclopedias
⑶Search method: forward search method, reverse search method, spot check method, retrospective method, circular method
The characteristics of manual information retrieval are: simple operation, low cost, high accuracy, but low efficiency, and the recall rate cannot be guaranteed.
(2) Mechanical information retrieval stage (1945-1954)
1. The origin and development of mechanical information retrieval
⑴ The mechanical information retrieval system started in the 1950s. A mechanical system that uses various mechanical devices to perform information retrieval is a transitional stage from manual retrieval to modern information retrieval. Mechanical retrieval mainly includes two basic types: ① Electromechanical information retrieval system: an information retrieval system that uses electromechanical equipment such as punching machines, hole inspection machines, and sorting machines to record secondary documents, and uses brushes as retrieval elements. ② Photoelectric information retrieval system: A system that uses microphotography to record secondary documents and uses photoelectric retrieval elements to search for documents.
⑵ Features: The mechanical information retrieval system used the advanced mechanical devices at the time to improve the storage and retrieval of information and promote the automation of information retrieval; however, it did not develop an information retrieval language, and relied too much on equipment, making the retrieval complex and costly. , the retrieval efficiency and quality are not ideal. It was quickly replaced by rapidly developing computer retrieval systems.
2. Technology and methods of mechanical information retrieval
⑴ Mechanical and electrical information retrieval system: Mechanical information retrieval was initially developed gradually from simple punched cards. Following the manual inspection of punched cards, machine inspection of punched cards and card selection machines appeared, which formed an electromechanical information retrieval system.
⑵ Optoelectronic information retrieval system: Optoelectronic information retrieval system mainly appears in the form of microfilm retrieval, which can be roughly divided into two types: addressing retrieval method and coding retrieval method.
(3) Computer information retrieval stage (1954-1991)
1. The origin and development of computer information retrieval
With the advent of computers in 1946, information retrieval entered the computer retrieval stage. The development of computer information systems has gone through different development stages such as early offline batch retrieval, later online real-time retrieval, and more powerful online networked information retrieval.
⑴ Offline batch retrieval (1954-1965): In the early days of development, computer technology was still relatively backward, and data storage and data communication capabilities were relatively poor. The basic feature of information retrieval was "offline batch processing", which is represented by: input into the computer Almost all information is stored on magnetic tapes, and retrieval questions are stored on punched paper tapes or punched cards. Retrieval results cannot be obtained immediately and must wait for batch or regular retrieval processing.
⑵Online real-time retrieval (1965-1975): In the mid-to-late 1960s, with the emergence of the third generation of computers - integrated circuit computers, the advent and use of high-density mass random access memories - disks and disk drives, information Search has entered the era of human-computer conversational online real-time search. The performance is as follows: the host computer of the information retrieval center is connected to the long-distance retrieval terminal via a telephone line to form an online real-time retrieval system. Users can communicate with the retrieval system through the retrieval terminal and can obtain the retrieval results instantly on the terminal screen.
⑶Online networked retrieval (1975-1991): In the mid-to-late 1970s, large-scale integrated circuit and very large-scale integrated circuit technology, database technology, especially modern communication technology based on packet switching networks, etc., brought online retrieval into the field. Online network development stage.
The main characteristics of the computer information retrieval period are: ① Various machine-readable databases are used as search objects; ② Various information institutes and online service centers exist as emerging information service departments; ③ Information retrieval users gradually shift from professional search personnel to personal terminals User transfer and diffusion, the social popularity of information retrieval is increasing day by day.
2. Technology and methods of computer information retrieval
⑴Online information retrieval
Online information retrieval refers to information users using retrieval terminal equipment, using some specific instructions and retrieval words, to conduct human-computer dialogue online with an information retrieval system that stores a large amount of information through a communication network, and to find the information the user needs from the database of the retrieval system. the whole process.
Advantages: ① Fast retrieval speed; ② Wide and comprehensive retrieval scope; ③ Multiple retrieval methods and high quality; ④ New retrieval content and strong real-time nature; ⑤ Complete retrieval auxiliary functions, easy to use, and flexible and practical search result output methods.
Disadvantages: ① The host has a heavy burden, and once it fails, the entire network will be paralyzed; ② The information organization method is mainly linear and not flexible enough; ③ Online retrieval is not oriented to end users like the Internet, and the operation is not as convenient as the latter.
⑵ Optical disc information retrieval: Optical disc is a storage carrier that uses laser beams to record and reproduce information after paper, microfilm, and magnetic storage. The optical disc used for retrieval and reading is usually a read-only optical disc. It is an information carrier, and retrieval and utilization of the information in it requires the cooperation of a computer.
Features: ① The use of the optical disc retrieval system can avoid the use of telecommunications equipment and errors caused by communication line transmission; ② The optical disc system provides users with software equivalent to the functions of the online information retrieval system, and provides menu drivers and There are two command-driven methods; ③ CDs have large storage capacity, are durable, and have low copying costs; ④ can combine text, graphics, images, sounds and dynamic images; ⑤ If the CD database is not large enough, the information resources will be limited. Purchasing large quantities of databases is subject to financial constraints; ⑥ In terms of timeliness of information needs, CD-ROM retrieval is not as good as online retrieval systems, because CD-ROMs can only be provided regularly; ⑦ Database costs are high.
(4) Network information retrieval stage (1991-):
1. The origin and development of network information retrieval
After the 1990s, marked by the emergence of the Internet, most systems adopted distributed network management, and information processing changed from traditional models to new models. These changes have caused the mainstream platform of information retrieval to quickly shift to a network application environment with the Web as the core. On the one hand, the emerging opportunity Web search engine system is increasingly widely used; on the other hand, traditional online retrieval systems, various database retrieval systems and their information service businesses are gradually expanding and shifting to distributed network structure characteristics. On the Web platform, a WWW site is opened for network users and provides rich functional services.
2. Technology and methods of network information retrieval
The characteristics of network information retrieval: ① The amount of information is larger and different languages need to be processed; ② The scope of information retrieval is wider (multi-disciplinary, multi-field); ③ The timeliness requirements of information query are higher; ④ The search results are The accuracy rate is higher and the accuracy rate is lower.
3. Information retrieval steps and strategies
(1) Steps of information retrieval
1. Analyze and study information retrieval topics
Analyzing and studying information retrieval topics is the fundamental starting point for determining information retrieval strategies, and is also the key to retrieval efficiency and success.
Its main contents include: ① Clarify the fields and scope involved in the information retrieval topic; ② Clarify the content of the required information and its content characteristics; ③ Clarify the type of information required, including document media, publication type, amount of documents required, and age. Scope, languages involved, relevant authors and institutions, etc.
2. Select information retrieval tools
Information retrieval tools are cards, tables, computer information systems and specific publications that are processed and compiled by people to report, reveal, store and find information in order to fully, accurately and effectively utilize existing information resources. There are many types of information retrieval tools of all kinds. You only need to choose those high-quality information retrieval tools that are relevant to the topic and meet the time requirements.
Specific types of information retrieval tools can be divided into two categories according to their uses: ① Indicative clue-type information retrieval tools (secondary literature), including bibliographies, collection catalogs, index abstracts, reference book guides, and dictionaries. ② Reference books (tertiary documents) that provide specific information, including encyclopedias, biographical materials, manuals, institutional directories, geographical data, statistical data, yearbooks, and government documents. Currently, most search tools exist in computer information systems or are distributed on the Internet.
3. Determine information retrieval methods
The method used to accomplish any given task is important, and information retrieval is no exception. Commonly used information retrieval methods include forward search method, reverse search method, spot check method, retrospective method, circular method, etc. Each information retrieval method has its own characteristics. In practice, it can be selected or used in combination according to the information retrieval requirements. Complete information retrieval tasks quickly and accurately to achieve expected goals.
4. Grasp the clues to obtain original information
After analyzing and studying the information retrieval questions, determining the search scope, and selecting retrieval tools and methods, information retrieval can be implemented. When obtaining information clues, read carefully to determine whether the detected information meets the retrieval requirements. If the detected information meets the requirements, it is necessary to record the relevant characteristics of the information material, such as the title of the article, the author and work unit, the source of the information, etc., in order to find the original information.
5. Get original information
Obtaining original information is the last step of information retrieval and is very important to achieve the ultimate goal of information retrieval.
Its main tasks include: ① Determining the publication type of documents; ② Organizing the source of documents; ③ Searching the collection catalog or joint catalog in libraries or information institutions to determine the collection according to the publication type. ④ Obtain original information from as many channels and methods as possible.
(2) Information retrieval strategy
Information retrieval strategy: It is an information retrieval plan designed based on retrieval questions and using retrieval methods and technologies. Its purpose is to achieve a certain recall rate and precision rate.
In a broad sense: it refers to the process of selecting a search system and search tools, clarifying search approaches and methods, determining the logical relationship between search terms and the best solution for search steps on the basis of analyzing the substantive content of the search topic and clarifying the search goals. Series scientific arrangement.
In a narrow sense: it refers to the construction of retrieval question expressions, that is, using the specific retrieval technology of the retrieval system to determine the logical relationship between search terms to form a retrieval question expression that expresses the user's information needs.
From the perspective of specific operations, the process of information retrieval strategy formulation corresponds to the steps of information retrieval. The steps of information retrieval indicate what is to be done, while the formulation of information retrieval strategy is to determine the specific issues in each step. And choose optimized solutions to achieve excellent retrieval results.
(3) Information retrieval language
Information retrieval language: also known as indexing language, index language, etc., is a specialized language developed in response to the common needs of processing, storage and retrieval of document information. It is used to describe the content and external characteristics of information in the retrieval system and their interactions. A conceptual identity system that relates and expresses information user needs questions.
1. The role of information retrieval language
⑴At the superficial operational level, retrieval language is used to describe the characteristics of documents and information content in documents, convert document information into certain document identifiers, and constitute various retrieval methods of the information retrieval system; at the same time, it is also used to describe Retrieve the characteristics of the question and the required content, and convert the question into a certain question identifier or search identifier so that specific document information can be found in the retrieval system. From this level, an information retrieval language is a logical semantic tool designed to express various things.
⑵ At a deeper principle level, information retrieval language can link document indexing and retrieval questions, link indexers and searchers, and link document information storage and retrieval to achieve common understanding and mutual understanding between the two. correspond.
2. Types of information retrieval languages
⑴Describe the characteristics of information content
Title language, author language, number language
⑵Describe information content characteristics
Classification language: system classification language, combined classification language, hybrid classification language
Subject language: title word language, unit word language, descriptor language, keyword language
(4) Evaluation of information retrieval efficiency
Retrieval efficiency refers to completeness, accuracy, speed, convenience and economy (retrieval rate, precision rate, retrieval speed, retrieval convenience, retrieval cost and benefit), and the most important thing is completeness and accuracy.
1.Evaluation indicators
⑴ Recall rate R: It refers to a measurement index that measures the ability of an information retrieval system to detect relevant documents when performing a certain retrieval operation. It refers to the percentage of the number of relevant documents detected to the total number of relevant documents in the system. The recall rate reflects the comprehensiveness of the search, and its complement is the missed search rate. The calculation method is: R=the amount of relevant documents detected/the total amount of relevant documents in the retrieval system.
⑵ Precision rate P: It is a measurement index to measure the retrieval accuracy of the information system when implementing a certain retrieval operation. It refers to the percentage of the number of relevant documents detected to the total number of documents detected. The precision rate reflects the accuracy of the search, and its complement is the error rate. The calculation method is: the amount of relevant documents detected/the total amount of documents detected.
⑶ Missing detection rate O: as the complement of the recall rate. The calculation method is: the amount of relevant documents that have not been detected/the total amount of relevant documents in the retrieval system.
⑷Error rate E: as the complement of the precision rate. The calculation method is: the amount of non-relevant documents detected/the total amount of documents detected.
2. Factors affecting recall rate and precision rate
⑴Factors affecting recall rate
① From the perspective of document storage, the main reasons include: incomplete documents included in the library; lack of control and specificity of index vocabulary; incomplete word list structure; ambiguous or incorrect relationships between words; unknown indexing; inconsistent indexing; indexing personnel Missing important concepts of the original text or using inappropriate words, etc.
② From the perspective of the retrieval process, the main reasons include: the retrieval strategy is too simple; improper selection and logical combination; too few retrieval approaches and methods; retrieval personnel are unskilled and impatient; the retrieval system does not have word truncation function and feedback function. The search requirements cannot be fully described during the search.
⑵Factors affecting precision rate
The index words cannot accurately describe the document subject and search requirements; the combination rules are not strict; the word selection and the relationship between words are incorrect; the indexing is too detailed; the combination is wrong; the search terms or search terms used in the search are not specific enough, and the search coverage is Wider than retrieval requirements; the retrieval system does not have the logical "not" function and feedback function; the number of words allowed in the search formula is limited; the truncation position is inappropriate, the logical "or" is improperly used in the search formula, etc.
3. The relationship between recall rate and precision rate
There is a close relationship between recall rate and precision rate, that is, a reciprocal relationship. In an information retrieval system, when the recall rate and precision rate reach a certain level, the two will show a non-linear inverse relationship. In other words, while the precision rate continues to increase, the recall rate continues to decrease; conversely, while the recall rate continues to increase, the precision rate will continue to decrease.
4. Network information retrieval
(1) Characteristics of network information retrieval
1. Wide search range
Network information retrieval can retrieve information resources in all fields, all types, and various media on the Internet, far exceeding the information sources available through online retrieval, CD-ROM retrieval and other information retrieval methods.
2. User-friendly interface
The network information resource retrieval tool directly targets users and is simple and convenient to operate. It generally uses a graphical window interface and provides a variety of navigation functions and multiple search methods. Searchers do not need to master complex search instructions. As long as they enter the search formula according to the prompts and rules on the search interface, they can obtain the search results.
3.Interactive work mode
The network information resource retrieval tool has the characteristics of interactive operations. It can respond to the user's requirements in a timely manner, obtain corresponding instructions from the user's commands, perform corresponding operations according to the instructions, and finally feed back the execution results to the user.
4. Integration of traditional retrieval technology and network retrieval technology
Network information retrieval not only follows many traditional retrieval methods and technologies, but also uses new retrieval technologies such as hypertext/hypermedia, full-text retrieval, and intelligent retrieval with the help of the development of network information technology.
5. High retrieval efficiency
Through hyperlink technology, the retrieval process of network information resources and the information browsing process are carried out in the same interface. With a simple click of the mouse, users can browse and obtain the full text of Web page documents that can be directly read and utilized.
6. Large information redundancy
Network information resources lack unified and standardized management and control and are highly dynamic. Current network information retrieval tools have certain deficiencies in information collection and indexing. The information retrieval process will generate a large amount of useless or even junk information. Accuracy, completeness and authority cannot be guaranteed.
(2) Network information retrieval tools
Network information resource retrieval tools refer to computer systems that provide information resource retrieval services on the Internet. Their retrieval objects are various types of network information resources that exist in the Internet information space.
Types of network information resource retrieval tools:
1. Classify by search content
⑴ Comprehensive: Comprehensive network resource retrieval tools are also called universal network resource retrieval tools. They do not limit the discipline, subject scope and data type of resources when collecting information resources. They can be used to retrieve almost all aspects of network information. resource.
⑵Specialist type: Specialist type network resource retrieval tool refers to the specialized collection of information resources on a certain subject subject and scope, and provides more detailed classification, in-depth indexing and description suitable for the characteristics of its professional resources and retrieval needs.
⑶Special type: Special type network information resource retrieval tools are tools designed to provide retrieval services for a special type of information resources.
2. Classification according to the type of information resources retrieved
⑴Non-Web resource retrieval tools: A type of retrieval tool that mainly targets non-Web resources, such as FTP information resources, Gopher information resources, Telnet information resources, Usenet information resources and other special types of information resources.
⑵ Web resource retrieval tool: It is a specialized Web server or Web website established on the Internet using hypertext technology to provide online information resource navigation and retrieval services. It is a clue tool that not only takes Web resources as the main search object, but also provides services in the form of Web.
①Keyword search tool: Search engine, which uses automatic indexing software to discover, collect and index web pages, and establish a database; it provides users with a search interface in the form of a Web for users to enter search terms such as keywords, phrases or phrases ; Replace the user in the database to find records matching the question, and return the results and output them in order of relevance.
② Directory search tool: It is a hierarchical structure directory that can be searched according to a certain classification system. The classification method is mainly based on subject classification. The retrieval method using such tools is called "categorical search". This is a "top-down, gradually refined" search method that traverses layer by layer.
③Hybrid retrieval tools: At present, the keyword retrieval of search engines and the classification retrieval of catalog-type retrieval tools are gradually integrated. You can not only enter search terms directly, but also browse the catalog to learn about resources in a specific field to enhance retrieval capabilities.
(3) Search engines and applications
1. Development of search engines
Search engine: It is a tool that provides users with keywords, phrases or natural language to retrieve information on the Internet. The basic principle of its work is: the user puts forward a search request, and the search engine searches in the database on behalf of the user and will retrieve the information. The results are fed back to the user.
2. Basic principles of search engines
⑴How independent search engines work
①Data collection; ②Data analysis and indexing; ③Data retrieval; ④Data mining
⑵How meta-search engines work
①User interface; ②Query agent; ③Result summary output
3. Application of search engines
⑴ Comprehensive independent search engine
⑵Vertical independent search engine
⑶Meta search engine
⑷Mobile search engine
(4) Important areas of network information retrieval
1. Multimedia information retrieval
Multimedia information retrieval is to retrieve information from multimedia such as graphics, images, text, sounds, animations, etc. according to the user's requirements to obtain the information the user needs. Divided into text-based retrieval and content-based retrieval.
2. Cross-Language Information Retrieval (CLIR)
Cross-language retrieval means that users construct and submit search questions in a certain language (source language), and the system retrieves document sets expressed in other languages (target language) that meet the user's needs. Cross-language information retrieval uses user queries described in a single language to retrieve multilingual information resources. The essence is the matching between single-language user queries and multilingual information representations.
At present, the main research hotspots of cross-language information retrieval include: cross-language information retrieval auxiliary technology methods, language conversion methods, information organization and retrieval models, etc. At the same time, there are still some issues that need further research, such as semantic-based information retrieval, application-oriented Oriented cross-language retrieval platform, correlation search result merging, visualization processing, etc. will become new research directions.
3. Intelligent information retrieval
The intelligent information system is developed from word extraction and full-text retrieval. It is an artificial intelligence retrieval system based on the relevance of search terms and has high judgment, understanding and processing capabilities for search terms.
In recent years, intelligent information retrieval based on semantics, agents, and ontology has become a research hotspot. Semantic intelligent information retrieval systems attempt to semantically understand and index documents and intelligently understand users' information needs based on their relevant information. Agent is an adaptable and intelligent software entity. It is combined with intelligent information retrieval to improve the intelligence of retrieval technology and promote human-computer interaction. Ontology can better describe the connotation of concepts and the relationship between concepts, and has a good concept hierarchy and support for logical reasoning. It can be used to expand and reason about user retrieval, thereby improving the efficiency and accuracy of retrieval.
4.Visualization of information retrieval
Information retrieval visualization is the application of information visualization technology in information retrieval. It refers to converting document information, user questions, various information retrieval models, and internal semantic relationships invisible during the information retrieval process into graphics, a two-dimensional or three-dimensional displayed in the visualization space.
As a cutting-edge exploration direction, the current research and application of information retrieval visualization has achieved certain progress and results, which are mainly reflected in different aspects of retrieval processing activities: (pre-retrieval) questioning structure; transparency of the retrieval process; (retrieval) (After) result processing; development of visual retrieval vocabulary system, etc.
5. Intelligent question and answer system
The current rapid development of artificial intelligence, and the gradual application of machine learning, neural network and other technologies, have greatly promoted the development of intelligent question and answer systems, producing some representative products, including automated question and answer platforms, voice question and answer robots, etc. The one-question-one-answer service method of these intelligent question and answer systems facilitates and accurately locates user needs, enables real-time interaction, and greatly improves the degree of personalized service.
Chapter 7 Information Services
1. Content and methods of information services
(1) The meaning and content of information services
Information service: It is an activity that provides users with the required information in different ways. It is a service business with information as its content.
It includes two aspects: ① First, collect, evaluate, select, organize, store information scattered on different carriers, order it into a convenient form; ② Second, meet the needs of users and information Conduct research to provide them with valuable information.
(2) Characteristics of information services
Information service is a type or a series of behaviors that can meet user needs that occur between information users and information providers, information resources, and information service systems. In the process of information services, information sources, information activities and information users become three basic elements. The characteristics of information services are mainly reflected in the following aspects:
1. Highly knowledge-intensive
Information services require service personnel to have comprehensive knowledge and quality, and be able to make judgments based on users' information needs and determine optimized service provision content and methods; at the same time, users also need to have corresponding knowledge reserves. Only when the user's own knowledge and the service provider provide Information services can be effectively utilized only when the information is matched.
2. Information user participation is high
Since users' information needs are not easy to express clearly and are not easy to be perceived, it is only possible to understand their needs and provide information services based on more interactions with users. This makes information services an interactive service that is driven by user needs and requires a high degree of user participation.
3. Information service providers have deep contact with users
Information service is a service oriented to users' dynamic and personalized information needs. Due to the deep contact level, many uncertain factors will be faced in the service process. Users may put forward new requirements at any time, so service personnel often replace information resources and become Information service dominant factors.
(3) Principles of information services
1.Principle of pertinence
Meeting the specific needs of specific users at a specific time is the basic starting point of information services. Information service agencies must carefully study users' information needs and changes in needs, understand users' habits and characteristics of using information, select information content, information carriers, and information channels that meet user needs, and provide users with highly targeted information services.
2. The principle of timeliness
Information is time-sensitive, that is, it can only be effective within a specific time frame. This time is usually before the user makes a decision and selects the information he needs. If information is provided too early, users will not need it, and the information utility cannot be realized; if it is provided too late, the information will be worthless.
3. Principle of ease of use
Practice shows that user utilization of information is affected by availability and ease of use. Availability and ease of use often outweigh the value of the information itself when deciding whether to select and utilize information. Therefore, information service agencies should provide the greatest convenience for users to obtain and utilize information.
4. Cost/benefit principle
Information services must emphasize both social benefits and economic benefits. Although the benefits of information services are potential and delayed, and it is difficult to make a definite evaluation, both information service organizations and users need to spend a certain cost, and it is necessary to ensure that the maximum benefits of information services are obtained with the minimum cost.
(4) Types and methods of information services
1. Types of information services
The development of information services relies on human resources and technical resources. From the perspective of different resource dependence models of information services, information services can be divided into execution type, experience type, and expert type.
⑴ Execution-type information services: Mainly rely on specific service systems, and front-end products are often information or information collections provided according to the explicit requirements of users. Backend work focuses on information sequencing, information system design and maintenance. Users are the dominant factor in the service process, such as information data delivery services, information analysis software application services, retrieval services, etc.
⑵ Experience-based information services: Mainly rely on specific service experience. The front-end products are customized information supplied based on current situation analysis and judgment of user needs. The backend needs to strengthen information collection and conduct certain information integration in advance based on predicting user needs. Both users and service personnel play important roles in the service process, such as transactional consultation, industry reports, etc.
⑶Expert information services: The front-end products are mainly targeted reports and solutions. The backend needs to focus on information collection and information analysis, with special emphasis on information analysis. In this service process, the dominant factor is the service personnel.
2. Information service methods
⑴Information retrieval and delivery services
Information retrieval and delivery services are information service organizations that use the organization's internal and external information resource systems to provide users with appropriate information at the right time and in the right way according to the specific needs of information users. Specific forms include information search services and novelty checking services. , interlibrary loan and document delivery, etc.
①Information search service: Information agency service personnel use various search tools to obtain relevant literature information resources for their queries according to the needs of users.
②Novelty search service: It is a type of traditional information search service. Its application purpose is to avoid blindness, duplication and subjective errors in scientific research topic selection and results evaluation. This service is widely used in scientific research project establishment, achievement appraisal, award application, patent application, etc.
②Interlibrary loan and document delivery: It has now become one of the main forms of public welfare information services. This service uses a convenient network environment to establish smooth channels for users, and can provide users with books, journal articles, full-text database documents and other resources. Interlibrary lending and delivery services to meet user needs.
⑵Information reporting and publishing services
That is, after information agencies organize, process, evaluate, research and select a large amount of collected data and information, they report it in a timely manner to meet users' information needs. There are three common ways of reporting information:
①Text reporting: This is the main form of information reporting. Information service agencies study the tasks and information needs of users in scientific research, production, business and other activities, and selectively publish materials and information of important value (original information). or primary information) is processed into secondary information and tertiary information for reporting. This kind of reporting is generally divided into directional reporting and fixed-topic reporting. Text reports can also be divided into primary information reports, secondary information reports and tertiary information reports according to their different contents and functions.
② Oral reporting service: mainly providing information through direct conversations, special lectures, academic conferences, and information conferences. The information reported verbally is generally zero-time information, which is characterized by rapidity, timeliness, strong interactivity and good effect.
③Intuitive communication and reporting services: that is, conveying information through physical objects such as samples, prototypes, exhibitions, movies, audio and video and other carriers. This method is intuitive and vivid, easy to accept, and easy to understand, but the scope of information transmission is limited.
⑶Reference service
In information institutions represented by libraries, reference services have been developed for a long time. In the Internet age, this service is more represented as a digital reference consulting service, that is, a question-and-answer service based on the Internet that connects users with expert subject expertise. At present, the digital reference consultation service in the library industry adopts the distributed cooperative reference service model, which to a certain extent solves the contradiction between the limited number of consultants and unlimited user questions. It is mainly divided into three types: asynchronous, real-time, and cooperative.
①Asynchronous services: Mainly implemented using E-mail, BBS, message boards, FAQ and other methods.
②Real-time service: Mainly realized through real-time communication technologies such as online chat, video conferencing, and online call centers.
③Cooperative digital reference consultation: It is a distributed digital reference consultation network formed by many member institutions to solve consultation problems that a single library or information institution cannot solve due to various conditions.
⑷Information consulting services
Information consulting services are a specialized consulting activity that helps users solve information problems. It originated from the library's reference consultation and continues to develop with the expansion of the scope of modern information services. Compared with other information service methods, information consulting services are flexible and diverse, more proactive and penetrating, can directly meet users' information needs, help users solve problems, and are more influential to users. Information consultation mainly provides the following services to users:
① Where can you obtain the required information and what is the most valuable information source;
②What is the most convenient information channel and how reliable it is;
③Evaluate the reliability, applicability and economy of the information.
④ Provide users with information analysis research reports and forecast reports;
⑤ Provide and recommend information technology selection and evaluation solutions to users.
⑥Provide users with information system and network design solutions, etc.
2. Evolution and modern development of information services
(1) The evolution and development trends of information services
New development trends of modern information services:
⑴Mobile information services
① The popularization of mobile terminals such as smartphones, and the development of mobile communication technology and mobile Internet technology have changed the way people access and use the network, allowing network information services to extend to mobile information services. Mobile information services are oriented to mobile terminal users and provide information services through wireless networks or mobile communication networks.
②The characteristics of mobile information services are mainly reflected in: ubiquity, personalization, flexibility, and location. These features change the way information services are provided, enrich the content of information services, and enhance user experience.
⑵Intelligence of information services
① With the rise of new generation information technologies such as human-computer interaction, big data, and artificial intelligence, as well as the integrated development of Internet technology and information communication technology, people's information acquisition methods and information needs are increasingly diversified and diversified. In the context of new technologies, information services are beginning to develop in an intelligent direction.
②The connotation of intelligent information services is: using big data analysis to creatively provide users with more optimized information services based on factors such as users’ professional fields, behavioral habits, demand preferences, etc., mainly as follows: personalized services and proactive services , self-study.
③The intelligence of information services mainly includes three aspects: intelligence of information resource organization, intelligence of information service methods, and intelligence of information service content.
⑶ Online and offline integration of service scenarios
The development of China's Internet economy has gone through the initial online era, and then the O2O (online to offline) model that penetrated from online to offline. Nowadays, the popularity of mobile Internet and mobile payment, as well as the development of artificial intelligence, Internet of Things and other technologies, have led to the integration of online services and offline services. The penetration rate of information services into the real economy is getting higher and higher. Information services and the real economy are showing The trend of integration. In this context, the industry has proposed the OMO (Online-Merge-Offline) model that integrates online and offline.
⑷In addition, the development of information services also shows the following trends:
① From a single form of service to a comprehensive service; ② From department-based information services to a socialized information service model; ③ Various special information services and systematic information assurance services will become a major factor in the development of information service business Mainstream; ④ Information services develop towards diversification and diversification; ⑤ Personalized, knowledge-based and integrated development of network-based information services.
(2) Network information services
Network information services: refers to the general term for information services organizations to provide processed and organized information products and services through computer networks in order to meet users' information needs. It includes the processing of information content, the provision of information content, and the storage and acquisition of information content. , network value-added services, information consulting services.
1. Characteristics of network information services
⑴Information provision is knowledge-based
Network information services provide highly concentrated information products that have been processed and processed. Compared with complex, disordered, and unorganized network information resources, their content is more standardized and can establish a "knowledge base" for users based on their needs. ", "data warehouse", etc., thereby achieving the purpose of saving users time and energy and quickly providing processed and organized information content.
⑵Socialization of information services
The socialization and network sharing of information resources of network information service organizations are the most distinctive features of network information services. In the network environment, network information service organizations can meet the requirements of users to the greatest extent, maximize the effectiveness of information resources, fully embody the concepts of unlimited resources and unlimited services in the network, and move from a closed library to an open society. At the same time, the scope of information services is constantly expanding.
⑶Information service virtualization
The Internet breaks the boundaries of region and time. The staff of network information service agencies can contact users in real time through the Internet. The service content and process are all electronic. The subject and object of the service are also hidden. What users face is networked. It is an information service system instead of a specific person or thing, thus breaking through the time and space limitations in traditional information services, making users' information retrieval, request and acquisition more convenient and timely, saving users time and energy .
⑷Personalization of information services
The personalization of network information services is an obvious feature of information services after the emergence of the Internet. It is precisely because of the improvement of network technology and information technology that the personalization of information services has been fully realized and developed. It is reflected in the fact that in the face of different users, the content, process and means of information services are more targeted and purposeful.
2.Personalized information services
⑴Personalized network information service: It is oriented to users’ information usage behavior, interests, hobbies, habits, and demand characteristics. It provides content, systems and functions for users to search, organize, select, and recommend personalized customized information services to satisfy users. Information needs to solve real-world problems.
⑵Personalized information services include two aspects: first, users can customize the information they need according to their own needs; second, the Internet information base actively selects the resources and services most needed for users based on the characteristics of user services, and based on the user's needs As needs change, the information provided is dynamically changed so that users can receive personalized services.
⑶The service methods it includes include: subject portal service, personalized classification customization service, active information push service, information intelligent agent service, digital reference service, information help retrieval service, vocabulary navigation service, automatic summary service, etc.
⑷The more important technologies include: user interest and behavior acquisition and user modeling technology, data mining technology, personalized recommendation technology, information filtering technology, artificial intelligence technology, intelligent agent technology, personalized retrieval technology, etc.
3. Integrated information services
⑴Information integration service: refers to the modern digital network environment, based on modern information integration theory and technology, through the integration and dynamic integration of service elements, and the construction of an integrated service system with complementary advantages, allowing users to A service concept and model that utilizes the most needed resources and services at the lowest cost.
⑵The development motivations of information integration services mainly include: ①Changes in user needs; ②Diversification of information resources; ③Promotion of information technology; ④The needs of information services themselves.
⑶Types of information integration services: ① Resource-centered information integration services; ② Technology-centered information integration services; ③ Institutional cooperation-centered information integration services; ④ User-centered information integration services.
4. Mobile information services
⑴Mobile search service: ①Search service of mobile search engine; ②Search service within mobile application.
⑵Mobile positioning service: Location-based service is a value-added service provided by a combination of mobile communication network and satellite positioning system. Location information such as the longitude and latitude coordinates of the mobile terminal is obtained through positioning technology to realize various location-related services.
3. Research on information users, their needs and behaviors
Information users: From the perspective of information management, information users mainly refer to individuals and groups who use various information service methods or information exchange channels to obtain the required information.
(1) User information needs
1. Information needs and information needs
⑴Information needs: Information needs are people’s sense of dissatisfaction and necessity for information in order to solve various practical problems in practical activities. Information needs to be in a multi-level structural state when practical activities and practical problems to be solved are relatively stable.
⑵Level of information needs:
① (Objective state): In the state of information needs that people exchange to solve various practical problems in practical activities, some information needs are not realized and are still in the internal state, and some information needs are realized and become A state of external reaction. This is the first level of the state structure.
② (Cognitive state): We define aware information needs as information needs, that is, information needs become information needs once they are realized. Among the states of aware information needs, some cannot be expressed in words and are in the "consciousness" state, and some are expressed by people through questions and other methods and become a problem state. This is the second level of the state structure.
③ (Expression state): The information needs that are realized but not expressed are called potential information needs, and the information needs that are realized but not expressed are called actual information needs. The "information needs" that people often say are often actual information needs, that is, Users can promptly obtain complete and reliable information needed to solve problems in a convenient format. When people express their information needs, they can address this need to many information sources. He may make a request to the information service agency to find information, or he may make a request to other information sources. We often refer to the specific requests made by users to information service agencies as information questions. This forms the third level structure of information needs.
2. Information needs theory
⑴Meaning construction theory; ⑵Knowledge abnormality theory
3. Characteristics of information needs
⑴Diversity characteristics of information needs
Users' information needs are diverse, and this diversity is determined by the diversity of social roles they assume. People's social roles can be divided into two categories: pre-assigned roles and self-generated roles based on the way they are acquired. According to the information needs of users in different roles, the user's information needs structure can be divided into three parts: personal information needs, organizational information needs and social information needs. At the same time, the diversity of users' information needs also comes from the environment in which information needs arise.
⑵Knowledge characteristics of information needs
Information needs are different from other needs. They not only require the stimulation of demand intensity, but also depend on the user's own conditions, especially his cognitive ability. As a condition for the correct transformation of information needs from objective state to subjective state, users must have at least three aspects of knowledge, namely industry knowledge, professional knowledge and information resource knowledge.
⑶Fuzziness characteristics of information needs
User information needs are a derived need, arising from the user's specific lack of specific knowledge, and information needs are often related to certain problems or decisions. For a wider range of knowledge fields, it is even impossible to determine its subject, scope or affiliation. . Therefore, users' information needs are often in a vague state. This ambiguity in users' personal knowledge has become a common feature of users' information needs.
(2) User information behavior and its characteristics and rules
User's information behavior: refers to the user's needs expression, information acquisition, information utilization and other behaviors when seeking the information he needs. Users' information needs vary widely, the influencing factors are diverse, and users' information behaviors are also complex and changeable. However, if we examine the characteristics of users' information needs, psychological motivations, and information behaviors, we can find some common rules.
⑴The importance of the problem that the user needs to solve, or the value of the information required is the fundamental reason that determines whether the user's information needs are transformed into information behavior.
If the problem that the user wants to solve is crucial and a certain piece of information can help solve the problem, and the information source is unique, then the information service that the user uses from a certain information center is uniquely determined.
⑵The availability of information and information services and the ease of use of information resources and information systems are the most important factors that determine whether intelligence users utilize certain information services.
① Availability is determined by the material carrier of the information source, the geographical location of the information source and information institution, network conditions and other factors. It is an attribute of whether information and information services are easy to obtain and use. The user's choice of information sources or information services is almost always based on availability. The most accessible information source is selected first, and the requirements for quality and reliability come second.
②The ease of use of information systems has the famous "Moores' Law", which states: "For an information retrieval system, if it is more troublesome and troublesome for the user to obtain information than not to obtain information, this The system will not be exploited." This law is actually a specific application of the famous Zipf's law. Information users always hope that the retrieval system is as easy to use as possible. The easier and easier to use the retrieval system, the more frequently users will use it. On the contrary, few users will use it. This is the embodiment of the "labor-saving law".
⑶In the process of seeking information, users first search from personal databases, and then turn to informal channels to obtain help from peers.
①Only when these methods cannot achieve the purpose, information systems or information centers will be considered. There are two reasons for this behavioral characteristic: due to ease of use; due to availability.
②These behaviors of information users are closely related to the accessibility selection theory. This theory holds that the user's behavior of using information is divided into three stages: selective confirmation, selective absorption and selective memory, and the behavior at each stage is restricted by the accessibility selection law. The concept of accessibility here refers to the relationship between users and information resources, and its connotation includes three aspects: physical accessibility, intellectual accessibility and psychological accessibility.
⑷Any information user looks for the information he needs through both formal and informal channels.
For many users, informal channels are considered more important than formal channels. Although informal channels are not as rigorous and reliable as formal channels, they still occupy an important position in the information exchange system due to reasons such as availability, ease of use, and user habits.
⑸The general trend of users’ time requirements for providing information services and information materials is to update faster.
① This is because the pace of life in modern society is accelerating, various social systems are running at overspeed, and people's various decisions are urgent, so information services are required to be faster and more timely.
②On the other hand, because of the rapid development of social economy, science and technology, more and more information, and accelerated aging, this requires that the information provided by information agencies should be up-to-date. These two factors make information users need the fastest service and the latest information.
⑹Two phenomena in which users consult information service workers in the information center.
① First, when users consult information service personnel, they have a strong tendency to express what they think the information center can provide rather than what they really want. There are two reasons for this situation: first, the functions of the information system cannot satisfy users, and second, new users use the information services of the information system for the first time.
②Second, users can express their information needs more accurately by writing questions than directly consulting information personnel. This often happens with questions about computer information retrieval systems.
③ In view of the above two phenomena, when providing services, information service personnel should first make users understand the characteristics and functions of the information system, and secondly, they should carefully figure out the real information needs that have not been expressed by users. If necessary, they should try to allow users to answer questions on the question list. Fill out the information requirements.
(3) Contents of information user research
1. Research on the composition and classification of information users
Study the basis or standard for classifying information users and the types of user classification, and on this basis, study the quantitative composition and knowledge composition of various types of users.
2. Survey and analysis of user information needs
Investigate and analyze what information users need in terms of content, form, scope, and time limit, when they need it, and what methods and channels they are used to using to obtain the information they need.
3. Research on the psychological laws of user information
Use the theories and results of psychology to study the psychological characteristics of users' information, explore the psychological state and process of users' information needs turning into information seeking behavior, analyze the psychological rules of users in the process of searching for information and conducting information consultation, and help information users eliminate psychological barriers. Reduce the constraints of information systems on users and provide a basis for improving information service methods and improving service quality.
4. Research on the utility of user information and the mechanism of users absorbing information
The process of users selecting, evaluating and absorbing information is an important link in realizing the value of information. This aspect requires research on: how users select and evaluate information, the process by which users absorb information to make decisions or create, the principles and evaluation methods of user information utility, etc.
5. Research on factors affecting users’ information needs and information activities
The information needs of information users are a specific social need, which is affected by many factors. It is necessary to comprehensively study the impact of social politics, economy, culture and science and technology on users' information needs. It is also necessary to study the impact of users' own knowledge literacy, personal interests, information capabilities and professional characteristics on information needs. Users' information activities include complex psychological activities and behavioral activities of collecting, querying, absorbing and using information. These activities are affected and affected by a large number of factors, including users' personality factors, environmental factors and information activity factors.
6. User information assurance research
User information assurance refers to providing users with appropriate forms of information through all possible channels and methods for their selection and use. It is the systematization of "information services" and is the key to meeting users' information needs. The purpose of "information assurance" is to provide users with the information they need in a timely and appropriate manner to serve the work they are engaged in. Research on user information assurance includes the principles of information assurance, users’ choices of information and information service methods, information assurance methods and systems, the organization and management of user information assurance, user training, etc.
(4) Research methods on information users
Questionnaire method, interview method, observation method, experimental method, think aloud method, eye tracking method, video capture method, network log analysis
4. Information needs and behaviors of network users
(1) Information needs of network users
The information needs of network information users refer to users’ needs for specific information, retrieval needs of information retrieval tools, and information service needs. The information needs of network users have shown some new characteristics:
⑴The demand for information has increased significantly
Human beings' ability to process and transmit information is getting stronger and stronger, further stimulating the intensity of the demand for information. With the development of society and the progress of civilization in human society, after people's daily life is satisfied, in order to improve the quality of life, the demand for information on leisure, shopping, and tourism will become greater and greater. Social sustainability requires a lot of information to support it. In addition, modern information technology can improve the social environment in which users live, indirectly leading to an increase in the number of users, and the total amount of information required will inevitably increase.
⑵The requirements for information content are getting higher and higher
Users require accurate information, fast transmission speed, and strong timeliness, which are determined by the timeliness characteristics of the information itself. In the face of a rapidly developing society that is increasingly informatized, it is particularly important to obtain and utilize information quickly, accurately, and efficiently. People's requirements for the quality of information will inevitably become higher and higher.
⑶Socialization of user information needs
In the network environment, the traditional model of information service in the information service department is increasingly difficult to meet the open information needs of users. Users' information needs have changed from the original stability and centralization to an open and social one. The Internet, metropolitan area networks, and local area networks have created an open and objective environment for users' information needs, and accelerated the socialization process of users' information needs.
⑷Integration of user information needs
The development of information networks organically combines computer technology, remote communication technology and network information processing technology, fundamentally changing the development, organization and distribution of information resources, allowing users to conveniently obtain centralized information in the network environment according to the objective needs of the subject. The required information is to integrate various information acquisition methods in the network, so that information exchange, query, data acquisition, full-text reading and information release are integrated into multi-functional, multi-channel and multi-mode information needs and service utilization behaviors.
⑸Matthew effect and Robin Hood effect of information needs
The Matthew effect of user information needs refers to the correlation between user information needs and the amount of accumulated information. Generally speaking, users with large information needs, as time goes by, the more information they accumulate, their information needs will become higher and higher than the average level. As for users with small information needs, the total amount of information they have accumulated has stagnated as time goes by, and their information needs have become increasingly lower than the average level. On the other hand, the level of user needs is always relatively balanced, and the total information needs of most users tend to be average. This is the Robin Hood effect of user information needs.
(2) Information behavior of Internet users
Network user information behavior: It is the response of network users to external conditions under the control of cognitive thinking. It is based on information needs and ideological motivations, using the information content and information services provided by the network, and engaging in understanding and understanding of information needs. Activities such as expression, information query, information selection, information storage, information absorption and utilization, information processing, and information interaction.
1. Internet user information query behavior
Network information query behavior refers to a series of physical activities and psychological activities that users take in the process of interacting with the network to obtain the required information. It is mainly completed through two types of behaviors: information retrieval behavior and information browsing behavior.
2. Network user information selection and storage behavior
The information selection behavior of network users means that users use certain analysis and judgment abilities to judge and select the relevance of the presented network information based on their own needs and based on their individual existing knowledge accumulation, thereby filtering out the information that is most relevant to them. Useful information or links to information.
Information storage refers to the behavior of users saving selected information that is helpful to them.
3. Internet user information absorption and utilization behavior
Facing the collected information, users need to react, identify, and organize it in their minds. With the help of their personal digestion and learning abilities, they can understand, absorb and translate it into a member of their own knowledge sequence, thereby enriching their personal knowledge reserves.
Information utilization behavior is based on information absorption, that is, users use the information they have and act on their production, evaluation, decision-making, solving, innovation, life, leisure, interpersonal communication, replenishment of think tanks and improvement of thinking level etc. It is a key link in making information truly effective.
4. Internet user information interaction behavior
Network user information interaction behavior is the online information exchange behavior of users using the network as a communication platform. In this kind of information interaction, users can not only obtain information from network resources and network interaction peer entities, but also upload information to the network or transfer information to each other, that is, they become information publishers. It is a two-way information exchange behavior. According to the interactive participants, we can divide network information interaction behavior into two types: human-computer interaction and human-human interaction.
5. Collaborative information behavior of online community users
Compared with a single individual user, the information behavior of online community users shows stronger interactivity and collaborative characteristics. In addition to searching and browsing the information they need on the Internet, they also publish and share knowledge that they are interested in. At the same time, they also expect to interact and collaborate with other users with the same interests, reflecting a strong group character and influencing those around them. The online awareness and behavior of other users.
Chapter 8 Information Systems
1. Basic concepts of information systems
(1) Concept and composition of information systems
Information system: It is a human-machine integrated system composed of computer hardware, network and communication equipment, computer software, information resources, information users and rules and regulations for the purpose of processing information flow. The purpose is to collect, process and store in a timely and correct manner. , transmit and provide information to achieve management, regulation and control of various activities in the organization.
The components of the information system: ① computer hardware system, ② computer software system, ③ data and its storage media, ④ communication system, ⑤ non-computer system information collection and processing equipment, ⑥ rules and regulations, ⑦ staff
(2) Functions of information systems
1.Input function
The input function of the information system is determined by the purpose of the system and the capabilities of the system and the permission of the information environment. Generally, the input content of the information system includes the collection of information resources, the input of control instructions, the input of information retrieval conditions, etc.
2. Storage function
Storage function refers to the system's ability to store various information materials and data. After the original data and information are input into the information system, they first need to be stored for multiple use, and the data and information can be shared in multiple processing links and processes.
3. Processing functions
After a large amount of information and data are stored, they must be processed in a timely manner. Processing is the production process within an information system. The size of the information system processing function depends on the professional technical strength within the system and the modernization level of the information processing technology equipment.
4.Output function
The various functions of the information system are designed to ensure the ultimate output function. After the information is processed, the output content includes data information processed by the information system, feedback information on the status of the information system during operation, and prompt information when manual intervention is required.
5.Control function
In order to keep the input, storage, processing, output and other links of the information system running evenly and continuously, the system must have the ability to manage and control. The control function of the information system is reflected in two aspects: one is to control and manage the various information processing equipment that constitute the system; the other is to control the entire information processing, processing, input, output and other links through various programs . Through the control function, various functions of the information system are optimized, thereby optimizing the operation of the entire information system.
(3) Development and types of information systems
1. Development stage of information system
⑴The initial information system was mainly an information processing system, such as a document-based information retrieval system such as a library and information center. It is manual, that is, a system in the traditional sense.
⑵ Since the advent of electronic computers in the 1940s, information systems have entered the stage of modern information systems - computer information systems, and have developed rapidly with the leap in information technology and the growth of information needs in modern society.
⑶In the 1950s, computer breakthroughs in data processing technology expanded the space for computer applications. Data statistics systems, data update systems, data query systems, data analysis systems, system status reporting systems, etc. appeared one after another. At the same time, electronic data processing systems emerged, which effectively promoted the development of information systems.
⑷ From the late 1960s to the 1970s, various forms of information systems appeared, such as management information systems and decision support systems.
⑸ In the 1980s, the senior executive support system to support the highest decision-making level of the enterprise and the manager information system to support the middle and senior managers appeared. At the same time, information systems in different fields have also seen new developments. Expert systems have emerged in the field of artificial intelligence. In processing and manufacturing enterprises, the application of computer integrated manufacturing systems has enabled the automation of enterprise production and operation links.
⑹ In the 1990s, information technology developed further, and group decision support systems, intelligent decision support systems, product data management systems, etc. appeared.
⑺In the 21st century, a large number of information systems of different styles and applications have appeared in various industries. Information systems have become one of the most important indicators for evaluating the degree of organizational informatization. At the same time, with the emergence and development of data mining technology, computer processing of information has risen to the processing of knowledge, and knowledge management systems and customer relationship management based on data mining have emerged.
Stage theory of information system development process—Nolan model
⑴ Starting stage: This stage starts when the enterprise introduces the first computer, which is usually used in finance, statistics, material and other departments. As the enterprise deepens its understanding of computer applications, people realize the value of computer applications and begin to learn. , use and maintain computers.
⑵Expansion stage: As computers see results in some departments, they spread from the initial application departments to other departments, and a large amount of manual data processing turns to computer processing. People's enthusiasm for computers increases and demand grows. But for the entire organization, without an overall information system, this stage requires a lot of investment.
⑶Control stage: Due to the growth of people's demand for computer information processing, financial expenditures have increased significantly. Business leaders have to control it, focus on cost/benefit analysis of application development, and analyze the differences between various developed application projects. Inconsistency and data redundancy are planned in a unified manner. The control stage is the key to realizing the shift from computer-based to data management-based, and generally develops slowly.
⑷Integration stage: After the comprehensive analysis in the third stage, database technology is introduced, and under the condition of developing data network, the data processing system enters a stage of rapid development, gradually improving the original system, and developing a new system that can serve the middle and upper levels. Management provides support and provides enterprises with a management system for various information resources. Investments and expenses will again increase rapidly during this phase.
⑸Data management stage: After the system is integrated and synthesized, it is possible to enter into effective data management and realize data sharing. At this time, the data has become an important resource of the enterprise.
⑹Maturity stage: The maturity of the information system shows that it is consistent with the goals of the organization, can support everything from the organization's transaction processing to high-level management and decision-making, and can adapt to any new changes in management and technology.
2. Types of information systems
From the perspective of organizational management processes and levels, information systems can be divided into: transaction processing systems, management information systems, and strategic decision support systems, which respectively correspond to the operational layer, management layer, and decision-making layer of organizational management.
⑴Transaction processing system (TPS): It is a system that processes organizational affairs and business data. It is data-oriented and records, calculates, classifies, summarizes, stores and outputs data in the organizational affairs process. This type of information system is relatively simple in structure and has a single function. It can run on a single machine or in a network environment, and is mainly oriented to single applications. Although transaction processing is simple, it is large in number and very important. It is the basis of all management and decision-making information systems of an organization.
⑵ Management Information System (MIS): It is a system that handles organizational management information. The main users are middle managers in the organization. MIS is information-oriented. Middle-level managers need to use management control information to measure performance, determine control actions, formulate regulations for operating personnel and allocate various resources, etc., to assist in realizing the management functions of various departments of the organization, and provide a solution for semi-structured problems. Provide support for decision-making.
⑶ Decision support system (DSS): It is a system that handles unstructured and semi-structured problems and supports managers' non-procedural decision-making. It uses data in transaction processing systems and information in management information systems, other organizational information and knowledge, and organizational external information and knowledge to provide decision-making support for decision makers. Top managers choose the best option among the alternatives based on their own knowledge and experience and the support of the decision support system. DSS consists of three parts: data, model and user interface. Compared with MIS, DSS is knowledge-oriented, has a higher level of intelligence, and can handle unstructured problems.
①Intelligent Decision Support System (IDSS): It is an information system formed by introducing artificial intelligence technology into DSS.
②Group Decision Support System (GDSS): It is an extension of DSS. It combines computer processing, data communication, email and other technologies to provide GDSS software called groupware to support the decision-making process and enable decision-making in different locations. A computer-aided decision-making system in which people can participate in joint decision-making through their own computer terminals.
③Expert system (ES): It is a computer software system designed to simulate the question-answer activities of human experts in specialized fields.
(4) Architecture model of information system
1. Host - terminal mode
This model is a centralized computing environment that uses large hosts as servers for centralized management and accesses data through terminals. In the centralized mode, the services provided by the information system completely rely on the host server. Each request from the client needs to be processed on the server and then the results are returned to the client, so the requirements for the server are extremely high. The advantages are: the system is stable and easy to maintain; the disadvantages are: excessive requirements on host resources, insufficient support for large-scale applications, and difficulty in realizing cross-hardware and cross-platform information sharing. At the same time, terminal application tasks are simple, resulting in a waste of resources.
2. Client/server (C/S) mode
In a distributed computing environment, this model combines the advantages of large host organizations and network workgroups, centers on a high-performance database server, centrally manages core resources, and provides clients with the data support required for applications. Advantages: It not only fully shares the resources of the server, but also fully utilizes the local resources of the client. The load of the entire system is reasonably distributed between the client and the server, making large-scale, networked applications possible; Disadvantages: Development The cost is high, system configuration, maintenance management and upgrade are troublesome and expensive, the manageability is poor, and the work efficiency is low.
3. Browser/server (B/S) mode
The B/S model is developed on the basis of the C/S model. It uses the browser as the client application and adopts a three-tier structure of browser-application server/Web server-database server. Data information is in the form of network pages. transmission. It has many advantages such as being simpler, lower cost, and able to provide more information. It is very convenient to use and has become the development trend of information system architecture.
4.Multi-layer architecture model
Since the concept of the three-tier structure was proposed and applied, it has been continuously extended and developed, and a multi-tier structure has gradually emerged, namely the front-end interface (such as a browser), Web server, middleware (application server) and database server. The emergence of this logical structure not only means the basic formation of the e-commerce system framework, but more importantly, it makes the system more reasonable in terms of structure and performance.
2. Development of information systems
(1) Technical foundation and operating environment
The foundation of information systems is information technology. It is generally believed that information technology is the general term for computer hardware technology, computer software technology and communication and network technology.
1. Computer hardware technology: Computer hardware is mainly composed of five functional components: arithmetic unit, controller, memory, input device and output device.
2. Computer software technology: Software systems help users use computer hardware to complete data input, processing, output, storage and other activities. Software is divided into system software and application software.
3. Computer network technology: Computer network is a system that uses communication media to connect computers, computer systems and other network equipment distributed in different geographical locations, and uses fully functional network protocols to achieve information exchange and network resource sharing. Computer network includes: network media; protocols; nodes; links.
4. Database technology: A database is a collection of related data stored together in a certain organizational manner. The database system includes four components: data collection, hardware, software and users.
(2) Stages of information system development
System Development Life Cycle (SDLC): Refers to a structured, step-by-step approach to developing information systems, describing the various stages of an information system development project, from initial feasibility studies to complete program maintenance.
1. System planning stage
① Define the system to be developed; ② Determine the project scope; ③ Develop a project plan
2. System analysis stage
① Collect business requirements; ② Sort requirements
3. System design stage
①Design technical framework; ②Design system model
4. System development stage
①Establish technical framework; ②Establish database and programming
5. System testing phase
① Write test conditions; ② Implement system testing
6. System implementation stage
①Write detailed user instructions; ②Provide training for system users
7. System maintenance stage
① Establish a service office to serve system users; ② Provide an environment to support system changes
(3) Information system development methods
Information system development methods organize, construct and manage information systems based on information system development rules, including people's thinking methods, guidance strategies, technical approaches, activity rules and steps. Abstract thinking and image thinking methods are the basic ideas of system development methods; generalization and division, induction and deduction are the basic logical methods used in system development. From the perspective of engineering technology, the more influential system development methods include: structural life cycle method, prototyping method, object-oriented method, CASE method, etc.
1. Structured life cycle method
The structured life cycle method is one of the most commonly used information system development methods, also known as the structured development method or waterfall development method. The basic idea is: in the system analysis and design stage, it is decomposed layer by layer from top to bottom. In the system implementation stage, it is bottom-up, forming a complete information system development process through module links, and is developed in strict accordance with system analysis, system design, system implementation and other processes.
2. Prototype method
The prototype method means that system developers use powerful auxiliary system development tools to quickly develop a prototype based on a preliminary understanding of the user and demonstrate it to the user. The developer will modify the prototype based on the user's opinions and evaluations. This is repeated and gradually improved until the user is completely satisfied. Steps: ① Identify basic needs; ② Establish initial prototype; ③ System user evaluation; ④ Revise and improve prototype
3. Object-oriented method
Object-oriented technology encapsulates the properties (data) and processing (methods) of objects together. Through the inheritance of subclasses from parent classes, the software is easy to maintain and expand, and the reusability of the software is improved. Stages: ① System investigation and requirements analysis; ② Object-oriented analysis; ③ Object-oriented design; ④ Object-oriented programming
4.CASE method
Computer-aided software engineering (CASE) is a software development automation technology that supports the entire software development life cycle. It is a computer technology that supports the development of information systems from a developer's perspective. The functions are: ① Support different development methods; ② Support various stages of the software development life cycle; ③ Have document publishing functions and text and graphics editing functions; ④ Support development information resource sharing.
5. Test-driven development approach
Test-driven development (TDD) is a new software development idea and method that can effectively allow program developers to develop higher-quality, fully tested programs. The TDD method requires that before implementing the code of a certain function, the test code for the function should be written first, and then only the functional code that can pass the test should be written, and the entire software development process can be promoted through testing. Process: ①Testing; ②Implementation; ③Refactoring
3. Management of information systems
Information system management: It is the planning, control, maintenance and evaluation of the entire process of the development period, operation period and update period of the information system.
Information system project management, information system quality management, information system operation and maintenance management, information system evaluation
4. Information system security
(1) The meaning of information system security
Information system security: refers to the sum of ensuring the security of the information system structure, the security of elements related to the information system, as well as various related security technologies, security services and security management. It includes 4 levels of meaning: ⑴ information security; ⑵ computer security; ⑶ network security; ⑷ communication security
(2) Information system security architecture
⑴Protocol-based security architecture; ⑵Entity-based security architecture; ⑶Object-based security architecture; ⑷Agent-based information system security architecture
(3) Information technology security protection strategy
1. Non-technical security strategies
Non-technical security strategies refer to the use of legal guarantees, administrative management and other measures to prevent the occurrence of information system security incidents;
⑴ Legal guarantee: The current laws and regulations for information systems can be roughly divided into two categories: social norms and technical norms. These laws and standards are the basis and main guarantee for ensuring the security of information systems. ①Social norms are the norms of behavior that regulate people in information activities. Including confidentiality laws, data protection laws, computer security laws, computer crime laws, etc. ②Technical specifications refer to various technical standards and procedures, such as computer security standards, network security standards, operating system security standards, data and information security standards, etc.
⑵ Administrative management: It is a general administrative measure for safety management. It is a set of rules, regulations and functional structures established and formulated to maintain system security based on system practical activities. Mainly include: ① organization and personnel system; ② operation, maintenance and management system; ③ computer processing control and management system; ④ computer room security system, etc.
⑶Personnel training: For information system staff, such as terminal operators, system administrators, system designers, etc., because they are familiar with the functions and structure of the system, they pose a great threat to system security and must carry out comprehensive security training. Confidentiality education, professional ethics and legal education.
2. Technical security strategy
Technical security strategy refers to preventing the occurrence of information system security incidents by adopting technical means directly related to the system.
(4) Information system security technology
⑴Data encryption technology
Use technical means to turn important data into garbled (encrypted) transmission to prevent the data from being eavesdropped by others during the transmission process. Current encryption technologies include symmetric encryption and asymmetric encryption.
⑵Identity authentication technology
Identity authentication technology mainly uses one-time password technology and digital signatures to verify whether the user's identity is legitimate to reduce the risk of using public networks for data transmission.
⑶Firewall technology
Firewall is an isolation technology that forms a security barrier between internal and external networks by establishing effective access control for specific network segments and services.
⑷Switched virtual LAN technology
It is a technology that uses switches as the network center and uses virtual LAN logical topology to enhance network security.
⑸Data security storage technology
It refers to the technology of centralized backup management of data in a network environment to achieve backup automation and disaster recovery. There are three storage backup methods: full backup, incremental backup and differential backup. There are two methods of data recovery: hardware recovery and software recovery.
⑹Vulnerability detection technology
Vulnerability detection technology is a technology that checks network information systems to find system security vulnerabilities. Commonly used methods are passive and active.
⑺Intrusion detection technology
Intrusion detection technology is a technology that detects intrusion behavior by collecting and analyzing information on the protected system. Intrusion detection work is usually undertaken by a specific intrusion detection system (IDS).
Chapter 9 Information Organization and Its Management
1. Types and functions of information agencies
(1) The meaning and types of information institutions
1.The meaning of information organization
Information organization: It is an organizational form that implements information collection, processing, analysis, storage, transmission and other related information management activities. It is commonly found in various industrial forms and has an increasing status and role in the national economy and social development.
2. Types of information institutions
⑴In the study of the industrial structure of the national information sector, Porat divided the national economic sector into primary and secondary sectors based on whether information products or services enter the market for transactions.
① First-level information department: refers to those departments that provide price information products or services to the market and participate in market exchanges.
②Secondary information department: refers to those departments that provide information production and services to meet the internal consumption of the government or information enterprises.
⑵Based on Porat's division of information departments, we divide information organizations into:
①Independent information agencies: Corresponding to the first information department, its information products and services are oriented to external objects such as the market or the public, and are subdivided into two categories: public welfare information agencies and for-profit information agencies.
②Information organizations within the organization: Corresponding to the second information department, this type of organization is not an independent organization, but is attached to a specific organization. It exists as a department within the organization, and the information products and services it produces are only provided to the organization. , rather than for the market or the public.
(2) Functions of information agencies
1. Promote scientific and technological innovation
The construction of scientific and technological information resources is the foundation of scientific and technological information institutions and the core of information services. Scientific and technological information institutions can accelerate the construction of scientific and technological literature information resources and service platforms through innovative management systems and operating mechanisms, and realize the transformation from information services based on "indicating information sources" to "one-stop" networked information that is user-oriented and solves problems. Changes in service models.
2. Provide decision-making support to government departments
An information agency serving government departments, its functions are to promote and improve the information environment of government agencies; to meet the daily information needs of government agency personnel; to provide managers with an alternative channel for information collection and analysis, and to provide support for the work of government agencies support.
3. Service companies
In order to cope with the increasingly intensified market competition, companies have begun to pay attention to and carry out intelligence information work to support corporate strategic decision-making, technological innovation, market operations and other activities.
4. Support scientific research and development
The information work carried out by information institutions can provide three levels of support for scientific research activities: first, scientific and technological document protection, second, technical monitoring and analysis, and third, evaluation and evaluation of scientific research activities.
5. Other social functions
The work of information institutions plays an important role in the dissemination of knowledge in human society. Information institutions represented by libraries, as an important part of national cultural undertakings and social document information centers, play an important role in information literacy education and improving public information literacy.
2. Operation and management of information institutions
(1) The operating mechanism and operating efficiency of information institutions
Mechanism is the way and law of the operation and regulation of things themselves, which directly determines the operation efficiency of things themselves.
(2) Management strategies of information organizations
1. Strategic planning
Strategic planning is key to determining the direction of an information organization. Conceiving a successful strategic plan should follow the following coordinated steps: ⑴ determination of tasks; ⑵ evaluation of competitor behavior; ⑶ evaluation of external environment; ⑷ forecast of expenditures and income; ⑸ formulation of specific goals; ⑹ establishment of work specifications and feedback mechanism
2. Perform various management functions
The management of information organizations is to effectively plan, organize, lead and control the various resources owned by information organizations in a specific environment in order to achieve the established organizational goals-to serve information users. Specifically include: planning, organization, leadership, and control.
3. Organizational structure design of information organization oriented by process reengineering
At present, information organizations have developed into digital methods. Faced with the increasingly updated needs of information users, increasingly series of competition in the market, and increasingly frequent changes in the environment, information organizations need to completely change the traditional way of organizing work and replace the previous work processes with integrated work processes. Each door is separated and the fragments are bonded together in the crushing workflow.
4. Strengthen incentives
Information organizations can take the following measures to motivate employees: implement performance-linking and use promotion incentives.
5. From humanistic management to knowledge-based management
The so-called humanistic management is people-oriented, that is, trusting people, respecting people, caring for people, understanding people, motivating people, and cultivating people. To be knowledge-oriented is to consider how to motivate and utilize the knowledge of personnel so that personnel can make sustainable contributions, make personnel willing to use their knowledge for their work, and improve the operational efficiency of the entire information organization.
3. Public welfare information institutions and their management
(1) The nature and positioning of public welfare information institutions
Public welfare information organizations are defined as a type of non-profit organizations. Public welfare information services refer to services that mainly use free or low-cost services to allow users to obtain the information resources they need fairly and conveniently, but are not for profit. Their service benefits are mainly social benefits and indirect economic benefits.
Public welfare information institutions are mainly composed of the following three categories: ⑴ information centers, information institutes, etc. affiliated to various government departments; ⑵ libraries; ⑶ information research institutes and information service centers facing the society. It is gradually positioned as a non-profit-research information service organization.
(2) Business philosophy of public welfare information institutions
On the basis of adhering to its non-profit-research orientation, public welfare information institutions can have business ideas and profit-making behaviors. While mainly providing free information services, they can charge for high-end and value-added information products and services.
Having determined the rationality of implementing paid information services, public welfare information institutions also need to adopt a "market-oriented" strategy to keep abreast of market needs to meet the changing needs of customers. And convert funds obtained through various channels into resources to produce greater social benefits.
(3) Collaborative management of public welfare information institutions
Public information service: It is an open information service, that is, a social service that targets the public, including users from various industries, provides public information release, exchange and utilization services, and aims to serve the society.
Public information resources mainly have the following characteristics: ① universality ② basicity ③ openness ④ sharing ⑤ extensiveness
4. For-profit information institutions and their management
(1) The meaning and characteristics of profit-making information institutions
Profitable information institutions: also known as information service enterprises, are the product of information marketization and information commercialization. In order to meet the consumption needs of the information market, they engage in economic activities such as information production and circulation, conduct independent operations, and achieve independent economic accounting. A specialized economic unit with legal personality.
Features: ⑴Market-oriented service pattern; ⑵Flexibility of operation and management; ⑶Specialization of service items; ⑷Advanced technical means.
(2) Marketing management of for-profit information plans
1. Marketing strategies of for-profit information agencies
⑴ Information market segmentation; ⑵ Target market selection; ⑶ Target market positioning
2. Marketing strategies of for-profit information agencies
⑴Product and service strategy
⑵ Price strategy: ① High price skimming strategy; ② Low price penetration strategy; ③ Differential price strategy; ④ Free pricing strategy.
⑶Channel strategy
⑷Promotion strategy
(3) Development strategies of for-profit information institutions
1. Industrial cluster development model
Information industry cluster refers to the concentration of a large number of information industries and related supporting institutions within a certain range. It has the characteristics of regional agglomeration of institutions, strong innovation, high specialization of each enterprise, and collaboration and division of labor between enterprises.
The characteristics of the cluster development model of for-profit information institutions: ① Knowledge and technology intensive; ② Industrial chain; ③ Collaborative network; cluster ecological
2. Collaborative innovation and development methods
The key to collaborative innovation is to form a large-span integrated innovation organizational model with enterprises, universities, and research institutions as core elements, and innovation with collaborative interaction of multiple entities with financial institutions, intermediary organizations, innovation platforms, non-profit organizations, etc. as auxiliary elements. model. It is holistic and dynamic.
3. Ecosystem operating mechanism
The information industry cluster ecosystem is an industrial cluster ecosystem composed of natural subsystems, social subsystems and economic subsystems. In the ecosystem, for-profit information institutions and other ecological entities form an interlocking and interdependent ecological chain.
Chapter 10 Information Policy and Information Law
1. Information policy
(1) Basic issues of information policy
1. The meaning and content of information policy
⑴The meaning of information policy
National information policy: refers to the relevant information products and policies adopted by the state or government within a country to solve the problems that arise in the development of information management and information economy and involve public interests, rights and security, and to ensure the coordinated development of information activities. A series of measures, plans, principles or guidelines for resource production, circulation, utilization and distribution as well as promoting and promoting the development of related information technology.
⑵Scope of information policy
① Determine the information needs of society; ② Design appropriate methods to satisfy information needs; ③ Promote the effective use of information resources; ④ Human development of information professionals; ⑤ Determine necessary infrastructure; ⑥ Ensure continued financial support.
⑶Classification of information policies
① Judging from the history of information policy research, information policy can be roughly divided into four categories: normative frameworks and guidelines; specific research reports and consulting reports; general theoretical method research; and information policies related to the development of information technology.
② Divide from the subject and scope of application of information policy formulation: international information policy, regional information policy, national information policy, and local information policy.
③Divided from the fields of information policy specifications: economic information policy, scientific and technological information policy, educational information policy, cultural information policy, etc.
⑷The role of information policy
① Determine the development policy of social information activities, indicate the general direction of the information industry, and guide the development of the information industry.
② Mobilize or restrain social information power and promote the rational development and effective utilization of information resources.
③Coordinate the relationship between various departments of the information environment system and the external environment to ensure the healthy development of the social information environment.
④ Intervene in the construction of information resources and the utilization process of information resources.
⑤Standardize information work.
(2) Construction of national information policy system
1. Policy subject
The government is the main body of national information policy. In addition to the government, the main body of information policy shows diversified attributes. From the perspective of the universality of information systems and networks, information policy subjects may involve all aspects of society, from the state to local governments, from government departments in charge of various industries to social groups.
2. Policy objectives
Information policy goals refer to the interest coordination and distribution relationships achieved through the formulation of information policies. The current goals of my country's national information policy are: to implement new development concepts in depth, accurately grasp the general trend of international and domestic information development, fully implement major national strategic deployments, and coordinate the information work of various regions and departments.
3. Policy issues
Information policy issues refer to objective social issues that can be included in the policy-making agenda, that is, social contradictions and problems that information policy should resolve. Information policy issues in modern society are complex and interrelated, requiring comprehensive research on information policy as a whole in order to clarify the various complex relationships involved in information policy. Information policy issues are divided into economic issues and legal issues.
4. Policy content
The content of national information policy is simplified into two levels: economic and legal. ⑴Economic level: macro-economic issues of industrial development; micro-market economic issues. ⑵Legal level: intellectual property protection; free flow of information and information security.
5. Policy form
Policy form refers to the external manifestation of policy content, which is an effective tool and legal basis for publicizing and implementing policies. The form of information policy refers to the documents, guidelines, or regulations on the content of information policy formulated and issued by the state. In the context of building a socialist country under the rule of law, national information policies should be transformed into specific and operable laws and regulations for effective implementation and implementation.
(3) Overview of domestic and foreign information policies
2. Information Law
(1) Basic issues in information law
1. The meaning of information law
Information law: refers to the general term for legal norms that regulate social relations arising from information activities. The social relations here mainly involve interests, rights and security issues. These legal regulations involve information systems, organizations that process information, and individuals responsible for information. The main content of information law includes intellectual property law, information security law, information disclosure law, press, publication and communication law, telecommunications law, e-commerce law, laws related to computer crimes, etc.
2. The role of information law
⑴The normative role of information law (The objects affected by the specification vary)
①Guidance function: refers to the role of guiding and guiding the behavior of information subjects. Its object is the information subject's own behavior.
②Evaluation role: refers to the role of information law as the evaluation standard for the behavior of others by information subjects. Its object of action is the behavior of others.
③ Predictive function: It means that the information subject can predict in advance how people will behave with each other and the consequences of their behavior according to the provisions of information laws, so as to make reasonable arrangements for their own behavior. The object is the mutual behavior between people.
④ Enforcement: It is mainly reflected in the deterrence, punishment and prevention of information violations and criminal behaviors.
⑵The social role of information law
① Protect and promote the progress of science and technology: One of the purposes of establishing and improving information laws is to protect and promote the progress of information technology by confirming technical specifications, safeguarding the interests of technology inventors, and making it more beneficial to mankind.
② Protecting national interests and social public interests: This protective role in protecting the information rights of each information subject is fundamentally consistent with protecting basic human rights. It is an inevitable requirement for fully protecting information rights.
③Resolve various contradictions caused by informatization in a timely and appropriate manner: In the information society, people will face the efficiency and fairness that will inevitably arise from the information society. Information is relatively insufficient or excessive, and information quality is poor or false. The contradiction between profit-making and social welfare of information. The resolution of these contradictions cannot be separated from the social normative role of information law.
④ Promote the development of the information industry and the process of social informatization: Information law adapts to the development characteristics and technical characteristics of the information society and makes up for the shortcomings of traditional legal norms. It can not only provide guarantee for the improvement of economic efficiency and benefits, but also protect new social relations and new productivity. Objectively, it promotes the development and growth of the information industry. At the same time, we should effectively standardize and adjust new information society contradictions and remove obstacles to the development of information technology.
⑤ Promote the sound operation and coordinated development of economy and society: This is the deeper impact indirectly produced by information law based on the above-mentioned role, and this impact reflects the ultimate goal of information law.
⑶Objects to be adjusted by information laws
The adjustment object of information law is information relations, which refers to the social relations generated when realizing the information process. It includes two aspects: on the one hand, it is the social relations generated around the production, dissemination, collection, processing, storage, application, exchange, consumption and other information activities of information. On the other hand, there are a series of new social relations and social problems arising from the development of information technology.
Information legal relationship: refers to the rights and obligations relationships formed by information laws in the process of adjusting people's information activities. The constituent elements of information legal relationships mainly include three aspects: subject, object and content.
⑷Conflicts brought by informatization to society
①Contradiction between information freedom and information security
The right to freedom of information is an important right of citizens. Citizens can freely collect, process, handle, disseminate, store and utilize information in accordance with the law. The development of computer networks has provided broad space and convenient conditions for people to enjoy broader freedom of speech. However, freedom of information does not mean absolute freedom and abuse of information. In the network structure composed of modern information technology, the security of information systems is being seriously threatened. Its vulnerability is not only a technical issue, but also a legal rule issue.
②The contradiction between insufficient information and excessive information
The flow of information in human society is uneven. On the one hand, it cannot fully meet people's needs for information acquisition; on the other hand, there is also the problem of excess information. The phenomenon of information overload refers to the fact that in today's era of information explosion, it is difficult to tell the truth from the truth in front of the vast amount of information. When real accurate information and distorted fuzzy information are mixed together, the relative amount of information is reduced, so the excess of information brings about a new lack of information.
③The contradiction between the social welfare of information and individual profitability
The social welfare of information requires that it be widely, freely and openly provided to the public for use, which conflicts with the economic interests of information owners. The contradiction between efficiency and fairness manifests itself as the contradiction between personal interests and social interests. The contradiction between social sharing of information and individual profit can only be coordinated and resolved through legal means, taking into account both efficiency and fairness.
④Contradiction between information confidentiality and information disclosure
In the process of socialization, on the one hand, confidential information should be effectively protected, and on the other hand, what needs to be disclosed should be fully disclosed. If there is a lack of coordination between information confidentiality and disclosure, information that should be disclosed will not be disclosed, and information that should be kept confidential will not be effectively protected. In particular, government information and social welfare information need legal norms to promote their full disclosure without harming national interests.
⑤Contradictions in the circulation process of information commodities
There are many contradictions in the circulation process of information commodities. There are not only issues regarding the information rights of owners, producers, and disseminators of information products, but also property rights disputes such as ownership and licensing rights when information products are traded and transferred between information activity subjects. At the same time, each information product subject The conflict between personal interests and social interests is also quite prominent. At present, the information market still lacks complete and effective management regulations and operating mechanisms, and there is an urgent need to formulate corresponding laws to solve this problem.
(2) The relationship between information law and information policy
1. The difference between information law and information policy
⑴The agencies and procedures established by the two are different.
① Information laws are created in accordance with legal procedures by the country’s specialized legislative body, the National People’s Congress and the Standing Committee, or agencies with law-making powers such as the State Council and its ministries and commissions. Its legislative authority and creation procedures are strict and complex. Regulation.
②The formulation of information policies comes from many departments, and the procedures are not very strict. my country's information policies are formulated by departments and areas.
⑵The expression forms of the two are different.
① Information laws usually take the form of statutory laws, mainly including general principles of civil law, some separate laws, such as the "Works Law", "Patent Law", "Trademark Law", "Anti-Unfair Competition Law" and some separate regulations.
②Information policies often appear in the form of resolutions, decisions, orders, rules, regulations, opinions, notices, meeting minutes, etc. formulated and promulgated by state agencies.
⑶The adjustment range and methods of the two are different.
① In terms of scope, the social relations regulated by information policy are much broader than those regulated by information laws; while the social relations regulated by information laws are often those social relations that have a greater impact on the country and society in information activities.
② From a method perspective, information law generally adjusts relatively stable social relationships, so it focuses on the confirmation, protection or control of existing social relationships; information policy is a means of coping, which not only deals with existing problems, but also It needs to respond to problems that are forming or will arise, so it focuses on adopting flexible and diverse measures to adapt to the needs of the continuous development of the social situation.
⑷The degree of stability of the two is different.
① Information policy, as the guiding principle for social information activities, is often a macro-level call for policy. Flexibility is allowed in policy implementation, and as the information environment changes, it must be constantly revised, supplemented and improved. Due to its high dynamic nature, , so the timeliness is shorter.
②Information law is generally a relatively specific code of conduct determined based on certain experience gained after long-term implementation of information policies. It has a long time limit, and its formulation, modification or abolition must go through strict and complex legal procedures, which is quite stability.
⑸The essential attributes and functions of the two are different.
① Information policy represents the interests and significance of political organizations and does not have the attribute of coercion; while information law represents the interests and will of the country and has the attribute of coercion.
②Functionally speaking, the basic function of information policy is "guidance", that is, using administrative means to encourage and support social information activities to achieve the goals of information policy; while the basic function of information law is "restraint", that is, using legal means to restrict and restrict social information behavior to protect the healthy development of the information environment.
2. The connection between information policy and information law
⑴Information policy has a guiding role in information legislation.
Information policy guides all social information activities. Information legislation, as an important part of social information activities, is also inseparable from the guidance of information policy. At the same time, information policy is the basis for the formulation of information laws.
⑵Information policies need to be implemented based on information laws.
Information law is the most important means to realize the information policy of the country and the ruling party. Information law is mandatory, but information policy does not have this attribute. Therefore, without the embodiment and implementation of information law, we can only rely on the power and freedom of information policy itself. , often fails to achieve the economic and political goals it aims to achieve.
(3) Information legal system
Information legal system: refers to the structure and classification of information law, that is, what legal systems are included in information law, that is, the legal systems formulated by the state to regulate the acquisition, processing, dissemination, use, transfer and other information activities as well as all information activities in the field of information technology. All legal regulations arising from various interest issues and safety issues.
1. Information Basic Law
This law mainly stipulates the purpose, principles, adjustment objects and scope of information legislation, information, and the subjects and objects of legal relationships.
2. Information legal system and legal norms
Information technology legal system, information resource management legal system, information property legal system, information industry management legal system, information market management legal system, information organization organization and management legal system, information security, confidentiality and information crime legal system, information talent legal system, Legal system for international information cooperation and exchange.
(4) Legislative model of information law
1.Special legislative model
Based on the actual situation, some countries start with certain fields and pass special legislation to build a new legal normative system to adjust the development of informatization in this field.
2. Modify the applicable mode
Many developing countries with relatively backward information technology and relatively underdeveloped informatization mainly apply this legislative model. They basically only make supplementary changes to the existing backbone laws in the traditional legal departments that are relatively complete and universally applicable. Generally there is no separate separate legislation.
3. Cautious hold mode
Since the country's informatization development as a whole is in the process of rapid development, there are many specific issues that still need to be studied, explored and tested. Many countries currently adopt a "cautious" attitude, suspend legislation, and rely mainly on government policy guidance to promote the free development of national informatization.
3. Principles for the formulation of information policies and information laws
1.Pragmatic principle
The formulation of national information policies and laws must be based on national conditions, seek truth from facts, and be consistent with the current situation and actual development level of social informatization.
2. Principle of coordination
① The country’s original policies and laws and the newly formulated policies and laws should coordinate and cooperate with each other.
② Within the national information policy and laws, both vertical and horizontal information policies and laws can achieve coordination and reach consensus.
③ Information policy laws must be coordinated with the Constitution, that is, constitutional, and at the same time, they must be able to coordinate with the policies and laws of other fields such as economy, politics, science and technology, culture, military and other fields.
④ Try to be in line with international practices, that is, coordinate with international information policy laws.
3. Principle of balance
There are four basic contradictions in the information environment in which human beings live: ① Information freedom and information confidentiality; ② Insufficient information and excessive information; ③ Information disclosure and information security; ④ Social sharing of information (information sharing) and private possession of information The contradiction between sex (information exclusivity).
The state must face up to and solve this problem when formulating information policies and laws. It must fully embody the principle of balance, pay attention to the coordination of the interests of both conflicting parties, and find a balance of interests acceptable to both parties, even if both parties eventually reach equilibrium. state.
4. Principle of economy
Starting from the principle of economy, when formulating information policies and laws, the entire formulation cost should be less than the benefits that society obtains from it.
On the other hand, national information policy laws themselves should also have economic functions, that is, they can reduce transaction costs, improve economic benefits, provide incentive mechanisms, reduce uncertainty, internalize externalities, and promote cooperation.
5. Principle of flexibility
This principle requires that national information policies and laws not only serve as a summary, generalization and refinement of currently existing information activities, but also have the function of guiding and foreshadowing, and can foresee the cutting-edge development of information technology and the ever-changing changing trends of the information society. So that the existing national information policy laws can be supplemented and modified in a timely manner according to changes in the environment and the development of the times.
6. Principle of comprehensiveness
① In terms of quantity, the enacted information laws are far from meeting the growing legal needs of the information society; ② In terms of quality, the legal content is relatively broad, the provisions are not precise, and the operability is poor, which seriously affects the effectiveness of the law in judicial practice. efficiency. Therefore, in the process of formulating national information policies and laws, the principle of comprehensiveness must be actively implemented and conscientiously implemented, so as to build a systematic and comprehensive information policy and legal system.
7. Dynamic principle
Since information policy generally functions within a certain historical period, it has the characteristics of stages, flexibility and change; while information law is converted from mature policies and has strong stability. Therefore, compared with information laws, dynamic formulation principles are particularly evident in the formulation of national information policies.
8. Principle of focus
When formulating national information policies and laws, efforts should be concentrated on solving the key issues and key issues among the many problems faced. From a spatial perspective, it is necessary to implement the principle of comprehensiveness and involve all aspects. At the same time, it is necessary to clarify the direction, highlight the key points, and give priority to areas that currently require urgent policy formulation and legislation. From a time perspective, there should be different priorities in different development periods.
4. Information policy and law in the network environment
(1) Network information policy
1. The meaning and background of network information policy
2. Content framework of network information policy
Information network system construction and development policies, information network resource policies, network information industry policies, and network information market policies.
(2) Information law in the network environment
Network intellectual property, e-commerce, network privacy, network information security, artificial intelligence legal issues