WO2021128523A1 - 一种基于科技大数据的技术成熟度判断方法和系统 - Google Patents
一种基于科技大数据的技术成熟度判断方法和系统 Download PDFInfo
- Publication number
- WO2021128523A1 WO2021128523A1 PCT/CN2020/073118 CN2020073118W WO2021128523A1 WO 2021128523 A1 WO2021128523 A1 WO 2021128523A1 CN 2020073118 W CN2020073118 W CN 2020073118W WO 2021128523 A1 WO2021128523 A1 WO 2021128523A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- technology
- influence
- curve
- data
- technical
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
- G06Q10/06393—Score-carding, benchmarking or key performance indicator [KPI] analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
- G06N5/022—Knowledge engineering; Knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
- G06F16/2272—Management thereof
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2462—Approximate or statistical queries
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/283—Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
- G06F2216/11—Patent retrieval
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
Definitions
- the present invention relates to the technical field of document analysis, in particular to a method and system for judging technology maturity based on scientific and technological big data.
- Technological maturity is a summary of the law of technological maturity based on a large number of engineering practices.
- the evaluation of technological maturity is carried out concurrently in the process of scientific and technological engineering project development.
- technology maturity evaluation we can objectively assess the maturity level of key technologies, accurately grasp the research and development status of key technologies, and comprehensively analyze existing problems and gaps in order to strengthen the guiding role of engineering development.
- Technological maturity can not only guide the formulation of technological development routes and scientifically and rationally arrange scientific research activities in accordance with the laws of technological maturity, but also evaluate the degree to which the current technological development status meets the project’s expected goals, and can also be used as an export criterion for technology transfer and technology risk management. tool. Through the evaluation of technology maturity, we can accurately understand the current technology development status and the gap between the current technology status and the target status to provide a basis for formulating technology maturity plans.
- the current technology maturity evaluation methods mainly include the following three:
- Patent analysis method This method emphasizes the analysis of the quantity and quality of patents.
- the representative of this method is the patent analysis model based on Altshuller's TRIZ theory (invention problem solving theory).
- the TRIZ theory divides the technology life cycle into embryonic stage, There are four stages of growth, maturity, and decline. The technical performance and time relationship of these four stages are presented as an "S" curve.
- This method mainly describes each technology in terms of the number of patents, patent grades, performance and economic benefits. The characteristics of the stage. Firstly, the patent information is statistically analyzed based on the time axis, and then the obtained current technology S-curve is compared with the standard S-curve, so as to judge the development stage of the life cycle of the technology.
- TRIZ theory also includes a technology maturity evaluation model based on patent profitability and patent innovation level.
- patent profitability is affected by various factors such as the socio-economic environment and the level of business operations, which is difficult to accurately reflect Technological maturity, and the degree of patent innovation is usually by experts reading patent documents and subjectively judged, the workload is large and it is difficult to be objective and accurate.
- TRL Technology Readiness Levels
- the invention patent with publication number CN105184078A discloses a technology maturity evaluation method based on the analysis of the relative amount of patents.
- the absolute increase of patents of the technology to be analyzed is counted, and the absolute increase is calculated based on the increase multiple of the total number of patents and the quality level of patents. Revise to obtain the relative increase of patents, and perform curve fitting on the normalized relative increase of patents based on the year, and compare the fitted curve with the standard patent growth curve to determine the current technological maturity.
- the present invention includes the following steps: (a) search and count the absolute increase of patents of the technology to be analyzed; (b) calculate the relative increase of patents of the technology to be analyzed; (c) normalize the relative increase of patents; (d) Curve fitting is performed on the normalized value of the relative increase of patents.
- the disadvantage of this method is that it is too simple to judge the quality of patents.
- the present invention proposes a method and system for judging technology maturity based on big data of science and technology. It is mainly based on patent analysis method, supplemented by big data analysis of science and technology such as papers and projects. Through multi-dimensional evaluation indicators and algorithms, The technology points are placed in the related technology clusters for comprehensive analysis, and the technology big data indicators and technology maturity are mapped to realize automatic judgment. Different from the traditional expert review method, it requires a lot of manual subjective work. With objective authenticity.
- the first object of the present invention is to provide a method for judging technology maturity based on big data of science and technology, including the establishment of a database, an algorithm library and an index library, and the following steps:
- Step 1 Perform data retrieval in the database
- Step 2 Perform data calculation and sorting on the search results
- Step 3 Perform regression calculation on the sorted data to obtain technical maturity indicators
- Step 4 Corresponding to the technical maturity index, sum up and obtain a judgment conclusion.
- the database is a multi-dimensional science and technology database
- the multi-dimensional science and technology database includes at least one of a patent library, a paper library, a project library, and a news library.
- the algorithm library includes a science and technology knowledge graph and/or technology cluster, and the science and technology knowledge graph and/or technology cluster is used to define technology points, technology clusters, and related technical content.
- the indicators in the indicator library include at least one indicator among technical influence indicators, immediacy indicators of technical influence, and the proportion of core node patented technologies.
- the step 1 includes the following sub-steps:
- Step 11 Using the existing science and technology knowledge map, input the name of the technical point to be analyzed and evaluated, and determine the multiple key characteristics of the technical point to be analyzed and evaluated through the connection and step distance between the technical points to be analyzed and evaluated Word meaning
- Step 12 Perform overall data retrieval on the keyword semantics obtained in step 11 in the paper, project, and news database;
- Step 13 Intersecting the keyword semantics obtained in step 11 with the existing technology cluster, and performing an overall data search on the result of the intersection in the patent database;
- Step 14 Integrate the data retrieval results of the step 12 and the step 13.
- the step 2 includes the following sub-steps:
- Step 21 Perform data cleaning and disambiguation on at least one objective big data of patents, papers, and projects, and perform computer sentiment calculations on non-objective big data of news, and calculate the authenticity to retain reliable results;
- Step 22 Draw a basic coordinate curve diagram according to different ordinates, the abscissa of the basic coordinate curve diagram is time, and the ordinate is at least one of the number of patent applications, the number of patent authorizations, the number of partition papers, the number of partition projects, and news ;
- Step 23 Draw a reference coordinate graph according to different ordinates, the abscissa of the reference coordinate graph is time, and the ordinate is at least one indicator of the technical influence, the immediacy of the technical influence, and the core node patent .
- the step 3 includes the following sub-steps:
- Step 33 Fit the curve model with reference to the general method of least squares.
- the judgment conclusion includes judging that the technological maturity is in one of the following stages: the budding stage, the growth stage, the mature stage, and the decline stage.
- the main body of the ordinate data in the basic coordinate curve is the number of the partition theory, it is determined as the budding stage, and the budding stage includes the early stage of the budding period and the direction of the budding period. Long-term phase.
- a 1 in the technical influence curve of the paper is less than 0;
- a 1 in the instantaneous curve of technical influence of the paper is less than 0;
- a 1 is less than 0, and a 2 is close to 0 or less than 0;
- a 1 is close to or less than 0, and a 2 is less than 0.
- a 1 in the technical influence curve of the paper is less than 0;
- a 1 in the instantaneous curve of technical influence of the paper is less than 0;
- a 1 is less than 0, and a 2 is close to 0 or less than 0;
- a 1 is close to or less than 0, and a 2 is less than 0.
- a 1 in the influence curve of the patented technology is less than -1;
- a 1 in the patent curve of the core node is less than -1.
- the second objective of the present invention is to provide a technology maturity judgment system based on big data of science and technology, including a database, an algorithm library and an index library, and the following modules:
- Data retrieval module used for data retrieval in the database
- Data sorting module used to calculate and sort the data of the search results
- Data calculation module used to perform regression calculation on the sorted data to obtain technical maturity indicators
- Summarizing and judging module for corresponding to the technical maturity index, summarizing and obtaining a judgment conclusion.
- the database is a multi-dimensional science and technology database
- the multi-dimensional science and technology database includes at least one of a patent library, a paper library, a project library, and a news library.
- the algorithm library includes a science and technology knowledge graph and/or technology cluster, and the science and technology knowledge graph and/or technology cluster is used to define technology points, technology clusters, and related technical content.
- the indicators in the indicator library include at least one of the technical influence index, the immediacy index of technical influence, and the proportion of core node patent technology.
- the data retrieval module includes the following sub-modules:
- Data mining sub-module used to retrieve the semantics of the keywords obtained in the overall search sub-module in the papers, projects, and news databases;
- Designated retrieval sub-module used to intersect the keyword semantics obtained in the overall retrieval sub-module with existing technology clusters, and perform overall data retrieval in the patent database with the result of the intersection;
- the data sorting module includes the following sub-modules:
- Data cleaning sub-module used for data cleaning and disambiguation of at least one objective big data of patents, papers and projects, computer sentiment calculations on non-objective big data of news, and calculation of authenticity and retention of reliable results
- curve drawing sub-module Used to draw basic coordinate graphs according to different ordinates. The abscissa of the basic coordinate graphs is time, and the ordinates are at least one of the number of patent applications, the number of patent authorizations, the number of partition papers, the number of partition projects, and news. ;
- Graph revision sub-module used to draw a reference coordinate graph according to different ordinates, the abscissa of the reference coordinate graph is time, and the ordinate is the technical influence, the immediacy of the technical influence and the core node patent At least one indicator in.
- the work of the data calculation module includes the following steps:
- Step 33 Fit the curve model with reference to the general method of least squares.
- the judgment conclusion includes judging that the technological maturity is in one of the following stages: the budding stage, the growth stage, the mature stage, and the decline stage.
- the main body of the ordinate data in the basic coordinate curve is the number of the partition theory, it is determined as the budding stage, and the budding stage includes the early stage of the budding period and the direction of the budding period. Long-term phase.
- a 1 in the technical influence curve of the paper is less than 0;
- a 1 in the instantaneous curve of technical influence of the paper is less than 0;
- a 1 is less than 0, and a 2 is close to 0 or less than 0;
- a 1 is close to or less than 0, and a 2 is less than 0.
- a 1 in the technical influence curve of the paper is less than 0;
- a 1 in the instantaneous curve of technical influence of the paper is less than 0;
- a 1 is less than 0, and a 2 is close to 0 or less than 0;
- a 1 is close to or less than 0, and a 2 is less than 0.
- a 1 in the influence curve of the patented technology is less than -1;
- a 1 in the patent curve of the core node is less than -1.
- the present invention proposes a method and system for judging technology maturity based on big data of science and technology, which overcomes the shortcomings of existing evaluation technology, especially the subjectivity of experts, and provides a method for assessing technology maturity based on big data of science and technology such as patents. , It is more objective and authentic, and can realize the semi-automatic calculation of the machine.
- FIG. 1 is a flowchart of a preferred embodiment of a method for judging technology maturity based on big data of science and technology according to the present invention.
- FIG. 1A is a flowchart of the data retrieval method of the embodiment shown in FIG. 1 of the method for judging technology maturity based on technological big data according to the present invention.
- FIG. 1B is a flowchart of the data sorting method of the embodiment shown in FIG. 1 of the method for judging technology maturity based on technological big data according to the present invention.
- FIG. 1C is a flowchart of the regression calculation method of the embodiment shown in FIG. 1 of the method for judging technology maturity based on technological big data according to the present invention.
- Fig. 2 is a block diagram of a preferred embodiment of a technology maturity judging system based on technological big data according to the present invention.
- step 100 is executed to establish a database, algorithm library, and indicator library.
- the database is a multi-dimensional science and technology database, the multi-dimensional science and technology database includes at least one of a patent library, a paper library, a project library, and a news library;
- the algorithm library includes a science and technology knowledge map and/or a technology cluster, and the science and technology knowledge map and/or a technology cluster It is used to define technology points, technology clusters and their related technology content;
- the indicators in the index library include at least one of the technology influence index, the immediacy index of technological influence and the proportion of core node patent technology.
- Step 110 is executed to perform data retrieval in the database.
- step 111 is performed, using the existing scientific and technological knowledge map, input the name of the technical point to be analyzed and evaluated, and determine the technical point to be analyzed and evaluated based on the connection and step distance between the technical points to be analyzed and evaluated The semantics of multiple keywords for the feature of the point.
- Step 112 is executed to search the overall data of the keyword semantics obtained in step 111 in the paper, project, and news database.
- Step 113 is executed to intersect the keyword semantics obtained in step 111 with the existing technology cluster, and perform an overall data search on the result of the intersection in the patent database. Perform step 114 to integrate the data retrieval results of step 112 and step 113
- Step 120 is executed to perform data calculation and sorting on the search results.
- step 121 is performed, using existing public technology to clean and disambiguate at least one objective big data of patents, papers, and projects, and using existing public technology to perform computer sentiment calculations on news non-objective big data , The veracity of the calculation retains the reliable result.
- Step 122 is performed to draw a basic coordinate curve chart according to different ordinates.
- the abscissa of the basic coordinate curve chart is time, and the ordinate is at least one of the number of patent applications, the number of patent authorizations, the number of partition papers, the number of partition projects, and the news. data.
- Step 123 is executed to draw a reference coordinate graph according to different ordinates, the abscissa of the reference coordinate graph is time, and the ordinate is at least one of the technical influence, the immediacy of the technical influence, and the core node patent index.
- Step 130 is performed to perform regression calculation on the sorted data to obtain a technical maturity index.
- Step 133 is executed to fit the curve model according to the general method of least square method.
- Step 140 is executed, corresponding to the technical maturity index, and the judgment conclusion is obtained by summarizing. The judgment conclusion includes judging that the technological maturity is in one of the following stages: the budding stage, the growth stage, the maturity stage, and the decline stage.
- the judgment method is as follows:
- the main body of the ordinate data in the basic coordinate curve is the number of the zoning theory, it is judged as the germination stage, and the germination stage includes the early stage of the germination and the stage of the germination to the growth stage.
- a 1 is less than 0, and a 2 is close to 0 or less than 0;
- a 1 in the technical influence curve of the paper is less than 0;
- a 1 in the instantaneous curve of technical influence of the paper is less than 0;
- a 1 is less than 0, and a 2 is close to 0 or less than 0;
- a 1 is close to or less than 0, and a 2 is less than 0.
- a 1 in the influence curve of the patented technology is less than -1;
- a 1 in the patent curve of the core node is less than -1.
- a technology maturity judgment system based on technological big data includes a database 200, an algorithm library 210, an index library 220, a data retrieval module 230, a data sorting module 240, a data calculation module 250, and a summary judgment module 260.
- the database 200 is a multi-dimensional science and technology database, and the multi-dimensional science and technology database includes at least one of a patent library, a paper library, a project library, and a news library.
- the algorithm library 210 includes a science and technology knowledge graph and/or technology cluster, and the science and technology knowledge graph and/or technology cluster are used to define technology points, technology clusters, and related technical content.
- the indicators in the indicator library 220 include at least one indicator of the technical influence indicator, the immediacy indicator of the technical influence, and the proportion of core node patent technologies.
- Data retrieval module 230 used to retrieve data in the database, including the following sub-modules:
- Data mining sub-module used to retrieve the semantics of the keywords obtained in the overall search sub-module in the papers, projects, and news databases;
- Designated retrieval sub-module used to intersect the keyword semantics obtained in the overall retrieval sub-module with existing technology clusters, and perform overall data retrieval in the patent database with the result of the intersection; comprehensive sub-module : Used to integrate the data retrieval results of the data mining submodule and the designated retrieval submodule.
- Data sorting module 240 used to perform data calculation and sorting on the retrieval results. Including the following sub-modules:
- Data cleaning sub-module used to clean and disambiguate at least one objective big data of patents, papers, and projects by using existing public technology, and use existing public technology to perform computer emotion calculation on non-objective big data of news, and calculate the truth Reliable results are retained.
- Graph drawing sub-module used to draw basic coordinate graphs according to different ordinates.
- the abscissa of the basic coordinate graph is time, and the ordinates are the number of patent applications, the number of patent authorizations, the number of partition papers, the number of partition projects, and the news At least one type of data in.
- Graph revision sub-module used to draw a reference coordinate graph according to different ordinates, the abscissa of the reference coordinate graph is time, and the ordinate is the technical influence, the immediacy of the technical influence and the core node patent At least one indicator in.
- Data calculation module 250 used to perform regression calculation on the sorted data to obtain a technical maturity index.
- the work of the data calculation module includes the following steps:
- Step 33 Fit the curve model with reference to the general method of least squares.
- the summary judgment module 260 used to correspond to the technical maturity index and summarize to obtain a judgment conclusion.
- the judgment conclusion includes the following situations:
- the budding stage which includes the early stage of the budding stage and the stage of the budding stage to the growth stage.
- a 1 in the technical influence curve of the paper is less than 0;
- a 1 in the instantaneous curve of technical influence of the paper is less than 0;
- a 1 is less than 0, and a 2 is close to 0 or less than 0;
- a 1 is close to or less than 0, and a 2 is less than 0.
- a 1 in the technical influence curve of the paper is less than 0;
- a 1 in the instantaneous curve of technical influence of the paper is less than 0;
- a 1 is less than 0, and a 2 is close to 0 or less than 0;
- a 1 is close to or less than 0, and a 2 is less than 0.
- a 1 in the influence curve of the patented technology is less than -1;
- a 1 in the patent curve of the core node is less than -1.
- the invention is mainly based on the patent analysis method, supplemented by the analysis of scientific and technological big data such as papers, projects, etc., through multi-dimensional evaluation indicators and algorithms, the technical points are placed in the related technology clusters for comprehensive analysis, and the scientific and technological big data indicators Establishing a mapping with technological maturity and realizing automatic judgment are different from the traditional expert review method, which requires a lot of manual subjective work and is more objective and authentic.
- the technical solution of the present invention overcomes the shortcomings of the existing evaluation technology, especially the subjectivity of experts, and provides a method for evaluating technology maturity based on technological big data such as patents, which is more objective and authentic, and can realize the semi-automation of the machine
- the calculation steps are as follows:
- Step 1 Establish a database, algorithm library and indicator library
- the step 1 includes the following sub-steps:
- Step 11 Establish a multi-dimensional science and technology database, covering patent database, paper database (division, including SCI and EI), project database (division, including basic research projects, applied research projects and industrial promotion projects), and news database;
- Step 12 Establish an algorithm library, which specifically covers science and technology knowledge graphs and technology clusters, comprehensively define the connotation and extension of technology points and technology clusters, and quantify the mutual influence nodes of technology;
- Step 13 Establish an index database, which specifically covers patent indexes, paper indexes, project indexes, and news indexes;
- the step 13 includes the following indicators:
- Proportion of core node patent technology using complex technology network algorithms, under the technology cluster, the proportion of the value of the patent influence of the specified technology under the same technology cluster;
- Step 2 Construct a search formula, and perform data retrieval in the multi-dimensional science and technology database described in Step 11;
- the step 2 includes the following sub-steps:
- Step 21 Use existing public technology to clean and disambiguate at least one kind of objective big data from patents, papers and projects;
- Step 22 Draw a basic coordinate curve diagram according to different ordinates.
- the abscissa of the basic coordinate curve diagram is time, and the ordinate is at least one of the number of patent applications, the number of patent authorizations, the number of partition papers, the number of partition projects, and news. ;
- the number of zoning papers mainly includes two types: the number of SCI papers and the number of EI papers.
- SCI papers refer to citation index journals edited and published by the Institute of Scientific Information (ISI). 3,300 scientific and technological journals are selected from tens of thousands of journals around the world, covering more than 100 fields of natural basic science. EI papers refer to the documents included in a database Engineering Index worldwide, which mainly include important documents in the field of engineering technology.
- Step 23 Draw a reference coordinate graph according to different ordinates, the abscissa of the basic coordinate graph is time, and the ordinate is at least one indicator of the technical influence, the immediacy of the technical influence, and the core node patent .
- Step 3 Perform data calculation and sorting
- the step 3 includes the following sub-steps:
- Step 31 Use existing public technology to clean and disambiguate at least one objective big data of patents, papers, and projects; use existing public technology to perform computer sentiment calculations on news non-objective big data to calculate authenticity retention Reliable results
- Step 32 Draw a basic coordinate curve chart, the abscissa of the basic coordinate curve chart is time, and the ordinate is at least one of the number of patent applications, the number of patent authorizations, the number of partition papers, the number of partition projects, and news;
- Step 33 Draw a reference coordinate graph, the abscissa of the basic coordinate graph is time, and the ordinate is at least one indicator of the technical influence, the immediacy of the technical influence, and the core node patent.
- Step 4 Perform algorithm regression
- the step 4 includes the following sub-steps:
- Step 43 The model of the fitted curve refers to the general method of least squares, that is, the general method as follows:
- the residual value of the fitted curve is:
- Step 5 Corresponding indicators, as shown in Table 1, summarize the following conclusions:
- the influence of patent technology is gradually approaching the technological influence of papers, and the proportion of core node patents is increasing; at this time, the technological influence of papers a 1 is close to 0; the immediacy of technological influence of papers a 1 is less than 0, and a 2 is close to 0 or greater than 0;
- the patent technology influence a 1 is greater than 0, a 2 is close to 0 or less than 0; the immediacy of patent technology influence a 1 is less than 0, a 2 is close to 0 or less than 0; the core node patent a 1 is greater than 0, and a 2 is gradually close to 0 .
- the technical influence a 1 of the thesis is less than -1; the patent technical influence a 1 is less than -1; the core node patent a 1 is less than -1.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Business, Economics & Management (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Human Resources & Organizations (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- Economics (AREA)
- Strategic Management (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Quality & Reliability (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Entrepreneurship & Innovation (AREA)
- Development Economics (AREA)
- Computing Systems (AREA)
- Evolutionary Computation (AREA)
- Operations Research (AREA)
- Game Theory and Decision Science (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Probability & Statistics with Applications (AREA)
- Educational Administration (AREA)
- Fuzzy Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
一种基于科技大数据的技术成熟度判断方法和系统,其中方法包括:建立数据库、算法库和指标库(100),在所述数据库中进行数据检索(110);对检索结果进行数据计算整理(120);对整理好的数据进行回归计算,得到技术成熟度指标(130);对应所述技术成熟度指标,汇总得到判断结论(120)。上述方法基于专利分析法为主,辅助以论文、项目等科技大数据分析,通过多维度评价指标和算法,将技术点放在与之相关联的技术集群中进行综合分析,将科技大数据指标与技术成熟度建立映射,实现自动化判断,有别于传统的专家评审法等需大量依靠人工主观性工作,更具客观真实性。
Description
本发明涉及文献分析的技术领域,特别是一种基于科技大数据的技术成熟度判断方法和系统。
技术成熟度,是人们在大量工程实践基础上,对技术成熟规律的总结。为了加强科技工程项目的科学决策的管理能力,提高科研队伍应对技术风险的技术水平,有效控制工程的技术风险,在科技工程项目研制过程中并行开展技术成熟度评价工作。通过技术成熟度评价,客观评定关键技术成熟度等级,准确把握关键技术研发状态,综合分析存在的问题和差距,以期加强对工程研制的指导作用。技术成熟度,不仅可以指导按照技术成熟规律制定技术发展路线、科学合理安排科研活动,也可以评价当前技术发展状态对于项目预期目标的满足程度,还可以作为技术转移的出口准则和技术风险管理的工具。通过技术成熟度评价,准确认识当前技术的发展状态,以及当前技术状态与目标状态的差距,为制定技术成熟计划提供基础。
目前技术成熟度评价的方法主要包括以下三种:
(1)专家评审法,这种方法较为常用,但其准确性对专家的主观判断有较大的依赖,受专家的认识水平、情绪和主观印象影响很大,评价含糊而且容易受部门利益或个人偏见的影响,缺乏规范而客观的评价标准,难以有效地支撑决策。
(2)专利分析法,该方法强调对专利数量及质量的分析,该方法的代表是基于Altshuller的TRIZ理论(发明问题解决理论)的专利分析模型,TRIZ理论将技术生命周期分为萌芽期、成长期、成熟期和衰退期四个阶段,这四个阶段的技术性能和时间关系呈现为“S”曲线,这一方法主要从专利数量、专利等级、性能和经济收益四个方面描述技术各个阶段的特征。首先对专利信息基于时间轴进行统计分析,然后将所得到的当前技术的S曲线与标准的S曲线进行对比,从而判断该技术所处在的生命周期的发展阶段。TRIZ理论还包括基于专利获利能力及专利创新程度的技术成熟度评估模型,然而,在实践中发现,专利获利能力受到社会经济大环境、企业经营水平等多种因素的影响,难以准确反映技术成熟度,而专利创新程度通常由专家阅读专利文献并主观判定,工作量大且难以做到客观准确。
(3)技术成熟度标准(TechnologyReadinessLevels,TRL),是一种比较系统的技术成熟度评价标准,由美国航空航天局(NASA)于1995年首先提出并在航天领域应用,随后被美国及英国的国防部采纳应用。TRL按技术发展过程将技术的成熟度划分为九级:TRL1~TRL9。这一方法需要全面考察该项技术的各项活动,从技术状态、技术在系统中的集成度、进行演示或验证的环境三方面属性综合判断技术成熟度,需要大量的专家现场调研工作,存在主观性,此外该方法存在笼统地描述技术是如何成熟的,由于TRL的定义由于要体现它的通用性,无法体现专用性,一个评价检查单也难以适应属性变化的多种类型,难以清晰的区别各级的准则。
公开号为CN105184078A的发明专利公开了一种基于专利相对量分析的技术成熟度评价方法,通过对待分析技术的专利绝对增长进行统计,并基于专利总量增长倍数及专利质量等级对绝对增长量进行修正,得到专利相对增长量,并基于年份对归一化的专利相对增长量进行曲线拟合,将拟合曲线和标准专利增长量曲线图进行对比,从而确定当前的技术成熟度。本发明包括以下步骤:(a)检索统计待分析技术的专利绝对增长量;(b)计算待分析技术的专利相对增长量;(c)对专利相对增长量进行归一化处理;(d)对专利相对增长量归一化值进行曲线拟合。该方法的缺点是在专利质量判断上过于简单。
发明内容
为了解决上述的技术问题,本发明提出基于科技大数据的技术成熟度判断方法和系统,基于专利分析法为主,辅助以论文、项目等科技大数据分析,通过多维度评价指标和算法,将技术点放在与之相关联的技术集群中进行综合分析,将科技大数据指标与技术成熟度建立映射,实现自动化判断,有别于传统的专家评审法等需大量依靠人工主观性工作,更具客观真实性。
本发明的第一目的是提供了一种基于科技大数据的技术成熟度判断方法,包括建立数据库、算法库和指标库,还包括以下步骤:
步骤1:在所述数据库中进行数据检索;
步骤2:对检索结果进行数据计算整理;
步骤3:对整理好的数据进行回归计算,得到技术成熟度指标;
步骤4:对应所述技术成熟度指标,汇总得到判断结论。
优选的是,所述数据库为多维科技数据库,所述多维科技数据库包括专利库、 论文库、项目库和新闻库中至少一种。
在上述任一方案中优选的是,所述算法库包括科技知识图谱和/或技术集群,所述科技知识图谱和/或技术集群用于定义技术点、技术集群和与其相关技术内容。
在上述任一方案中优选的是,所述指标库中的指标包括技术影响力指标、技术影响力即时性指标和核心节点专利技术占比中至少一种指标。
在上述任一方案中优选的是,所述步骤1包括以下子步骤:
步骤11:利用已有的科技知识图谱,输入待分析评价技术点的名称,通过所述待分析评价技术点之间的联系与步距,确定所述待分析评价技术点的特征的多个关键词语义;
步骤12:将所述步骤11得到的所述关键词语义,在论文、项目、新闻库中,进行整体数据检索;
步骤13:将所述步骤11得到的所述关键词语义,与已有的技术集群进行交集,将所述交集的结果在专利库中进行整体数据检索;
步骤14:将所述步骤12和所述步骤13的数据检索结果综合。
在上述任一方案中优选的是,所述步骤2包括以下子步骤:
步骤21:对专利、论文和项目中至少一种客观大数据进行数据清洗和消歧,对新闻非客观大数据进行计算机情感计算,推算真实性保留可靠结果;
步骤22:根据不同纵坐标绘制基本坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为专利申请数量、专利授权数量、分区论文数量、分区项目数量和新闻中至少一种数据;
步骤23:根据不同纵坐标绘制参照坐标曲线图,所述参照坐标曲线图的横坐标为时间,纵坐标为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标。
在上述任一方案中优选的是,所述步骤3包括以下子步骤:
步骤31:拟合曲线f
1(x)=a
1x+b
1,其中,x为时间,f
1(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,a
1为相应指标的变化斜率,b
1为曲线截距;
步骤32:拟合曲线f
2(x)=a
2x
2+b
2x+c,其中x为时间,f
2(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,其中,a
2为相应指 标的变化斜率的增长率,b
2为拟合一次项参数,c为曲线截距;
步骤33:参照最小二乘法通用方法拟合曲线的模型。
在上述任一方案中优选的是,所述判断结论包括判定技术成熟度处于以下阶段中的一种:萌芽期阶段、成长期阶段、成熟期阶段和衰退期阶段。
在上述任一方案中优选的是,当所述基本坐标曲线中的纵坐标数据主体为所述分区论数量时,判定为萌芽期阶段,所述萌芽期包括萌芽期早期阶段和萌芽期向成长期阶段。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期早期阶段:
1)所述论文技术影响力大于所述专利技术影响力,或所述专利技术影响力为零但所述论文技术影响力为正值;
2)所述论文技术影响力曲线中的a
1大于1,a
2大于0。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期向成长期阶段:
1)论文逐步从SCI论文向EI论文过渡,所述论文技术影响力大于所述专利技术影响力;
2)所述论文技术影响力曲线中的a
1大于0,a
2接近0或小于0,增速下降;
3)所述论文技术影响力即时性曲线中的a
1小于0,a
2接近0或大于0;
4)所述专利技术影响力曲线中的a
1大于0,a
2大于0;
5)所述核心节点专利曲线中的a
1大于0,a
2大于0。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述成长期阶段:
1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;
2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;
3)所述论文技术影响力曲线中的a
1小于0;
4)所述论文技术影响力即时性曲线中的a
1小于0;
5)所述专利技术影响力曲线中的a
1接近或小于0,a
2接近0或小于0;
6)所述专利技术影响力即时性曲线中的a
1小于0,a
2接近0或小于0;
7)所述核心节点专利曲线中的a
1接近或小于0,a
2小于0。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述成熟期阶段:
1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;
2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;
3)所述论文技术影响力曲线中的a
1小于0;
4)所述论文技术影响力即时性曲线中的a
1小于0;
5)所述专利技术影响力曲线中的a
1接近或小于0,a
2接近0或小于0;
6)所述专利技术影响力即时性曲线中的a
1小于0,a
2接近0或小于0;
7)所述核心节点专利曲线中的a
1接近或小于0,a
2小于0。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述衰退期阶段:
1)所述新闻数据为主体,专利与论文数据均下降;
2)所述论文技术影响力曲线中的a
1小于-1;
3)所述专利技术影响力曲线中的a
1小于-1;
4)所述核心节点专利曲线中的a
1小于-1。
本发明的第二目的是提供了一种基于科技大数据的技术成熟度判断系统,包括数据库、算法库和指标库,还包括以下模块:
数据检索模块:用于在所述数据库中进行数据检索;
数据整理模块:用于对检索结果进行数据计算整理;
数据计算模块:用于对整理好的数据进行回归计算,得到技术成熟度指标;
汇总判断模块:用于对应所述技术成熟度指标,汇总得到判断结论。
优选的是,所述数据库为多维科技数据库,所述多维科技数据库包括专利库、论文库、项目库和新闻库中至少一种。
在上述任一方案中优选的是,所述算法库包括科技知识图谱和/或技术集群,所述科技知识图谱和/或技术集群用于定义技术点、技术集群和与其相关技术内容。
在上述任一方案中优选的是,所述指标库中的指标包括技术影响力指标、技 术影响力即时性指标和核心节点专利技术占比中至少一种指标。
在上述任一方案中优选的是,所述数据检索模块包括以下子模块:
整体检索子模块:用于利用已有的科技知识图谱,输入待分析评价技术点的名称,通过所述待分析评价技术点之间的联系与步距,确定所述待分析评价技术点的特征的多个关键词语义;
数据挖掘子模块:用于将所述整体检索子模块中得到的所述关键词语义,在论文、项目、新闻库中,进行整体数据检索;
指定检索子模块:用于将所述整体检索子模块中得到的所述关键词语义,与已有的技术集群进行交集,将所述交集的结果在专利库中进行整体数据检索;
综合子模块:用于将所述数据挖掘子模块和所述指定检索子模块的数据检索结果综合。
在上述任一方案中优选的是,所述数据整理模块包括以下子模块:
数据清洗子模块:用于对专利、论文和项目中至少一种客观大数据进行数据清洗和消歧,对新闻非客观大数据进行计算机情感计算,推算真实性保留可靠结果;曲线图绘制子模块:用于根据不同纵坐标绘制基本坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为专利申请数量、专利授权数量、分区论文数量、分区项目数量和新闻中至少一种数据;
曲线图修订子模块:用于根据不同纵坐标绘制参照坐标曲线图,所述参照坐标曲线图的横坐标为时间,纵坐标为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标。
在上述任一方案中优选的是,所述数据计算模块的工作包括以下步骤:
步骤31:拟合曲线f
1(x)=a
1x+b
1,其中,x为时间,f
1(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,a
1为相应指标的变化斜率,b
1为曲线截距;
步骤32:拟合曲线f
2(x)=a
2x
2+b
2x+c,其中x为时间,f
2(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,其中,a
2为相应指标的变化斜率的增长率,b
2为拟合一次项参数,c为曲线截距;
步骤33:参照最小二乘法通用方法拟合曲线的模型。
在上述任一方案中优选的是,所述判断结论包括判定技术成熟度处于以下阶段中的一种:萌芽期阶段、成长期阶段、成熟期阶段和衰退期阶段。
在上述任一方案中优选的是,当所述基本坐标曲线中的纵坐标数据主体为所述分区论数量时,判定为萌芽期阶段,所述萌芽期包括萌芽期早期阶段和萌芽期向成长期阶段。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期早期阶段:
1)所述论文技术影响力大于所述专利技术影响力,或所述专利技术影响力为零但所述论文技术影响力为正值;
2)所述论文技术影响力曲线中的a
1大于1,a
2大于0。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期向成长期阶段:
1)论文逐步从SCI论文向EI论文过渡,所述论文技术影响力大于所述专利技术影响力;
2)所述论文技术影响力曲线中的a
1大于0,a
2接近0或小于0,增速下降;
3)所述论文技术影响力即时性曲线中的a
1小于0,a
2接近0或大于0;
4)所述专利技术影响力曲线中的a
1大于0,a
2大于0;
5)所述核心节点专利曲线中的a
1大于0,a
2大于0。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述成长期阶段:
1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;
2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;
3)所述论文技术影响力曲线中的a
1小于0;
4)所述论文技术影响力即时性曲线中的a
1小于0;
5)所述专利技术影响力曲线中的a
1接近或小于0,a
2接近0或小于0;
6)所述专利技术影响力即时性曲线中的a
1小于0,a
2接近0或小于0;
7)所述核心节点专利曲线中的a
1接近或小于0,a
2小于0。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述成熟期阶段:
1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲 线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;
2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;
3)所述论文技术影响力曲线中的a
1小于0;
4)所述论文技术影响力即时性曲线中的a
1小于0;
5)所述专利技术影响力曲线中的a
1接近或小于0,a
2接近0或小于0;
6)所述专利技术影响力即时性曲线中的a
1小于0,a
2接近0或小于0;
7)所述核心节点专利曲线中的a
1接近或小于0,a
2小于0。
在上述任一方案中优选的是,当满足下述判断条件时,判定所述技术成熟度处于所述衰退期阶段:
1)所述新闻数据为主体,专利与论文数据均下降;
2)所述论文技术影响力曲线中的a
1小于-1;
3)所述专利技术影响力曲线中的a
1小于-1;
4)所述核心节点专利曲线中的a
1小于-1。
本发明提出了一种基于科技大数据的技术成熟度判断方法和系统,克服现有评价技术的不足,尤其是专家的主观性,提供一种基于专利等科技大数据进行技术成熟度评估的方法,更具客观真实性,且可实现机器的半自动化计算。
图1为按照本发明的基于科技大数据的技术成熟度判断方法的一优选实施例的流程图。
图1A为按照本发明的基于科技大数据的技术成熟度判断方法的如图1所示实施例的数据检索方法流程图。
图1B为按照本发明的基于科技大数据的技术成熟度判断方法的如图1所示实施例的数据整理方法流程图。
图1C为按照本发明的基于科技大数据的技术成熟度判断方法的如图1所示实施例的回归计算方法流程图。
图2为按照本发明的基于科技大数据的技术成熟度判断系统的一优选实施例的模块图。
下面结合附图和具体的实施例对本发明做进一步的阐述。
实施例一
如图1所示,执行步骤100,建立数据库、算法库和指标库。数据库为多维科技数据库,所述多维科技数据库包括专利库、论文库、项目库和新闻库中至少一种;算法库包括科技知识图谱和/或技术集群,所述科技知识图谱和/或技术集群用于定义技术点、技术集群和与其相关技术内容;指标库中的指标包括技术影响力指标、技术影响力即时性指标和核心节点专利技术占比中至少一种指标。
执行步骤110,在所述数据库中进行数据检索。如图1A所示,执行步骤111,利用已有的科技知识图谱,输入待分析评价技术点的名称,通过所述待分析评价技术点之间的联系与步距,确定所述待分析评价技术点的特征的多个关键词语义。执行步骤112,将步骤111得到的所述关键词语义,在论文、项目、新闻库中,进行整体数据检索。执行步骤113,将所述步骤111得到的所述关键词语义,与已有的技术集群进行交集,将所述交集的结果在专利库中进行整体数据检索。执行步骤114,将所述步骤112和所述步骤113的数据检索结果综合
执行步骤120,对检索结果进行数据计算整理。如图1B所示,执行步骤121,利用现有公开技术对专利、论文和项目中至少一种客观大数据进行数据清洗和消歧,利用现有公开技术对新闻非客观大数据进行计算机情感计算,推算真实性保留可靠结果。执行步骤122,根据不同纵坐标绘制基本坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为专利申请数量、专利授权数量、分区论文数量、分区项目数量和新闻中至少一种数据。执行步骤123,根据不同纵坐标绘制参照坐标曲线图,所述参照坐标曲线图的横坐标为时间,纵坐标为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标。
执行步骤130,对整理好的数据进行回归计算,得到技术成熟度指标。如图1C所示,执行步骤131,拟合曲线f
1(x)=a
1x+b
1,其中,x为时间,f
1(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,a
1为相应指标的变化斜率,b
1为曲线截距。执行步骤132,拟合曲线f
2(x)=a
2x
2+b
2x+c,其中x为时间,f
2(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,其中,a
2为相应指标的变化斜率的增长率,b
2为拟合一次项参数,c为曲线截距。执行步骤133,照最小二乘法通用方法拟合曲线的模型。执行步骤140,对应所述技术成熟度指标,汇总得到判断结论。判断结论包括判定技术成熟度处于以下阶段中的一种:萌芽期阶段、成长期阶段、成熟期 阶段和衰退期阶段,判定方法为:
1、当所述基本坐标曲线中的纵坐标数据主体为所述分区论数量时,判定为萌芽期阶段,所述萌芽期包括萌芽期早期阶段和萌芽期向成长期阶段。
1.1、当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期向成长期阶段:
1)论文逐步从SCI论文向EI论文过渡,论文技术影响力大于专利技术影响力;
2)所述论文技术影响力曲线中的a
1大于0,a
2接近0或小于0,增速下降;
3)所述论文技术影响力即时性曲线中的a
1小于0,a
2接近0或大于0;
4)所述专利技术影响力曲线中的a
1大于0,a
2大于0;
5)所述核心节点专利曲线中的a
1大于0,a
2大于0
1.2、当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期向成长期阶段:
1)论文逐步从SCI论文向EI论文过渡,论文技术影响力大于专利技术影响力;
2)所述论文技术影响力曲线中的a
1大于0,a
2接近0或小于0,增速下降;
3)所述论文技术影响力即时性曲线中的a
1小于0,a
2接近0或大于0;
4)所述专利技术影响力曲线中的a
1大于0,a
2大于0;
5)所述核心节点专利曲线中的a
1大于0,a
2大于0。
2、满足下述判断条件时,判定所述技术成熟度处于成长期阶段:
1)所述基本坐标曲线中的纵坐标数据主体EI论文超过SCI论文数量;
2)所述专利数据增长率趋于稳定,所述专利技术影响力逐渐接近所述论文技术影响力,所述核心节点专利比例上升;
3)所述论文技术影响力曲线中的a
1接近0;
4)所述论文技术影响力即时性曲线中的a
1小于0,a
2接近0或大于0;
5)所述专利技术影响力曲线中的a
1大于0,a
2接近0或小于0;
6)所述专利技术影响力即时性曲线中的a
1小于0,a
2接近0或小于0;
7)所述核心节点专利曲线中的a
1大于0,a
2逐渐接近0。
3、当满足下述判断条件时,判定所述技术成熟度处于所述成熟期阶段:
1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;
2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即 时性比萌芽期下降,核心节点专利技术占比趋于稳定;
3)所述论文技术影响力曲线中的a
1小于0;
4)所述论文技术影响力即时性曲线中的a
1小于0;
5)所述专利技术影响力曲线中的a
1接近或小于0,a
2接近0或小于0;
6)所述专利技术影响力即时性曲线中的a
1小于0,a
2接近0或小于0;
7)所述核心节点专利曲线中的a
1接近或小于0,a
2小于0。
4、当满足下述判断条件时,判定所述技术成熟度处于所述衰退期阶段:
1)所述新闻数据为主体,专利与论文数据均下降;
2)所述论文技术影响力曲线中的a
1小于-1;
3)所述专利技术影响力曲线中的a
1小于-1;
4)所述核心节点专利曲线中的a
1小于-1。
实施例二
如图2所示,一种基于科技大数据的技术成熟度判断系统包括数据库200、算法库210、指标库220、数据检索模块230、数据整理模块240、数据计算模块250和汇总判断模块260。
数据库200为多维科技数据库,所述多维科技数据库包括专利库、论文库、项目库和新闻库中至少一种。
算法库210包括科技知识图谱和/或技术集群,所述科技知识图谱和/或技术集群用于定义技术点、技术集群和与其相关技术内容。
指标库220中的指标包括技术影响力指标、技术影响力即时性指标和核心节点专利技术占比中至少一种指标。
数据检索模块230:用于在所述数据库中进行数据检索,包括以下子模块:
整体检索子模块:用于利用已有的科技知识图谱,输入待分析评价技术点的名称,通过所述待分析评价技术点之间的联系与步距,确定所述待分析评价技术点的特征的多个关键词语义;
数据挖掘子模块:用于将所述整体检索子模块中得到的所述关键词语义,在论文、项目、新闻库中,进行整体数据检索;
指定检索子模块:用于将所述整体检索子模块中得到的所述关键词语义,与已有的技术集群进行交集,将所述交集的结果在专利库中进行整体数据检索;综合子模块:用于将所述数据挖掘子模块和所述指定检索子模块的数据检索结果综 合。
数据整理模块240:用于对检索结果进行数据计算整理。包括以下子模块:
数据清洗子模块:用于利用现有公开技术对专利、论文和项目中至少一种客观大数据进行数据清洗和消歧,利用现有公开技术对新闻非客观大数据进行计算机情感计算,推算真实性保留可靠结果。
曲线图绘制子模块:用于根据不同纵坐标绘制基本坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为专利申请数量、专利授权数量、分区论文数量、分区项目数量和新闻中至少一种数据。
曲线图修订子模块:用于根据不同纵坐标绘制参照坐标曲线图,所述参照坐标曲线图的横坐标为时间,纵坐标为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标。
数据计算模块250:用于对整理好的数据进行回归计算,得到技术成熟度指标。数据计算模块的工作包括以下步骤:
步骤31:拟合曲线f
1(x)=a
1x+b
1,其中,x为时间,f
1(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,a
1为相应指标的变化斜率,b
1为曲线截距;
步骤32:拟合曲线f
2(x)=a
2x
2+b
2x+c,其中x为时间,f
2(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,其中,a
2为相应指标的变化斜率的增长率,b
2为拟合一次项参数,c为曲线截距;
步骤33:参照最小二乘法通用方法拟合曲线的模型。
汇总判断模块260:用于对应所述技术成熟度指标,汇总得到判断结论。判断结论包括以下几种情况:
1、当所述基本坐标曲线中的纵坐标数据主题为论文数据时,判定为萌芽期,萌芽期包括萌芽期早期阶段和萌芽期向成长期阶段。
1.1当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期早期阶段:
1)论文技术影响力大于专利技术影响力,或专利技术影响力为零但论文技术影响力为正值;
2)所述论文技术影响力曲线中的a
1大于1,a
2大于0。
1.2当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期向成长期阶段:
1)论文逐步从SCI论文向EI论文过渡,所述论文技术影响力大于所述专利技术影响力;
2)所述论文技术影响力曲线中的a
1大于0,a
2接近0或小于0,增速下降;
3)所述论文技术影响力即时性曲线中的a
1小于0,a
2接近0或大于0;
4)所述专利技术影响力曲线中的a
1大于0,a
2大于0;
5)所述核心节点专利曲线中的a
1大于0,a
2大于0。
2、当满足下述判断条件时,判定所述技术成熟度处于所述成长期阶段:
1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;
2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;
3)所述论文技术影响力曲线中的a
1小于0;
4)所述论文技术影响力即时性曲线中的a
1小于0;
5)所述专利技术影响力曲线中的a
1接近或小于0,a
2接近0或小于0;
6)所述专利技术影响力即时性曲线中的a
1小于0,a
2接近0或小于0;
7)所述核心节点专利曲线中的a
1接近或小于0,a
2小于0。
3、当满足下述判断条件时,判定所述技术成熟度处于所述成熟期阶段:
1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;
2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;
3)所述论文技术影响力曲线中的a
1小于0;
4)所述论文技术影响力即时性曲线中的a
1小于0;
5)所述专利技术影响力曲线中的a
1接近或小于0,a
2接近0或小于0;
6)所述专利技术影响力即时性曲线中的a
1小于0,a
2接近0或小于0;
7)所述核心节点专利曲线中的a
1接近或小于0,a
2小于0。
4、当满足下述判断条件时,判定所述技术成熟度处于所述衰退期阶段:
1)所述新闻数据为主体,专利与论文数据均下降;
2)所述论文技术影响力曲线中的a
1小于-1;
3)所述专利技术影响力曲线中的a
1小于-1;
4)所述核心节点专利曲线中的a
1小于-1。
实施例三
本发明基于专利分析法为主,辅助以论文、项目等科技大数据分析,通过多维度评价指标和算法,将技术点放在与之相关联的技术集群中进行综合分析,将科技大数据指标与技术成熟度建立映射,实现自动化判断,有别于传统的专家评审法等需大量依靠人工主观性工作,更具客观真实性。
本发明的技术方案克服现有评价技术的不足,尤其是专家的主观性,提供一种基于专利等科技大数据进行技术成熟度评估的方法,更具客观真实性,且可实现机器的半自动化计算,具体步骤如下:
步骤1:建立数据库,算法库和指标库;
所述步骤1包括以下子步骤:
步骤11:建立多维科技数据库,涵盖专利库,论文库(分区,包括SCI和EI),项目库(分区,包括基础研究项目,应用研究项目和产业推广项目),以及新闻库;
步骤12:建立算法库,具体涵盖科技知识图谱和技术集群,综合定义技术点和技术集群的内涵和外延,量化技术的相互影响节点;
步骤13:建立指标库,具体涵盖专利指标、论文指标、项目指标、新闻指标;
所述步骤13包括以下指标:
技术影响力:由专利和论文的后引数,引用者在技术集群下的排名,进行领域内归一化处理,包括专利技术影响力和论文技术影响力;
技术影响力即时性:专利和论文技术影响力的平均时间
核心节点专利技术占比:运用复杂技术网络算法,在技术集群下,指定技术的专利影响力在同技术集群下的价值占比;
步骤2:构建检索式,在步骤11所述多维科技数据库中进行数据检索;
所述步骤2包括以下子步骤:
步骤21:利用现有公开技术,对专利、论文和项目中至少一种客观大数据进行数据清洗和消歧;
步骤22:根据不同纵坐标绘制基本坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为专利申请数量、专利授权数量、分区论文数量和分区项目 数量、新闻中至少一种数据;
分区论文数量具体解释为:
分区论文数量主要包括SCI论文数量与EI论文数量两类。
其中,SCI论文指美国科学信息研究所(ISI)编辑出版的引文索引类刊物,全球数万种期刊中选出3300种科技期刊,涉及自然基础科学的100余个领域。EI论文指是全球范围内的一个数据库Engineering Index所收录的文献,主要包括工程技术领域的重要文献。
步骤23:根据不同纵坐标绘制参照坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标。
步骤3:进行数据计算整理;
所述步骤3包括以下子步骤:
步骤31:利用现有公开技术,对专利、论文和项目中至少一种客观大数据进行数据清洗和消歧;利用现有公开技术,对新闻非客观大数据进行计算机情感计算,推算真实性保留可靠结果;
步骤32:绘制基本坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为专利申请数量、专利授权数量、分区论文数量和分区项目数量、新闻中至少一种数据;
步骤33:绘制参照坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标。
步骤4:进行算法回归;
所述步骤4包括以下子步骤:
步骤41:拟合曲线f
1(x)=a
1x+b
1,其中,x为时间,f
1(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,a
1为相应指标的变化斜率,b
1为曲线截距;
步骤42:拟合曲线f
2(x)=a
2x
2+b
2x+c,其中x为时间,f
2(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,其中,a
2为相应指标的变化斜率的增长率,b
2为拟合一次项参数,c为曲线截距。
步骤43:拟合曲线的模型参照最小二乘法通用方法,即如下通用方法:
拟合曲线的残差值为:
对I求e
i的偏导值,使其为0,得到:
变换得到:
求解改方程组可得到唯一解,从而确定拟合曲线f(x)。
步骤5:对应指标,如表1所示,汇总得出如下结论:
当基本坐标曲线中的纵坐标数据主体为论文数据,判定为萌芽期;
当论文技术影响力大于所述专利技术影响力,或专利技术影响力为零但所述论文技术影响力为正值;论文技术影响力曲线中的a
1大于1,a
2大于0,为萌芽期早期,近似于TRL1-2的基础研究或概念研究阶段,需满足下述判断条件:此时论文技术影响力大于专利技术影响力,或专利技术影响力为零但论文技术影响力为正值;此时曲线回归值论文技术影响力a
1大于1,a
2大于0。
萌芽期向成长期阶段,近似于TRL3-4的应用分析或实验室研究阶段,需满足下述判断条件:此时论文逐步从SCI论文向EI论文过渡,论文技术影响力仍大于专利技术影响力;此时论文技术影响力a
1大于0,a
2接近0或小于0,增速有所下降;论文技术影响力即时性a
1小于0,a
2接近0或大于0;此时专利技术影响力a
1大于0,a
2大于0;核心节点专利a
1大于0,a
2大于0。
当基本坐标曲线中的纵坐标数据主体EI论文超过SCI论文数量,且专利数据增长率趋于稳定时,判定为成长期;
此时专利技术影响力逐渐接近论文技术影响力,核心节点专利比例上升;此时论文技术影响力a
1接近0;论文技术影响力即时性a
1小于0,a
2接近0或大于0;此时专利技术影响力a
1大于0,a
2接近0或小于0;专利技术影响力即时性a
1小于0,a
2接近0或小于0;核心节点专利a
1大于0,a
2逐渐接近0。
当基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,判定为成熟期;
此时基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现,参照曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;此时论文技术影响力a
1小于0;论文技术影响力即时性a
1小于0;此时专利技术影响力a
1接近或小于0,a
2接近0或小于0;专利技术影响力即时性a
1小于0,a
2接近0或小于0;核心节点专利a
1接近或小于0,a
2小于0。
当新闻数据为主体时,专利与论文数据均显著下降时,判定进入衰退期;
此时论文技术影响力a
1小于-1;专利技术影响力a
1小于-1;核心节点专利a
1小于-1。
表1
为了更好地理解本发明,以上结合本发明的具体实施例做了详细描述,但并非是对本发明的限制。凡是依据本发明的技术实质对以上实施例所做的任何简单修改,均仍属于本发明技术方案的范围。本说明书中每个实施例重点说明的都是与其它实施例的不同之处,各个实施例之间相同或相似的部分相互参见即可。对于系统实施例而言,由于其与方法实施例基本对应,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。
Claims (28)
- 一种基于科技大数据的技术成熟度判断方法,包括建立数据库、算法库和指标库,其特征在于,还包括以下步骤:步骤1:在所述数据库中进行数据检索;步骤2:对检索结果进行数据计算整理;步骤3:对整理好的数据进行回归计算,得到技术成熟度指标;步骤4:对应所述技术成熟度指标,汇总得到判断结论。
- 如权利要求1所述的基于科技大数据的技术成熟度判断方法,其特征在于,所述数据库为多维科技数据库,所述多维科技数据库包括专利库、论文库、项目库和新闻库中至少一种。
- 如权利要求2所述的基于科技大数据的技术成熟度判断方法,其特征在于,所述算法库包括科技知识图谱和/或技术集群,所述科技知识图谱和/或技术集群用于定义技术点、技术集群和与其相关技术内容。
- 如权利要求3所述的基于科技大数据的技术成熟度判断方法,其特征在于,所述指标库中的指标包括技术影响力指标、技术影响力即时性指标和核心节点专利技术占比中至少一种指标。
- 如权利要求4所述的基于科技大数据的技术成熟度判断方法,其特征在于,所述步骤1包括以下子步骤:步骤11:利用已有的科技知识图谱,输入待分析评价技术点的名称,通过所述待分析评价技术点之间的联系与步距,确定所述待分析评价技术点的特征的多个关键词语义;步骤12:将所述步骤11得到的所述关键词语义,在论文、项目、新闻库中,进行整体数据检索;步骤13:将所述步骤11得到的所述关键词语义,与已有的技术集群进行交集,将所述交集的结果在专利库中进行整体数据检索;步骤14:将所述步骤12和所述步骤13的数据检索结果综合。
- 如权利要求5所述的基于科技大数据的技术成熟度判断方法,其特征在于,所述步骤2包括以下子步骤:步骤21:对专利、论文和项目中至少一种客观大数据进行数据清洗和消歧,对新闻非客观大数据进行计算机情感计算,推算真实性保留可靠结果;步骤22:根据不同纵坐标绘制基本坐标曲线图,所述基本坐标曲线图的横 坐标为时间,纵坐标为专利申请数量、专利授权数量、分区论文数量、分区项目数量和新闻中至少一种数据;步骤23:根据不同纵坐标绘制参照坐标曲线图,所述参照坐标曲线图的横坐标为时间,纵坐标为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标。
- 如权利要求6所述的基于科技大数据的技术成熟度判断方法,其特征在于,所述步骤3包括以下子步骤:步骤31:拟合曲线f 1(x)=a 1x+b 1,其中,x为时间,f 1(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,a 1为相应指标的变化斜率,b 1为曲线截距;步骤32:拟合曲线f 2(x)=a 2x 2+b 2x+c,其中x为时间,f 2(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,其中,a 2为相应指标的变化斜率的增长率,b 2为拟合一次项参数,c为曲线截距;步骤33:参照最小二乘法通用方法拟合曲线的模型。
- 如权利要求7所述的基于科技大数据的技术成熟度判断方法,其特征在于,所述判断结论包括判定技术成熟度处于以下阶段中的一种:萌芽期阶段、成长期阶段、成熟期阶段和衰退期阶段。
- 如权利要求8所述的基于科技大数据的技术成熟度判断方法,其特征在于,当所述基本坐标曲线中的纵坐标数据主体为所述分区论数量时,判定为萌芽期阶段,所述萌芽期包括萌芽期早期阶段和萌芽期向成长期阶段。
- 如权利要求9所述的基于科技大数据的技术成熟度判断方法,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期早期阶段:1)所述论文技术影响力大于所述专利技术影响力,或所述专利技术影响力为零但所述论文技术影响力为正值;2)所述论文技术影响力曲线中的a 1大于1,a 2大于0。
- 如权利要求9所述的基于科技大数据的技术成熟度判断方法,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期向成长期阶段:1)论文逐步从SCI论文向EI论文过渡,所述论文技术影响力大于所述专利技术影响力;2)所述论文技术影响力曲线中的a 1大于0,a 2接近0或小于0,增速下降;3)所述论文技术影响力即时性曲线中的a 1小于0,a 2接近0或大于0;4)所述专利技术影响力曲线中的a 1大于0,a 2大于0;5)所述核心节点专利曲线中的a 1大于0,a 2大于0。
- 如权利要求7所述的基于科技大数据的技术成熟度判断方法,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于成长期阶段:1)所述基本坐标曲线中的纵坐标数据主体EI论文超过SCI论文数量;2)所述专利数据增长率趋于稳定,所述专利技术影响力逐渐接近所述论文技术影响力,所述核心节点专利比例上升;3)所述论文技术影响力曲线中的a 1接近0;4)所述论文技术影响力即时性曲线中的a 1小于0,a 2接近0或大于0;5)所述专利技术影响力曲线中的a 1大于0,a 2接近0或小于0;6)所述专利技术影响力即时性曲线中的a 1小于0,a 2接近0或小于0;7)所述核心节点专利曲线中的a 1大于0,a 2逐渐接近0。
- 如权利要求7所述的基于科技大数据的技术成熟度判断方法,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于所述成熟期阶段:1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;3)所述论文技术影响力曲线中的a 1小于0;4)所述论文技术影响力即时性曲线中的a 1小于0;5)所述专利技术影响力曲线中的a 1接近或小于0,a 2接近0或小于0;6)所述专利技术影响力即时性曲线中的a 1小于0,a 2接近0或小于0;7)所述核心节点专利曲线中的a 1接近或小于0,a 2小于0。
- 如权利要求7所述的基于科技大数据的技术成熟度判断方法,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于所述衰退期阶段:1)所述新闻数据为主体,专利与论文数据均下降;2)所述论文技术影响力曲线中的a 1小于-1;3)所述专利技术影响力曲线中的a 1小于-1;4)所述核心节点专利曲线中的a 1小于-1。
- 一种基于科技大数据的技术成熟度判断系统,包括数据库、算法库和指标库,其特征在于,还包括以下模块:数据检索模块:用于在所述数据库中进行数据检索;数据整理模块:用于对检索结果进行数据计算整理;数据计算模块:用于对整理好的数据进行回归计算,得到技术成熟度指标;汇总判断模块:用于对应所述技术成熟度指标,汇总得到判断结论。
- 如权利要求15所述的基于科技大数据的技术成熟度判断系统,其特征在于,所述数据库为多维科技数据库,所述多维科技数据库包括专利库、论文库、项目库和新闻库中至少一种。
- 如权利要求16所述的基于科技大数据的技术成熟度判断系统,其特征在于,所述算法库包括科技知识图谱和/或技术集群,所述科技知识图谱和/或技术集群用于定义技术点、技术集群和与其相关技术内容。
- 如权利要求17所述的基于科技大数据的技术成熟度判断系统,其特征在于,所述指标库中的指标包括技术影响力指标、技术影响力即时性指标和核心节点专利技术占比中至少一种指标。
- 如权利要求18所述的基于科技大数据的技术成熟度判断系统,其特征在于,所述数据检索模块包括以下子模块:整体检索子模块:用于利用已有的科技知识图谱,输入待分析评价技术点的名称,通过所述待分析评价技术点之间的联系与步距,确定所述待分析评价技术点的特征的多个关键词语义;数据挖掘子模块:用于将所述整体检索子模块中得到的所述关键词语义,在论文、项目、新闻库中,进行整体数据检索;指定检索子模块:用于将所述整体检索子模块中得到的所述关键词语义,与已有的技术集群进行交集,将所述交集的结果在专利库中进行整体数据检索;综合子模块:用于将所述数据挖掘子模块和所述指定检索子模块的数据检索结果综合。
- 如权利要求19所述的基于科技大数据的技术成熟度判断系统,其特征在于,所述数据整理模块包括以下子模块:数据清洗子模块:用于对专利、论文和项目中至少一种客观大数据进行数据 清洗和消歧,对新闻非客观大数据进行计算机情感计算,推算真实性保留可靠结果;曲线图绘制子模块:用于根据不同纵坐标绘制基本坐标曲线图,所述基本坐标曲线图的横坐标为时间,纵坐标为专利申请数量、专利授权数量、分区论文数量、分区项目数量和新闻中至少一种数据;曲线图修订子模块:用于根据不同纵坐标绘制参照坐标曲线图,所述参照坐标曲线图的横坐标为时间,纵坐标为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标。
- 如权利要求15所述的基于科技大数据的技术成熟度判断系统,其特征在于,所述数据计算模块的工作包括以下步骤:步骤31:拟合曲线f 1(x)=a 1x+b 1,其中,x为时间,f 1(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,a 1为相应指标的变化斜率,b 1为曲线截距;步骤32:拟合曲线f 2(x)=a 2x 2+b 2x+c,其中x为时间,f 2(x)为所述技术影响力、所述技术影响力即时性和核心节点专利中至少一种指标,其中,a 2为相应指标的变化斜率的增长率,b 2为拟合一次项参数,c为曲线截距;步骤33:参照最小二乘法通用方法拟合曲线的模型。
- 如权利要求21所述的基于科技大数据的技术成熟度判断系统,其特征在于,所述判断结论包括判定技术成熟度处于以下阶段中的一种:萌芽期阶段、成长期阶段、成熟期阶段和衰退期阶段。
- 如权利要求22所述的基于科技大数据的技术成熟度判断系统,其特征在于,当所述基本坐标曲线中的纵坐标数据主体为所述分区论数量时,判定为萌芽期阶段,所述萌芽期包括萌芽期早期阶段和萌芽期向成长期阶段。
- 如权利要求23所述的基于科技大数据的技术成熟度判断系统,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期早期阶段:1)论文技术影响力大于专利技术影响力,或专利技术影响力为零但论文技术影响力为正值;2)所述论文技术影响力曲线中的a 1大于1,a 2大于0。
- 如权利要求23所述的基于科技大数据的技术成熟度判断系统,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于所述萌芽期向成长期阶 段:1)论文逐步从SCI论文向EI论文过渡,所述论文技术影响力大于所述专利技术影响力;2)所述论文技术影响力曲线中的a 1大于0,a 2接近0或小于0,增速下降;3)所述论文技术影响力即时性曲线中的a 1小于0,a 2接近0或大于0;4)所述专利技术影响力曲线中的a 1大于0,a 2大于0;5)所述核心节点专利曲线中的a 1大于0,a 2大于0。
- 如权利要求22所述的基于科技大数据的技术成熟度判断系统,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于成长期阶段:1)所述基本坐标曲线中的纵坐标数据主体EI论文超过SCI论文数量;2)所述专利数据增长率趋于稳定,所述专利技术影响力逐渐接近所述论文技术影响力,所述核心节点专利比例上升;3)所述论文技术影响力曲线中的a 1接近0;4)所述论文技术影响力即时性曲线中的a 1小于0,a 2接近0或大于0;5)所述专利技术影响力曲线中的a 1大于0,a 2接近0或小于0;6)所述专利技术影响力即时性曲线中的a 1小于0,a 2接近0或小于0;7)所述核心节点专利曲线中的a 1大于0,a 2逐渐接近0。
- 如权利要求22所述的基于科技大数据的技术成熟度判断系统,其特征在于,。当满足下述判断条件时,判定所述技术成熟度处于所述成熟期阶段:1)所述基本坐标曲线中的纵坐标数据主体专利数据超出论文数据,基本坐标曲线中的专利数据趋于平稳,工程化研究项目和新闻数据开始出现;2)所述参照坐标曲线中的专利技术影响力超过技术论文影响力,技术影响力即时性比萌芽期下降,核心节点专利技术占比趋于稳定;3)所述论文技术影响力曲线中的a 1小于0;4)所述论文技术影响力即时性曲线中的a 1小于0;5)所述专利技术影响力曲线中的a 1接近或小于0,a 2接近0或小于0;6)所述专利技术影响力即时性曲线中的a 1小于0,a 2接近0或小于0;7)所述核心节点专利曲线中的a 1接近或小于0,a 2小于0。
- 如权利要求22所述的基于科技大数据的技术成熟度判断系统,其特征在于,当满足下述判断条件时,判定所述技术成熟度处于所述衰退期阶段:1)所述新闻数据为主体,专利与论文数据均下降;2)所述论文技术影响力曲线中的a 1小于-1;3)所述专利技术影响力曲线中的a 1小于-1;4)所述核心节点专利曲线中的a 1小于-1。
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP20904895.8A EP4080429A4 (en) | 2019-12-27 | 2020-01-20 | METHOD AND SYSTEM TO DETERMINE TECHNOLOGICAL MATURITY BASED ON SCIENTIFIC AND TECHNOLOGICAL BIG DATA |
| US17/849,732 US20220327398A1 (en) | 2019-12-27 | 2022-06-27 | Technology maturity judgment method and system based on science and technology data |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201911375474.3 | 2019-12-27 | ||
| CN201911375474.3A CN111126865B (zh) | 2019-12-27 | 2019-12-27 | 一种基于科技大数据的技术成熟度判断方法和系统 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/849,732 Continuation-In-Part US20220327398A1 (en) | 2019-12-27 | 2022-06-27 | Technology maturity judgment method and system based on science and technology data |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2021128523A1 true WO2021128523A1 (zh) | 2021-07-01 |
Family
ID=70503918
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2020/073118 Ceased WO2021128523A1 (zh) | 2019-12-27 | 2020-01-20 | 一种基于科技大数据的技术成熟度判断方法和系统 |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20220327398A1 (zh) |
| EP (1) | EP4080429A4 (zh) |
| CN (1) | CN111126865B (zh) |
| WO (1) | WO2021128523A1 (zh) |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112231349B (zh) * | 2020-09-10 | 2024-06-21 | 广州众成大数据科技有限公司 | 产品技术成熟度曲线的处理方法、系统和存储介质 |
| CN112685562B (zh) * | 2020-12-28 | 2021-12-07 | 中科院计算技术研究所大数据研究院 | 一种基于XGBoost模型的多维指标集成的技术评价方法 |
| CN114492402A (zh) * | 2021-12-28 | 2022-05-13 | 北京航天智造科技发展有限公司 | 一种科技新词识别方法及装置 |
| CN115471483A (zh) * | 2022-09-21 | 2022-12-13 | 北京智谱华章科技有限公司 | 基于雷达图的多维度技术预警方法 |
| CN115544114A (zh) * | 2022-09-21 | 2022-12-30 | 北京智谱华章科技有限公司 | 基于大数据的多维度技术评估方法和系统 |
| CN117252262B (zh) * | 2023-09-28 | 2024-07-26 | 四川大学 | 知识图谱构建与专利信息检索方法及装置 |
| CN117744927A (zh) * | 2023-12-19 | 2024-03-22 | 深圳市力合产业研究有限公司 | 基于论文及专利的技术成熟度评估方法、设备及存储介质 |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060184372A1 (en) * | 2003-04-04 | 2006-08-17 | Ellie Okada | Intelligence value index calculation method |
| CN101276345A (zh) * | 2007-03-29 | 2008-10-01 | 上海汉光知识产权数据科技有限公司 | 专利数据分析系统 |
| CN105184078A (zh) | 2015-09-06 | 2015-12-23 | 华南理工大学 | 基于专利相对量分析的技术成熟度评价方法 |
| CN105678659A (zh) * | 2016-01-29 | 2016-06-15 | 中国兵器工业新技术推广研究所 | 一种国防科技成果推广成熟度评价方法 |
| CN106570616A (zh) * | 2016-10-19 | 2017-04-19 | 国网上海市电力公司 | 一种科技项目评估用定量评价方法 |
| CN108614867A (zh) * | 2018-04-12 | 2018-10-02 | 科技部科技评估中心 | 基于学术论文的技术前沿性指数计算方法及系统 |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070294232A1 (en) * | 2006-06-15 | 2007-12-20 | Andrew Gibbs | System and method for analyzing patent value |
| US9177249B2 (en) * | 2012-06-29 | 2015-11-03 | Ut-Battelle, Llc | Scientometric methods for identifying emerging technologies |
| CN106384163A (zh) * | 2016-09-08 | 2017-02-08 | 国网能源研究院 | 一种新能源发电技术成熟度预测方法及装置 |
| AU2017356848A1 (en) * | 2016-11-10 | 2019-06-06 | Search Technology, Inc. | Technological emergence scoring and analysis platform |
| KR101932517B1 (ko) * | 2017-11-27 | 2018-12-26 | 한국발명진흥회 | 다중회귀모델을 활용한 특허 평가 방법 및 시스템 |
| CN109902168B (zh) * | 2019-01-25 | 2022-02-11 | 北京创新者信息技术有限公司 | 一种专利评价方法和系统 |
-
2019
- 2019-12-27 CN CN201911375474.3A patent/CN111126865B/zh active Active
-
2020
- 2020-01-20 WO PCT/CN2020/073118 patent/WO2021128523A1/zh not_active Ceased
- 2020-01-20 EP EP20904895.8A patent/EP4080429A4/en not_active Withdrawn
-
2022
- 2022-06-27 US US17/849,732 patent/US20220327398A1/en active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060184372A1 (en) * | 2003-04-04 | 2006-08-17 | Ellie Okada | Intelligence value index calculation method |
| CN101276345A (zh) * | 2007-03-29 | 2008-10-01 | 上海汉光知识产权数据科技有限公司 | 专利数据分析系统 |
| CN105184078A (zh) | 2015-09-06 | 2015-12-23 | 华南理工大学 | 基于专利相对量分析的技术成熟度评价方法 |
| CN105678659A (zh) * | 2016-01-29 | 2016-06-15 | 中国兵器工业新技术推广研究所 | 一种国防科技成果推广成熟度评价方法 |
| CN106570616A (zh) * | 2016-10-19 | 2017-04-19 | 国网上海市电力公司 | 一种科技项目评估用定量评价方法 |
| CN108614867A (zh) * | 2018-04-12 | 2018-10-02 | 科技部科技评估中心 | 基于学术论文的技术前沿性指数计算方法及系统 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4080429A4 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4080429A1 (en) | 2022-10-26 |
| EP4080429A4 (en) | 2023-01-18 |
| US20220327398A1 (en) | 2022-10-13 |
| CN111126865B (zh) | 2023-05-23 |
| CN111126865A (zh) | 2020-05-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2021128523A1 (zh) | 一种基于科技大数据的技术成熟度判断方法和系统 | |
| US10074079B2 (en) | Systems and methods for automated analysis, screening and reporting of group performance | |
| CN110689195A (zh) | 一种电力日负荷预测方法 | |
| CN105335814A (zh) | 在线大数据智能云审计方法及系统 | |
| CN105976109A (zh) | 一种基于大数据智能审计的方法及系统 | |
| CN104156411A (zh) | 专利价值数据综合处理系统 | |
| US11243951B2 (en) | Systems and methods for automated analysis, screening, and reporting of group performance | |
| CN103336771B (zh) | 基于滑动窗口的数据相似检测方法 | |
| CN116644184B (zh) | 基于数据聚类的人力资源信息管理系统 | |
| CN108389069A (zh) | 基于随机森林和逻辑回归的优质客户识别方法及装置 | |
| US20190340517A2 (en) | A method for detection and characterization of technical emergence and associated methods | |
| CN105184078A (zh) | 基于专利相对量分析的技术成熟度评价方法 | |
| WO2020259391A1 (zh) | 一种数据库脚本性能测试的方法及装置 | |
| CN106502878A (zh) | 一种基于相对成熟度的业务系统评价方法及装置 | |
| CN107729377A (zh) | 基于数据挖掘的顾客分类方法与系统 | |
| CN110310012B (zh) | 数据分析方法、装置、设备及计算机可读存储介质 | |
| CN105786810B (zh) | 类目映射关系的建立方法与装置 | |
| CN115526501A (zh) | 一种基于融合聚类的智能建筑节能评估方法 | |
| CN120216506A (zh) | 企业造价智能分析与决策支持系统 | |
| CN107944704A (zh) | 对评审专家进行绩效评价的方法 | |
| Li et al. | Big-Data Measurement-Model Research about Judges’ Actual Workload in China | |
| Dikananda et al. | Application of the K-Means Algorithm in Enhancing the Clustering Model for Job Seekers in Cirebon City | |
| CN119884103B (zh) | 一种基于大数据的综合评价数据分析方法及系统 | |
| CN119917732B (zh) | 一种基于大数据的招工就业智能推荐系统 | |
| CN119808794B (zh) | 一种基于ai驱动的大数据智能解析分析方法及系统 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20904895 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2020904895 Country of ref document: EP Effective date: 20220721 |




