EP2715525A2 - Méthode et système de consommation assistée par ordinateur d'informations provenant de fichiers de données d'applications - Google Patents
Méthode et système de consommation assistée par ordinateur d'informations provenant de fichiers de données d'applicationsInfo
- Publication number
- EP2715525A2 EP2715525A2 EP12789674.4A EP12789674A EP2715525A2 EP 2715525 A2 EP2715525 A2 EP 2715525A2 EP 12789674 A EP12789674 A EP 12789674A EP 2715525 A2 EP2715525 A2 EP 2715525A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- highlighting
- document
- user
- users
- segments
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/169—Annotation, e.g. comment data or footnotes
Definitions
- Such use of the marked-up material is one example of "crowd-based intelligence" in its simplest form.
- Prior readers of academic texts mark the important sections of the document, which leaves later readers with the advantage of more-easily identifying those and other important sections in a particular text. Some readers may choose to ignore a prior reader's notations because the new reader either disagrees with the importance of that highlight or underline, or because that reader is more concerned with another area of the text more pertinent to the particular interest of that reader at that particular time.
- the repeated marking of important sections by other readers will continually identify further areas of the document that are of interest to the readers. Continued review and marking may lead to multiple highlighting of the same section or area of the document. Multiple markings by multiple viewers will, in turn, indicate to the reader the importance of that section or area of the document for other readers for any specific purpose, and may lead to a reader more closely attempting to comprehend that section.
- "crowd-based intelligence" model forms the backbone of a method and a system in which a number of reviewers are able to highlight, edit and review content materials and share their highlights, reviews and edits with other users. This method and system operate so that as more users identify important file sections, the more the later users benefit from their collective, synthesized insights.
- iv. Provide a tool box of, and system support for, commands and actions, which allow the user to quickly generate a coarse (broad level) identification of pages, paragraphs or passages of significance, or to specify the precise, "finer level” identification of segments; these commands/actions may be implemented with graphic icons;
- Coarse page/paragraph passage/etc. identifiers may be "color coded" or otherwise differentiated to signify parameter differentiation, for example, the significance level, nature of comment etc. Sound/Audio:
- the editor reading a given document, or listening to an audio file or viewing an image or video file may mark-up an important segment, and may share what the editor views as "important" with all other users according to a definition of "important" or "significant” that is published to editors, consumers of content and other users.
- the highlighting of the data file may be accomplished in some embodiments by simply selecting the appropriate material and submitting it, either through express commands or implicitly as a result of marking-up. The editor may also have the opportunity to add further comments with or without sharing the highlighted segments.
- editor could also rate the comments. This would be considered in the calculation of comment relevance.
- editors could also comment on comments, to further levels of recursion.
- the combination is made using a matrix.
- the system would refer to a stored matrix to determine how the highlight will appear to the reader. If the two editors mark the same selection identically, then the matrix calculation may tell the system to display the highlight in their common, preset way. If the two editors disagree in classifying the highlighted section, then the matrix would calculate which of the two distinct highlights will be displayed (and how), or calculate and display a third classification synthesized from the two distinct classifications. This may be accomplished in several ways, including providing different weights to their respective highlights, optionally in combination with reverse-diagonal calculations from the matrix illustrated in Figure 11.
- the general method of using weights would allow, as a special case, for the integration of one editor's highlights into the collection of highlights created by previous editors.
- the weight for the collectivized highlighting would generally be higher than that of a single editor because it represents the collective opinions of many. In most cases, if a single editor adds a highlight that agrees with the collective, the weight of the collective increases. If the editor disagrees, then the weight decreases. As the weight increases or decreases through integration of the highlights by several editors, the display to the user reflects the change in the relevance or meaning of a highlight.
- an editor's expertise level is provided by the editor himself or herself.
- the system may include a registration system where the editor enters the fields of their expertise. This may be compared to the category of document the editor is marking-up. For example, if the editor's profile indicates a Ph.D. in Physics T for their edits of an article in the Physics category, the system may recognize them as an expert.
- the editor could also indicate his or her expertise in the sub-field of the document (nuclear reactor, e.g.). Thus, an expert with a Ph.D. in Physics may claim is no expertise in nuclear reactors.
- Another embodiment allows a user to select their role as either a reader or an editor.
- read mode the user acts as a reader and views the document with accompanying highlights. In that mode the reader can add his or her own comments or highlights for their own personal use, but may choose not to share them with others.
- the reader can also view a list of all comments for each highlight in a separate pane of the display. The reader may be allowed at any time to switch to "editor/reviewer mode,” where they can add highlights with any accompanying comments, and then share them with other users.
- LDA Latent Dirichlet Allocation
- LDA LDA
- the system when an editor implements a new comment, the system would create a probabilistic vector for the new comment. By then multiplying the vector with already existing comment vectors, the system can compare the similarity of comments. If the product is within a certain threshold, the system will group the comments together. The grouped comments may allow a reader to see what the "crowd" believes is the significance of a highlighted segment. For example, if multiple editors labeled a segment as "important,” “vital,” “essential” and “fundamental,” then the system would group these together. This would make the purpose of the highlighting immediately recognizable by the reader. This is particularly useful when the highlighted segments are long, complicated or unwieldy.
- the editor may input connections between highlights. This could be in the form of comments, for example, that allow the editor to specify references to multiple highlights. For instance, if the underlying content was a legal case, an editor could highlight a segment where the court recites the facts and then connect it to a segment where the court applies those facts to the rule of law. These connections may also allow a reader to track certain aspect of a document, such as a particular character or theme.
- the central idea behind this invention is that a consumer of information will benefit from the knowledge and expertise of all previous consumers of same or similar information, guided by their collective intelligence as to the most important sections of the data file, thereby improving consumption efficiency and comprehension, and user experience.
- the parameters may also include the amount of time a reader wishes to spend on a document ⁇ or the number of words he wishes to read. Other such parameters may also be introduced and employed. For example, a reader may wish to see the sections of a document above a specified level of relevance compared to another document previously perused by the same reader.
- a paper document that has highlights physically written on it could be scanned into the system.
- the system would convert the scanned image into one it could read, and then convert it to the same format as similar documents.
- the same concept can be applied to inputting audio, visual, or audiovisual works into the system.
- the system allows editors to compare two or more documents through highlighting.
- This comparative highlighting allows a reader to efficiently view the differences and similarities of the documents. This could be useful, for example, if presented with two long contracts from two different cell phone companies. Much of the boiler-plate may be the same, but comparative highlighting allows a reader to quickly see the differences in key-terms, allowing him or her to make a more informed decision without having to wade through the complex legal terms. This could also be accomplished automatically by the system.
- the system could produce a tabulation of the key differences in a chart or spreadsheet.
- Coarse identification of related segments may be particularly useful for relating of the objects/segments across data files.
- FIG. 1 illustrates the basic structure of the invention.
- a document When a document is retrieved by the server, it first checks to ensure that it is in the proper format, and converts accordingly. Then if the document has not been reviewed before, it is sent to the review process. If it has been previously reviewed, then the user can either choose to read or review the document themselves. The reader may also use a combination of the two processes. In this Read/Review process, the user can seamlessly switch between the editor and reader roles, which are described in detail below.
- FIG. 2 demonstrates the review process.
- the user may first retrieve a document in 10, he or she may first enter their level of expertise etc., 20, or proceed straight to editing the document, 30.
- the editor may then mark-up segments and add comments to the document, which will be saved and sent to the server periodically, 40.
- the user is asked if they want to submit the document for integration with the collective document, 50. Depending on the user input it can go to box 60, Submit for integration, or 70, Review or revise document.
- the system may also calculate an Expert Level rating, when submitted.
- the highlights and comments will be uploaded and available to all other users.
- the editor may also publish the mark-ups at any time during his or her session.
- the system displays the stored document with its associated highlights and comments, 25.
- the reader may then enter parameters at 27, to customize the mark-ups displayed. This may include only mark-ups from users above a specified Expert Level Threshold at 28, highlights and comments from a particular subject, 29, or a certain number of mark-ups that will allow the reader to read the document at a particular speed, or within a set number of words at 30.
- system backend When the system backend receives a request for a document, in addition to retrieving the document itself, it also acquires the associated integrated highlights and comments from the
- the "relevance" level of overlapping highlighting in this case is determined by a simple reverse- diagonal algorithm. Also, this example assumes that equal weightage is given to the one user's highlighting as the collective, which generally will not be true.
- tag refers to a qualitative descriptor of a segment of a document.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- User Interface Of Digital Computer (AREA)
- Document Processing Apparatus (AREA)
Abstract
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201161519578P | 2011-05-24 | 2011-05-24 | |
| US201161572826P | 2011-07-23 | 2011-07-23 | |
| PCT/US2012/039482 WO2012162572A2 (fr) | 2011-05-24 | 2012-05-24 | Méthode et système de consommation assistée par ordinateur d'informations provenant de fichiers de données d'applications |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| EP2715525A2 true EP2715525A2 (fr) | 2014-04-09 |
| EP2715525A4 EP2715525A4 (fr) | 2016-05-18 |
Family
ID=50185365
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP12789674.4A Withdrawn EP2715525A4 (fr) | 2011-05-24 | 2012-05-24 | Méthode et système de consommation assistée par ordinateur d'informations provenant de fichiers de données d'applications |
Country Status (1)
| Country | Link |
|---|---|
| EP (1) | EP2715525A4 (fr) |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7779347B2 (en) * | 2005-09-02 | 2010-08-17 | Fourteen40, Inc. | Systems and methods for collaboratively annotating electronic documents |
| US7925993B2 (en) * | 2006-03-30 | 2011-04-12 | Amazon Technologies, Inc. | Method and system for aggregating and presenting user highlighting of content |
| JP2008278088A (ja) * | 2007-04-27 | 2008-11-13 | Hitachi Ltd | 動画コンテンツに関するコメント管理装置 |
-
2012
- 2012-05-24 EP EP12789674.4A patent/EP2715525A4/fr not_active Withdrawn
Also Published As
| Publication number | Publication date |
|---|---|
| EP2715525A4 (fr) | 2016-05-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9778826B2 (en) | Method and system for computer-aided consumption of information from application data files | |
| Hase et al. | Adapting to affordances and audiences? A cross-platform, multi-modal analysis of the platformization of news on Facebook, Instagram, TikTok, and Twitter | |
| WO2025042780A1 (fr) | Systèmes de synthèse contrôlable de contenu | |
| Deori et al. | Analysis of YouTube video contents on Koha and DSpace, and sentiment analysis of viewers' comments | |
| US20140115439A1 (en) | Methods and systems for annotating web pages and managing annotations and annotated web pages | |
| WO2013016719A1 (fr) | Gestion et fourniture de contenu interactif | |
| Gondwe | Exploring the multifaceted nature of generative AI in journalism studies: A typology of scholarly definitions | |
| Ren et al. | Re-understanding of data storytelling tools from a narrative perspective | |
| McLean Davies et al. | Reading in the (post) digital age: Large databases and the future of literature in secondary English classrooms | |
| Bradley et al. | Supporting annotation as a scholarly tool—Experiences from the online Chopin Variorum edition | |
| Zhang | Development and implementation of a deep learning-based smart evaluation system for animated films | |
| Marshall et al. | How researchers use diagrams in communicating neural network systems | |
| Blekanov et al. | Mapping opinion cumulation: topic modeling-based dynamic summarization of user discussions on social networks | |
| Sankar et al. | Evaluating Qualitative Data Analysis Tools: A Comparative Study of NVivo and ATLAS. ti | |
| Xia et al. | Analyzing corporate ESG reporting through data mining: evolutionary trends and strategic model | |
| Okhdar et al. | English vocabulary learning through recommender system based on sentence complexity and vocabulary difficulty | |
| EP2715525A2 (fr) | Méthode et système de consommation assistée par ordinateur d'informations provenant de fichiers de données d'applications | |
| Gueguen | Digitized special collections and multiple user groups | |
| Guedes et al. | Machine-driven content marketing | |
| Niedbalski et al. | Use of selected CAQDA software examples in a research project based on the grounded theory methodology | |
| Gunnarsson et al. | Automation of audio descriptions of large bar charts for persons with visual impairment: Prototyping and proof of concept | |
| Bulock et al. | Opening the book | |
| Dovbysh | Automating the News: How Algorithms are Rewriting the Media: by Nicholas Diakopoulus, Harvard University Press: Cambridge, Massachusetts, 2019, 336 pp., $29.95, ISBN 9780674976986 | |
| Konch et al. | Enhancing Library Services with AI Solutions: Empowering Librarians Through Cutting-Edge Tools—A Study | |
| Mandal | 19 Digital Humanities |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| 17P | Request for examination filed |
Effective date: 20131223 |
|
| AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
| DAX | Request for extension of the european patent (deleted) | ||
| R17D | Deferred search report published (corrected) |
Effective date: 20141231 |
|
| A4 | Supplementary search report drawn up and despatched |
Effective date: 20160415 |
|
| RIC1 | Information provided on ipc code assigned before grant |
Ipc: G06F 17/24 20060101ALI20160411BHEP Ipc: G06F 9/06 20060101AFI20160411BHEP |
|
| 17Q | First examination report despatched |
Effective date: 20190207 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
| 18D | Application deemed to be withdrawn |
Effective date: 20201201 |