WO2024251060A1 - Image and text typesetting method and apparatus, medium, and electronic device
- Publication number
- WO2024251060A1 (PCT/CN2024/096949)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- laid out
- saliency
- map
- candidate
- candidate frame
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/53—Querying
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/5866—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/30—Computing systems specially adapted for manufacturing
Definitions
- the present disclosure relates to the field of computer technology, and in particular to a graphic typesetting method, device, medium and electronic device.
- GAN Generative Adversarial Network
- the present disclosure provides a method for typesetting text and graphics, comprising: obtaining a saliency map based on a background map; searching on the saliency map in combination with a material to be laid out to obtain a candidate frame of the material to be laid out; determining a saliency value of the candidate frame; and determining a placement position of the material to be laid out from the candidate frame based on the saliency value.
- the present disclosure provides a graphic typesetting device, comprising: an acquisition module, configured to obtain a saliency map based on a background map; a search module, configured to search on the saliency map in combination with a material to be laid out, to obtain a candidate frame of the material to be laid out; a first determination module, configured to determine a saliency value of the candidate frame; and a second determination module, configured to determine a placement position of the material to be laid out from the candidate frame based on the saliency value.
- the present disclosure provides a computer-readable medium having a computer program stored thereon, which, when executed by a processing device, implements the steps of any one of the methods described in the first aspect.
- the present disclosure provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device configured to execute the computer program in the storage device to implement the steps of any one of the methods described in the first aspect.
- the present disclosure provides a computer program product, comprising instructions, which, when executed by a processor, cause the processor to implement the steps of any one of the methods described in the first aspect.
- the present disclosure provides a computer program, comprising program code, which, when executed by a processor, causes the steps of any one of the methods described in the first aspect to be implemented.
- FIG. 1 is a flow chart of a method for typesetting text and images according to an embodiment of the present disclosure.
- FIG. 2 shows a schematic diagram of a background map and its corresponding saliency map.
- FIG. 3 is another flow chart of a method for typesetting text and images according to an embodiment of the present disclosure.
- FIG. 4 is another flow chart of a method for typesetting text and images according to an embodiment of the present disclosure.
- FIG. 5 shows a schematic diagram of extracting candidate boxes of multiple scales using a sliding window pyramid.
- FIG. 6 is a schematic diagram of placement positions of materials to be laid out according to an embodiment of the present disclosure.
- FIG. 7 is a schematic block diagram of a graphic typesetting device according to an embodiment of the present disclosure.
- FIG. 8 shows a schematic structural diagram of an electronic device suitable for implementing the embodiments of the present disclosure.
- a prompt message is sent to the user to clearly prompt the user that the operation requested to be performed will require obtaining and using the user's personal information.
- the user can autonomously choose whether to provide personal information to software or hardware such as an electronic device, application, server, or storage medium that performs the operation of the technical solution of the present disclosure according to the prompt message.
- in response to receiving an active request from the user, the prompt information may be sent to the user, for example in the form of a pop-up window in which the prompt information is presented as text.
- the pop-up window may also carry a selection control for the user to choose "agree" or "disagree" to provide personal information to the electronic device.
- the resulting image after arrangement/layout usually has the problem that the text box position is not harmonious and beautiful, or the text box blocks the main body of the poster.
- the present disclosure proposes an improved graphic layout scheme, which will be described below with reference to the accompanying drawings.
- Fig. 1 is a flow chart of a method for typesetting text and images according to an embodiment of the present disclosure. As shown in Fig. 1, the method for typesetting text and images includes the following steps S11 to S14.
- in step S11, a saliency map is obtained based on the background map.
- the background map is the image on which the material to be laid out is placed.
- the material to be laid out can be a text box or other materials.
- the saliency map is an image used to show the uniqueness of each pixel in the background image.
- the size of the saliency map is the same as that of the background image.
- Figure 2 shows a schematic diagram of the background image and its corresponding saliency map.
- the white part represents the salient area and the black part represents the non-salient area.
- in step S12, a search is performed on the saliency map in combination with the material to be laid out to obtain a candidate frame of the material to be laid out.
- the search may be performed based on the shape parameters of the material to be laid out, such as its size and shape; for example, a frame that matches the shape of the material to be laid out is found as a candidate frame.
- the search may be performed on the saliency map in combination with the material to be laid out in a sliding window manner. That is, a window with the shape of the material to be laid out is slid over the entire saliency map to obtain all candidate frames in which the material to be laid out can be placed.
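The sliding-window search described above can be sketched as follows. This is a minimal illustration, not the disclosure's implementation: the function name `sliding_window_candidates`, the `(x, y, width, height)` box layout, and the `stride` parameter are all assumptions introduced here.

```python
from typing import List, Tuple

# Assumed box layout: (x, y, width, height), with (x, y) the top-left corner.
Box = Tuple[int, int, int, int]


def sliding_window_candidates(
    map_width: int,
    map_height: int,
    box_width: int,
    box_height: int,
    stride: int = 1,
) -> List[Box]:
    """Enumerate every window of the material's size that fits inside the map."""
    if box_width > map_width or box_height > map_height:
        return []  # the material does not fit anywhere on the map
    return [
        (x, y, box_width, box_height)
        for y in range(0, map_height - box_height + 1, stride)
        for x in range(0, map_width - box_width + 1, stride)
    ]
```

A larger `stride` trades placement precision for fewer candidates; the source does not say which step size is used.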
- there may be one or more candidate frames.
- in step S13, the saliency value of the candidate box is determined.
- the saliency value of each candidate box among all the candidate boxes where the material to be laid out can be placed is determined.
- an integral map of the saliency map may be determined first, and the saliency value of each candidate box may then be determined based on the integral map.
- the integral map stores an integral value for each pixel of the image: the integral value of a pixel is the sum of all elements in the rectangle whose upper-left vertex is the upper-left pixel of the image and whose lower-right vertex is that pixel.
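As a hedged sketch of the integral-map idea, the code below builds the table once and then reads the sum of any candidate box with four lookups. The function names and the one-cell zero padding are assumptions made here for convenience; the padding shifts indices by one relative to the per-pixel definition above but yields the same sums. The text does not say whether the saliency value of a box is the sum or the mean of its pixels; the sum is assumed.

```python
from typing import List


def integral_map(saliency: List[List[float]]) -> List[List[float]]:
    """Return an (H+1) x (W+1) table; table[y][x] is the sum of saliency[0:y][0:x]."""
    height, width = len(saliency), len(saliency[0])
    table = [[0.0] * (width + 1) for _ in range(height + 1)]
    for y in range(height):
        row_sum = 0.0
        for x in range(width):
            row_sum += saliency[y][x]
            # Sum above this row plus the running sum of the current row.
            table[y + 1][x + 1] = table[y][x + 1] + row_sum
    return table


def box_saliency(table: List[List[float]], x: int, y: int, w: int, h: int) -> float:
    """Sum of saliency inside the box with top-left (x, y), width w, height h."""
    return table[y + h][x + w] - table[y][x + w] - table[y + h][x] + table[y][x]
```

Building the table costs one pass over the map, after which each candidate box is scored in constant time regardless of its size.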
- in step S14, based on the saliency value, the placement position of the material to be laid out is determined from the candidate frames.
- a specific candidate frame is selected, from all candidate frames where the material to be laid out can be placed, as the place to put the material to be laid out.
- the candidate box with the smallest saliency value can be selected as the placement position of the material to be laid out; alternatively, the N candidate boxes with the smallest saliency values can be selected by sorting the saliency values, so that the user can decide in which of these N candidate boxes to place the material to be laid out.
- because the placement position is determined based on the saliency value, the material to be laid out is placed in a non-salient area of the background map rather than in a salient area. This greatly reduces occlusion between materials during automated layout (for example, a text box occluding the subject of the background map) and improves the aesthetics of the automatically generated poster.
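The selection rule in step S14 can be illustrated with a short sketch. The helper name `least_salient` and the `top_n` parameter are hypothetical; `top_n=1` corresponds to picking the single least salient box, and a larger value returns the N boxes offered to the user.

```python
from typing import List, Sequence, Tuple

# Assumed box layout: (x, y, width, height).
Box = Tuple[int, int, int, int]


def least_salient(
    candidates: Sequence[Box],
    saliency_values: Sequence[float],
    top_n: int = 1,
) -> List[Box]:
    """Return the top_n candidates with the smallest saliency values."""
    # Sort candidate indices by saliency, lowest first.
    order = sorted(range(len(candidates)), key=lambda i: saliency_values[i])
    return [candidates[i] for i in order[:top_n]]
```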
- FIG. 3 is another flow chart of a method for typesetting text and images according to an embodiment of the present disclosure.
- in step S31, a saliency map is obtained based on the background map.
- in step S32, the material to be laid out that is to be searched on the saliency map is selected from the multiple materials to be laid out according to size priority.
- the multiple materials to be laid out can be sorted by area, and the material to be searched on the saliency map is selected in descending order of area.
- in step S33, a search is performed on the saliency map in combination with the selected material to be laid out to obtain a candidate frame of the selected material to be laid out.
- in step S34, the saliency value of the candidate box is determined. How to determine the saliency value has been described in detail above and is not repeated here.
- in step S35, based on the saliency value, the placement position of the selected material to be laid out is determined from the candidate frame.
- in step S36, the determined placement position is marked as salient on the saliency map. In this way, the placement position is treated as a salient area in subsequent processing, and no other material to be laid out will be placed there.
- in step S37, it is determined whether all the materials to be laid out have been placed. If so, the graphic layout process ends. If not, the process returns to step S32 to place the next material to be laid out.
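Steps S32 to S37 amount to a greedy loop, sketched below under several assumptions: saliency values lie in [0, 1], an occupied region is marked by writing the hypothetical constant `SALIENT`, box saliency is a plain sum computed by brute force for brevity, and ties are broken by scan order. The names `place_all` and `box_sum` are introduced here for illustration only.

```python
from typing import Dict, List, Optional, Tuple

# Assumed box layout: (x, y, width, height).
Box = Tuple[int, int, int, int]

SALIENT = 1.0  # assumed value used to mark an occupied region as salient


def box_sum(saliency: List[List[float]], box: Box) -> float:
    """Brute-force sum of saliency inside a box (an integral map would be faster)."""
    x, y, w, h = box
    return sum(sum(row[x:x + w]) for row in saliency[y:y + h])


def place_all(
    saliency: List[List[float]],
    materials: Dict[str, Tuple[int, int]],
) -> Dict[str, Optional[Box]]:
    """Place materials (name -> (width, height)) largest-area first."""
    work = [row[:] for row in saliency]  # keep the caller's map untouched
    height, width = len(work), len(work[0])
    placements: Dict[str, Optional[Box]] = {}
    by_area = sorted(materials.items(), key=lambda kv: kv[1][0] * kv[1][1], reverse=True)
    for name, (w, h) in by_area:  # S32: size priority
        candidates = [
            (x, y, w, h)
            for y in range(height - h + 1)
            for x in range(width - w + 1)
        ]  # S33: sliding-window search
        if not candidates:
            placements[name] = None  # the material does not fit on the map
            continue
        best = min(candidates, key=lambda b: box_sum(work, b))  # S34 and S35
        placements[name] = best
        bx, by, bw, bh = best
        for yy in range(by, by + bh):  # S36: mark the chosen region as salient
            for xx in range(bx, bx + bw):
                work[yy][xx] = SALIENT
    return placements
```

Marking the chosen region before the next iteration is what keeps later, smaller materials from overlapping earlier ones.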
- Fig. 4 is another flow chart of a method for typesetting text and graphics according to an embodiment of the present disclosure.
- the method for typesetting text and graphics can be applied to a scenario where an existing poster is resized to obtain a new poster, for example, an existing horizontal poster is resized to obtain a vertical poster.
- in step S41, the material to be laid out is cut out from the poster to be resized to obtain a background image, and the background image is resized to obtain a background base image.
- for example, in step S41, the material to be laid out (such as a text box) can first be cut out from the existing horizontal poster to obtain a background image, and the background image can then be resized to obtain a vertical background base image.
- in step S42, a saliency map is obtained based on the background map.
- in step S43, the material to be laid out that is to be searched on the saliency map is selected from the multiple materials to be laid out according to size priority.
- the multiple materials to be laid out can be sorted by area, and the material to be searched on the saliency map is selected in descending order of area.
- in step S44, a search is performed on the saliency map based on the selected material to be laid out to obtain a candidate frame of the selected material to be laid out.
- in step S45, the saliency value of the candidate box is determined. How to determine the saliency value has been described in detail above and is not repeated here.
- in step S46, the saliency value is processed based on the position and size of the material to be laid out in the poster to be resized, and the placement position of the selected material to be laid out is determined from the candidate frame based on the processed saliency value.
- the saliency value can be processed based on the position and size of the material to be laid out in the poster to be resized as follows: first, the similarity between the position and size of the candidate box and the position and size of the material to be laid out in the poster to be resized is calculated.
- the closer the two are in position and size, the higher the similarity; then, the similarity is normalized, for example to a range of 0 to 1; finally, the normalized similarity is multiplied by the saliency value to obtain the processed saliency value.
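The three operations above (compute similarity, normalize, multiply) can be sketched as follows. The text does not specify the similarity measure, so this sketch assumes one: the negative Euclidean distance between the boxes' centre and size expressed as fractions of their respective canvases, min-max normalized to the range 0 to 1. The function names are hypothetical, and how the processed values are then ranked is left open because the text does not state it.

```python
import math
from typing import List, Sequence, Tuple

# Assumed box layout: (x, y, width, height); a canvas is (width, height).
Box = Tuple[int, int, int, int]
Canvas = Tuple[int, int]


def relative_geometry(box: Box, canvas: Canvas) -> Tuple[float, float, float, float]:
    """Centre and size of a box as fractions of the canvas dimensions."""
    x, y, w, h = box
    canvas_w, canvas_h = canvas
    return ((x + w / 2) / canvas_w, (y + h / 2) / canvas_h, w / canvas_w, h / canvas_h)


def processed_saliency(
    candidates: Sequence[Box],
    saliency_values: Sequence[float],
    new_canvas: Canvas,
    original_box: Box,
    original_canvas: Canvas,
) -> List[float]:
    """Multiply each saliency value by a similarity normalized to [0, 1]."""
    target = relative_geometry(original_box, original_canvas)
    # Assumed similarity: a smaller geometric distance means a higher similarity.
    raw = [-math.dist(relative_geometry(b, new_canvas), target) for b in candidates]
    low, span = min(raw), max(raw) - min(raw)
    normalized = [(r - low) / span if span else 1.0 for r in raw]
    return [s * v for s, v in zip(normalized, saliency_values)]
```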
- in step S47, the determined placement position is marked as salient on the saliency map. In this way, the placement position is treated as a salient area in subsequent processing, and no other material to be laid out will be placed there.
- in step S48, it is determined whether all the materials to be laid out have been placed. If so, the graphic layout process ends. If not, the process returns to step S43 to place the next material to be laid out.
- before searching on the saliency map in combination with the material to be laid out, the material to be laid out may be subjected to a multi-scale transformation; for example, it may be enlarged or reduced, with a ratio of enlargement or reduction in the range of, for example, 1 to 2 times.
- in this way, the material to be laid out at its original size, an enlarged version, a reduced version, and so on may be obtained.
- the multi-scale transformed versions of the material to be laid out may then be searched on the saliency map in sequence.
- for example, the material to be laid out 1 at a first scale, at a second scale, and at a third scale may be searched on the saliency map in sequence, thereby obtaining the candidate boxes for the material to be laid out 1.
- This processing method may also be referred to as extracting candidate boxes of multiple scales using a sliding window pyramid.
- Figure 5 shows a schematic diagram of extracting candidate boxes of multiple scales using a sliding window pyramid. Each rectangular box in Figure 5 is one of the extracted multi-scale candidate boxes.
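A sliding-window pyramid of this kind can be sketched by repeating the window search once per scale. The default factors `(0.5, 1.0, 2.0)` are an assumed reading of "enlarged or reduced by 1 to 2 times", and the function names, rounding rule, and box layout are illustrative choices, not details from the disclosure.

```python
from typing import List, Sequence, Tuple

# Assumed box layout: (x, y, width, height).
Box = Tuple[int, int, int, int]


def scaled_sizes(
    width: int, height: int, factors: Sequence[float] = (0.5, 1.0, 2.0)
) -> List[Tuple[int, int]]:
    """Return the material's size at each scale, never smaller than one pixel."""
    return [(max(1, round(width * f)), max(1, round(height * f))) for f in factors]


def pyramid_candidates(
    map_width: int,
    map_height: int,
    width: int,
    height: int,
    factors: Sequence[float] = (0.5, 1.0, 2.0),
    stride: int = 1,
) -> List[Box]:
    """Slide a window over the map once per scale and collect every box that fits."""
    boxes: List[Box] = []
    for w, h in scaled_sizes(width, height, factors):
        # An empty range here means this scale does not fit on the map.
        for y in range(0, map_height - h + 1, stride):
            for x in range(0, map_width - w + 1, stride):
                boxes.append((x, y, w, h))
    return boxes
```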
- determining the placement position of the material to be laid out from the candidate frames may include: for candidate frames with similar positions and a difference in saliency values less than a preset threshold, using a non-maximum suppression method to determine the placement position of the material to be laid out from the candidate frames. In this way, it is possible to ensure that a more appropriate placement position is determined for the material to be laid out.
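One possible reading of this non-maximum suppression step is sketched below. Because a lower saliency value is the better score here, candidates are visited from least to most salient, and a candidate is dropped when it overlaps an already kept box and their saliency values are close. Measuring "similar positions" by intersection over union, both threshold defaults, and the function names are assumptions.

```python
from typing import List, Sequence, Tuple

# Assumed box layout: (x, y, width, height).
Box = Tuple[int, int, int, int]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    inter_w = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    inter_h = max(0, min(ay + ah, by + bh) - max(ay, by))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter
    return inter / union if union else 0.0


def suppress(
    candidates: Sequence[Box],
    saliency_values: Sequence[float],
    iou_threshold: float = 0.5,
    saliency_threshold: float = 1.0,
) -> List[Box]:
    """Keep the least salient box of each group of near-duplicate candidates."""
    order = sorted(range(len(candidates)), key=lambda i: saliency_values[i])
    kept: List[int] = []
    for i in order:
        near_duplicate = any(
            iou(candidates[i], candidates[k]) > iou_threshold
            and abs(saliency_values[i] - saliency_values[k]) < saliency_threshold
            for k in kept
        )
        if not near_duplicate:
            kept.append(i)
    return [candidates[k] for k in kept]
```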
- FIG. 6 is a schematic diagram of the placement position of the material to be laid out according to an embodiment of the present disclosure.
- the rectangular box in the upper left corner is the final candidate box position, and the upper left corner of the right figure shows the material to be laid out placed in the candidate box. It can be seen that the final placement position of the material to be laid out does not block the subject in the background base map.
- the graphic typesetting device provided by the embodiment of the present disclosure can execute the graphic typesetting method provided by any embodiment of the present disclosure.
- the embodiment of the present disclosure can divide the device into functional units according to the above method example. For example, each functional module/unit can be divided corresponding to each function, or two or more functions can be integrated into one processing module. It is worth noting that the various modules/units included in the above device are only divided according to functional logic, but are not limited to the division described in the text.
- each module/unit can be implemented in various appropriate ways, such as hardware, firmware, or any appropriate combination.
- Fig. 7 is a schematic block diagram of a graphic typesetting device according to an embodiment of the present disclosure.
- the graphic typesetting device includes: an acquisition module 71, configured to obtain a saliency map based on a background base map; a search module 72, configured to search on the saliency map in combination with the material to be laid out, and obtain a candidate frame of the material to be laid out; a first determination module 73, configured to determine the saliency value of the candidate frame; and a second determination module 74, configured to determine the placement position of the material to be laid out from the candidate frame based on the saliency value.
- there are multiple materials to be laid out, and the search module 72 is further configured to: select, according to size priority, the material to be laid out that is to be searched on the saliency map from the multiple materials to be laid out; and perform the search on the saliency map in combination with the selected material to be laid out.
- the search module 72 is further configured to: perform multi-scale transformation on the material to be laid out; and sequentially search on the saliency map based on the material to be laid out after the multi-scale transformation.
- the first determination module 73 is further configured to: determine an integral map of the saliency map; and determine a saliency value of the candidate box based on the integral map.
- the graphic typesetting device further includes a marking module configured to: after the placement position of the material to be laid out is determined from the candidate frame, mark the determined placement position as salient on the saliency map.
- the graphic typesetting device further includes a cutting module configured to: cut out the material to be laid out from the poster to be resized to obtain a background image; and resize the background image to obtain the background base image.
- the second determination module 74 is further configured to: process the saliency value based on the position and size of the material to be laid out in the poster to be resized; and determine the placement position of the material to be laid out from the candidate frame based on the processed saliency value.
- the second determination module 74 may also be configured to: calculate the similarity between the position and size of the candidate box and the position and size of the material to be laid out in the poster to be resized; normalize the similarity; and multiply the normalized similarity by the saliency value to obtain the processed saliency value.
- the second determination module 74 may also be configured to: for candidate frames that are similar in position and whose saliency values differ by less than a preset threshold, use a non-maximum suppression method to determine the placement position of the material to be laid out from the candidate frames.
- the embodiment of the present disclosure further provides a computer-readable medium having a computer program stored thereon, which implements the steps of any method in the present disclosure when the program is executed by a processing device.
- An embodiment of the present disclosure further provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device configured to execute the computer program in the storage device to implement the steps of any method in the present disclosure.
- the electronic device includes hardware structures and/or software modules corresponding to executing each function.
- the present disclosure can be implemented in the form of hardware or a combination of hardware and computer software. Whether a function is executed by hardware or by computer software driving hardware depends on the specific application and design constraints of the technical solution. Technicians may use different methods to implement the described functionality, but such implementation should not be considered beyond the scope of the present disclosure.
- the terminal device in the embodiment of the present disclosure may include, but is not limited to, mobile terminals such as mobile phones, laptop computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), vehicle-mounted terminals (such as vehicle-mounted navigation terminals), etc., and fixed terminals such as digital TVs, desktop computers, etc.
- the electronic device shown in FIG. 8 is only an example and should not bring any limitation to the functions and scope of use of
- the electronic device 600 may include a processing device (e.g., a central processing unit, a graphics processing unit, etc.) 601, which can perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 602 or a program loaded from a storage device 608 to a random access memory (RAM) 603.
- in the RAM 603, various programs and data required for the operation of the electronic device 600 are also stored.
- the processing device 601, the ROM 602, and the RAM 603 are connected to each other via a bus 604.
- An input/output (I/O) interface 605 is also connected to the bus 604.
- the following devices may be connected to the I/O interface 605: input devices 606 including, for example, a touch screen, a touchpad, a keyboard, a mouse, a camera, a microphone, an accelerometer, a gyroscope, etc.; output devices 607 including, for example, a liquid crystal display (LCD), a speaker, a vibrator, etc.; storage devices 608 including, for example, a magnetic tape, a hard disk, etc.; and communication devices 609.
- the communication device 609 may allow the electronic device 600 to communicate wirelessly or wired with other devices to exchange data.
- although FIG. 8 shows an electronic device 600 with various devices, it should be understood that it is not required to implement or have all the devices shown. More or fewer devices may alternatively be implemented or provided.
- an embodiment of the present disclosure includes a computer program product, which includes a computer program carried on a non-transitory computer-readable medium, and the computer program contains program code for executing the method shown in the flowchart.
- the computer program can be downloaded and installed from a network through a communication device 609, or installed from a storage device 608, or installed from a ROM 602.
- when the computer program is executed by the processing device 601, the above-mentioned functions defined in the method of the embodiment of the present disclosure are executed.
- the above-mentioned computer-readable medium of the present disclosure may be a computer-readable signal medium or a computer-readable storage medium or any combination of the above two.
- the computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above.
- Computer-readable storage media may include, but are not limited to: an electrical connection with one or more wires, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
- a computer-readable storage medium may be any tangible medium containing or storing a program that can be used by or in combination with an instruction execution system, apparatus, or device.
- a computer-readable signal medium may include a data signal propagated in a baseband or as part of a carrier wave, which carries a computer-readable program code. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
- a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which may send, propagate, or transmit a program for use by or in combination with an instruction execution system, apparatus, or device.
- the program code contained on the computer-readable medium may be transmitted using any suitable medium, including but not limited to: wires, optical cables, RF (radio frequency), etc., or any suitable combination of the above.
- the client and server may communicate using any currently known or future developed network protocol such as HTTP (HyperText Transfer Protocol), and may be interconnected with any form or medium of digital data communication (e.g., a communication network).
- Examples of communication networks include a local area network ("LAN"), a wide area network ("WAN"), an internet (e.g., the Internet), and a peer-to-peer network (e.g., an ad hoc peer-to-peer network), as well as any currently known or future developed network.
- the computer-readable medium may be included in the electronic device, or may exist independently without being installed in the electronic device.
- the computer-readable medium carries one or more programs, and when the one or more programs are executed by the electronic device, the electronic device executes the graphic typesetting method provided in any embodiment of the present disclosure.
- Computer program code for performing the operations of the present disclosure may be written in one or more programming languages or a combination thereof, including, but not limited to, object-oriented programming languages, such as Java, Smalltalk, C++, and conventional procedural programming languages, such as "C" or similar programming languages.
- the program code may be executed entirely on the user's computer, partially on the user's computer, as a separate software package, partially on the user's computer and partially on a remote computer, or entirely on a remote computer or server.
- the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (e.g., via the Internet using an Internet service provider).
- each box in the flowchart or block diagram may represent a module, a program segment, or a portion of a code, which contains one or more executable instructions for implementing the specified logical functions.
- the functions marked in the boxes may also occur in an order different from that marked in the accompanying drawings. For example, two boxes represented in succession may actually be executed substantially in parallel, and they may sometimes be executed in the opposite order, depending on the functions involved.
- each box in the block diagram and/or flowchart, and any combination of boxes in the block diagram and/or flowchart, may be implemented using a dedicated hardware-based system that performs the specified functions or operations, or may be implemented using a combination of dedicated hardware and computer instructions.
- modules involved in the embodiments described in the present disclosure may be implemented by software or hardware.
- the name of a module does not limit the module itself in some cases.
- an acquisition module may also be described as a "module for obtaining a saliency map based on a background map".
- exemplary types of hardware logic components include: field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chips (SOCs), complex programmable logic devices (CPLDs), and the like.
- a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, device, or equipment.
- a machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
- a machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or equipment, or any suitable combination of the foregoing.
- a more specific example of a machine-readable storage medium may include an electrical connection based on one or more lines, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
- Example 1 provides a graphic typesetting method, including: obtaining a saliency map based on a background map; searching on the saliency map in combination with the material to be laid out to obtain a candidate frame of the material to be laid out; determining a saliency value of the candidate frame; and determining a placement position of the material to be laid out from the candidate frame based on the saliency value.
- Example 2 provides the method of Example 1, wherein there are multiple materials to be laid out, and the method further comprises: selecting, from the multiple materials to be laid out and according to size priority, the material to be laid out that is to be searched on the saliency map;
- the searching on the saliency map in combination with the material to be laid out includes: searching on the saliency map in combination with the selected material to be laid out.
- Example 3 provides the method of Example 1, wherein the method further comprises: performing a multi-scale transformation on the material to be laid out;
- the searching on the saliency map in combination with the material to be laid out includes: searching on the saliency map in sequence based on the material to be laid out after multi-scale transformation.
- Example 4 provides the method of Example 1, wherein the determining of the saliency value of the candidate frame includes: determining an integral map of the saliency map; and determining the saliency value of the candidate frame based on the integral map.
- Example 5 provides the method of Example 1, wherein the method further includes: after determining the placement position of the material to be laid out from the candidate frame, marking the determined placement position as salient on the saliency map.
- Example 6 provides the method of any one of Examples 1 to 5, wherein the method further comprises: cutting out the material to be laid out from the poster to be resized to obtain a background image; resizing the background image to obtain the background base image;
- the step of determining the placement position of the material to be laid out from the candidate frame based on the saliency value comprises: processing the saliency value based on the position and size of the material to be laid out in the poster to be resized; and determining the placement position of the material to be laid out from the candidate frame based on the processed saliency value.
- Example 7 provides the method of Example 6, wherein processing the saliency value based on the position and size of the material to be laid out in the poster to be resized includes: calculating the similarity between the position and size of the candidate frame and the position and size of the material to be laid out in the poster to be resized; normalizing the similarity; and multiplying the normalized similarity by the saliency value to obtain the processed saliency value.
- Example 8 provides the method of any one of Examples 1 to 5 and 7, wherein the step of determining the placement position of the material to be laid out from the candidate frame based on the saliency value comprises: for candidate frames that are similar in position and whose saliency values differ by less than a preset threshold, using a non-maximum suppression method to determine the placement position of the material to be laid out from the candidate frame.
- Example 9 provides a graphic typesetting device, comprising: an acquisition module, configured to obtain a saliency map based on a background map; a search module, configured to search on the saliency map in combination with the material to be laid out, to obtain a candidate frame of the material to be laid out; a first determination module, configured to determine a saliency value of the candidate frame; and a second determination module, configured to determine a placement position of the material to be laid out from the candidate frame based on the saliency value.
- Example 10 provides a computer-readable medium having a computer program and/or instructions stored thereon, which implement the steps of any of the methods described in Examples 1-8 when executed by a processing device.
- Example 11 provides an electronic device, comprising: a storage device on which a computer program and/or instructions are stored; and a processing device configured to execute the computer program and/or instructions in the storage device to implement the steps of any one of the methods described in Examples 1-8.
- Example 12 provides a computer program product, comprising instructions, which, when executed by a processor, enable the processor to implement the steps of the method described in any one of Examples 1 to 8.
- Example 13 provides a computer program, including program code, which, when executed by a processor, causes the steps of the method described in any one of Examples 1 to 8 to be implemented.
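
Examples 1, 4 and 5 can be read together as a sliding-window search over the saliency map. The Python sketch below is illustrative only and is not taken from the disclosure: the function names (`integral_image`, `box_saliency`, `place_material`, `mark_salient`), the exhaustive stride-based search, and the choice of the candidate frame with the lowest summed saliency are all assumptions made for the example.

```python
from typing import List, Tuple

# Hypothetical box layout for this sketch: (x, y, width, height) in pixels.
Box = Tuple[int, int, int, int]


def integral_image(saliency: List[List[float]]) -> List[List[float]]:
    """Build a summed-area table padded with a leading row and column of zeros."""
    rows, cols = len(saliency), len(saliency[0])
    table = [[0.0] * (cols + 1) for _ in range(rows + 1)]
    for y in range(rows):
        row_sum = 0.0
        for x in range(cols):
            row_sum += saliency[y][x]
            table[y + 1][x + 1] = table[y][x + 1] + row_sum
    return table


def box_saliency(table: List[List[float]], box: Box) -> float:
    """Sum of saliency inside the box, using four lookups in the integral map."""
    x, y, w, h = box
    return table[y + h][x + w] - table[y][x + w] - table[y + h][x] + table[y][x]


def candidate_frames(map_w: int, map_h: int, w: int, h: int, stride: int = 1) -> List[Box]:
    """Enumerate every position where a w-by-h material fits on the map."""
    return [
        (x, y, w, h)
        for y in range(0, map_h - h + 1, stride)
        for x in range(0, map_w - w + 1, stride)
    ]


def place_material(saliency: List[List[float]], w: int, h: int, stride: int = 1) -> Box:
    """Pick the candidate frame with the lowest summed saliency (an assumption)."""
    table = integral_image(saliency)
    frames = candidate_frames(len(saliency[0]), len(saliency), w, h, stride)
    if not frames:
        raise ValueError("material does not fit on the saliency map")
    return min(frames, key=lambda frame: box_saliency(table, frame))


def mark_salient(saliency: List[List[float]], box: Box, value: float = 1.0) -> None:
    """Mark a placed region as salient so later materials avoid it (Example 5)."""
    x, y, w, h = box
    for yy in range(y, y + h):
        for xx in range(x, x + w):
            saliency[yy][xx] = value
```

With the integral map, the saliency of each candidate frame costs four table lookups regardless of frame size, which is why it suits a search over many frames. For several materials, one would call `place_material` and then `mark_salient` for each material in turn, rebuilding the integral map after every placement.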
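
The weighting in Example 7 and the suppression in Example 8 can be sketched in the same spirit. The disclosure text here does not say which similarity measure, which normalization, or which ordering of scores is used, so the sketch below assumes intersection-over-union as the similarity, min-max normalization, and the conventional non-maximum suppression ordering in which a higher score is better (negate the scores for the opposite convention). The names `iou`, `processed_saliency` and `suppress_near_duplicates`, and both thresholds, are hypothetical.

```python
from typing import List, Sequence, Tuple

# Hypothetical box layout for this sketch: (x, y, width, height),
# expressed in the same coordinate system for every box compared.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    inter_w = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    inter_h = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def processed_saliency(
    candidates: Sequence[Box], saliency_values: Sequence[float], original: Box
) -> List[float]:
    """Multiply each saliency value by a min-max normalized similarity (assumed IoU)."""
    if not candidates:
        return []
    sims = [iou(c, original) for c in candidates]
    lo, hi = min(sims), max(sims)
    span = hi - lo
    # If every candidate is equally similar, leave the saliency values unchanged.
    normalized = [(s - lo) / span if span > 0 else 1.0 for s in sims]
    return [n * v for n, v in zip(normalized, saliency_values)]


def suppress_near_duplicates(
    candidates: Sequence[Box],
    scores: Sequence[float],
    iou_threshold: float = 0.5,
    score_tolerance: float = 0.1,
) -> List[int]:
    """Greedy NMS: return indices of kept candidates, best score first.

    A candidate is dropped only if it overlaps an already kept candidate
    (IoU >= iou_threshold) and its score is within score_tolerance of it.
    """
    order = sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
    kept: List[int] = []
    for i in order:
        duplicate = any(
            iou(candidates[i], candidates[k]) >= iou_threshold
            and abs(scores[i] - scores[k]) < score_tolerance
            for k in kept
        )
        if not duplicate:
            kept.append(i)
    return kept
```

The double condition in `suppress_near_duplicates` mirrors the wording of Example 8, which applies suppression only to frames that are both close in position and close in value; frames that overlap but differ clearly in score are left for the ordinary score-based selection.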
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Processing Or Creating Images (AREA)
Abstract
The present disclosure relates to a graphic typesetting method and apparatus, a medium, and an electronic device. The graphic typesetting method comprises: obtaining a saliency map based on a background base map; searching on the saliency map in combination with a material to be laid out, to obtain candidate frames of that material; determining saliency values of the candidate frames; and determining, based on the saliency values, a placement position of the material from among the candidate frames.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202310667291.9 | 2023-06-06 | | |
| CN202310667291.9A CN116541544A (zh) | 2023-06-06 | 2023-06-06 | 图文排版方法、装置、介质和电子设备 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024251060A1 (fr) | 2024-12-12 |
Family
ID=87447189
Family Applications (1)
| Application Number | Status | Publication | Priority Date | Filing Date | Title |
|---|---|---|---|---|---|
| PCT/CN2024/096949 | Ceased | WO2024251060A1 (fr) | 2023-06-06 | 2024-06-03 | Procédé et appareil de composition d'image et de texte, support et dispositif électronique |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN116541544A (fr) |
| WO (1) | WO2024251060A1 (fr) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116541544A (zh) * | 2023-06-06 | 2023-08-04 | 北京字跳网络技术有限公司 | 图文排版方法、装置、介质和电子设备 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110795925A (zh) * | 2019-10-12 | 2020-02-14 | 腾讯科技(深圳)有限公司 | 基于人工智能的图文排版方法、图文排版装置及电子设备 |
| CN111428447A (zh) * | 2020-02-28 | 2020-07-17 | 华东师范大学 | 一种基于显著性检测的智能图文排版方法 |
| US20220139013A1 (en) * | 2019-03-01 | 2022-05-05 | Huawei Technologies Co., Ltd. | Image and Text Typesetting Method and Related Apparatus Thereof |
| CN114677402A (zh) * | 2022-04-07 | 2022-06-28 | 北京智源人工智能研究院 | 海报文本布局、海报生成方法及相关装置 |
| CN116541544A (zh) * | 2023-06-06 | 2023-08-04 | 北京字跳网络技术有限公司 | 图文排版方法、装置、介质和电子设备 |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107545576A (zh) * | 2017-07-31 | 2018-01-05 | 华南农业大学 | 基于构图规则的图像编辑方法 |
- 2023-06-06: CN, application CN202310667291.9A, patent CN116541544A (zh), status: active, Pending
- 2024-06-03: WO, application PCT/CN2024/096949, patent WO2024251060A1 (fr), status: not active, Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| CN116541544A (zh) | 2023-08-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN112487331B (zh) | | 页面生成方法和装置、存储介质和电子设备 |
| CN112258622B (zh) | | 图像处理方法、装置、可读介质及电子设备 |
| WO2024251228A1 (fr) | | Procédé et appareil d'affichage de contenu multimédia, dispositif électronique et support de stockage |
| CN112418232B (zh) | | 图像分割方法、装置、可读介质及电子设备 |
| US20250022136A1 (en) | | Image cropping method and apparatus, model training method and apparatus, electronic device, and medium |
| CN114332324B (zh) | | 图像处理方法、装置、设备及介质 |
| US20230334880A1 (en) | | Hot word extraction method and apparatus, electronic device, and medium |
| CN112418054B (zh) | | 图像处理方法、装置、电子设备和计算机可读介质 |
| CN118132056A (zh) | | 页面开发方法、装置、设备及存储介质 |
| CN110619597A (zh) | | 一种半透明水印去除方法、装置、电子设备及存储介质 |
| CN113255812B (zh) | | 视频边框检测方法、装置和电子设备 |
| CN118840453A (zh) | | 文字颜色确定方法、装置、介质、电子设备及程序产品 |
| CN112418233B (zh) | | 图像处理方法、装置、可读介质及电子设备 |
| CN112257598B (zh) | | 图像中四边形的识别方法、装置、可读介质和电子设备 |
| WO2024251060A1 (fr) | | Procédé et appareil de composition d'image et de texte, support et dispositif électronique |
| WO2024120446A1 (fr) | | Procédés et appareil pour générer un élément d'effet spécial et une image à effet spécial, dispositif, et support de stockage |
| CN113762260A (zh) | | 一种版面图片的处理方法、装置、设备及存储介质 |
| CN115731313A (zh) | | Svg格式的图片的处理方法、装置、设备、介质及产品 |
| WO2025108303A1 (fr) | | Procédé et appareil de génération de vidéo, dispositif électronique et support de stockage |
| CN111258582B (zh) | | 一种窗口渲染方法、装置、计算机设备及存储介质 |
| CN114786069B (zh) | | 视频生成方法、装置、介质及电子设备 |
| WO2025045169A1 (fr) | | Procédé et appareil de traitement de modèle de traitement d'image, dispositif, support de stockage et produit |
| WO2024152797A1 (fr) | | Procédé et appareil de supplémentation de vidéo, support et dispositif électronique |
| WO2024198952A1 (fr) | | Procédé de super-résolution d'image, dispositif, support de stockage et produit programme |
| CN113256660B (zh) | | 图片处理方法、装置和电子设备 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 24818592; Country of ref document: EP; Kind code of ref document: A1 |
| | NENP | Non-entry into the national phase | Ref country code: DE |