Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-2891

Improve H3 parser

    XMLWordPrintable

Details

    • Improvement
    • Resolution: Fixed
    • Major
    • 7.5
    • 7.4.3
    • Heritrix 3
    • None
    • BNF

    Description

      Add these attributes to the HTML parser :

      img srcset

      <div class="lp-image-responsive"><img class="lp-image-responsive_img" src="http://cloudfront-eu-central-1.images.arcpublishing.com/leparisien/PSAGE45Q5NEPVI75KP4AVSZ4OE.jpg" alt="Les Insoumis, Jean-Luc Mélenchon en tête, appellent tous les opposants à la réforme des retraites à battre le pavé ce lundi 1er mai. LP/Arnaud Journois" srcset="/resizer/opD2fzNY_8TCVlehObW4PRYsMQ=/120x75/cloudfront-eu-central-1.images.arcpublishing.com/leparisien/PSAGE45Q5NEPVI75KP4AVSZ4OE.jpg 120w, /resizer/xZlmD3G7rAQ1A6yHH1_YnwdL3rw=/240x150/cloudfront-eu-central-1.images.arcpublishing.com/leparisien/PSAGE45Q5NEPVI75KP4AVSZ4OE.jpg 240w, /resizer/hIvEezJm81xiHmNW4QyYKm7_y3Q=/300x187/cloudfront-eu-central-1.images.arcpublishing.com/leparisien/PSAGE45Q5NEPVI75KP4AVSZ4OE.jpg 300w, /resizer/Yq8wR5hGgk2noF6rFM3IpnLGCJI=/360x225/cloudfront-eu-central-1.images.arcpublishing.com/leparisien/PSAGE45Q5NEPVI75KP4AVSZ4OE.jpg 360w, /resizer/T_8cJnL18KoYyOlLejO-2hz2YhM=/600x375/cloudfront-eu-central-1.images.arcpublishing.com/leparisien/PSAGE45Q5NEPVI75KP4AVSZ4OE.jpg 600w" sizes="(max-width: 739px) 120px, (min-width: 740px) 300px" loading="lazy" fetchpriority="low"></div>

      img data-full-src

      <img class="article-list__cover cover fit lazyload" data-full-src="/sites/default/files/styles/330x188/public/alexander-andrews-qjyxsc4xb84-unsplash.jpg" alt="Allergique à votre animal de compagnie? Il existe une solution">

      source srcset

      <figure class=" " min-height="0px 100px" max-height="180px" max-width="249px">                                
       <picture><source srcset="https://media.charentelibre.fr/14984042/600x375/img-2425.jpg?v=1682877717" media="(min-width: 481px) and (max-width: 668px)"><img src="https://media.charentelibre.fr/14984042/330x206/img-2425.jpg?v=1682877717" style="display: none;" alt="Grosse soirée reggae en ce moment place New-York à Angoulême jusqu’à 22h" data-lazy-loaded="data-lazy-loaded"></picture><img class="lazy thumbnail lazy_image-handled" data-lazy="{"src":"https:\/\/media.charentelibre.fr\/14984042\/330x206\/img-2425.jpg?v=1682877717","srcsets":[

      {"media":"(min-width: 481px) and (max-width: 668px)","url":"https:\/\/media.charentelibre.fr\/14984042\/600x375\/img-2425.jpg?v=1682877717"}

      ]}" alt="Grosse soirée reggae en ce moment place New-York à Angoulême jusqu’à 22h" src="data:image/gif;base64,iVBORw0KGgoAAAANSUhEUgAAABAAAAAKCAQAAAAXtxYXAAAAEUlEQVR42mNkIAAYRxUQpwAACGEAC42LFg0AAAAASUVORK5CYII=">
      <!--  -->
      </figure>

      source  data-lazy-srcset
      img  data-lazy

      <picture>
      <source class="js-media-live-srcset" data-lazy-srcset="https://img.lemde.fr/2023/05/08/182/0/3543/2362/220/146/30/0/11bfa3f_1683552799386-2023-04-22-heg-manoeuvre-0318.JPG" media="(min-width: 576px)">
      <img class="js-media-live" data-lazy="https://img.lemde.fr/2023/05/08/0/260/2834/2834/120/120/30/0/11bfa3f_1683552799386-2023-04-22-heg-manoeuvre-0318.JPG" alt=""> 
      <noscript><img src="https://img.lemde.fr/2023/05/08/182/0/3543/2362/220/146/30/0/11bfa3f_1683552799386-2023-04-22-heg-manoeuvre-0318.JPG" alt=""></noscript>
      </picture>
              

      source srcset data-srcset

      <picture>
      <source srcset="/fr/rimage/ftw_webp_240/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 240w" data-srcset="/fr/rimage/ftw_webp_240/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 240w, /fr/rimage/ftw_webp_288/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 288w, /fr/rimage/ftw_webp_384/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 384w, /fr/rimage/ftw_webp_480/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 480w, /fr/rimage/ftw_webp_576/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 576w, /fr/rimage/ftw_webp_768/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 768w, /fr/rimage/ftw_webp_960/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 960w, /fr/rimage/ftw_webp_1152/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 1152w, /fr/rimage/ftw_webp_1536/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 1536w, /fr/rimage/ftw_webp_1920/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 1920w, /fr/rimage/ftw_webp_2304/image/10/1c386e7cddfbf205b9a6fb4ca7ce9666 2304w" data-aspectratio="1.7776666666667" type="image/webp" sizes="1vw">
      </picture>

      img data-src-small data-src-medium data-src

      <img class="b-lazy image-large" 
      src="data:image/gif;base64,R0lGODlhAQABAAAAACH5BAEKAAEALAAAAAABAAEAAAICTAEAOw=="
      data-src-small="http://www.104.fr/cache/media/visuels-generaux/bandeau-home/c,0,0,1733,718-cr,609,233-q,80-468ffd.jpg|http://www.104.fr/cache/media/visuels-generaux/bandeau-home/c,0,0,1733,718-cr,609,233-f551d3.jpg" 
      data-src-medium="http://www.104.fr/cache/media/visuels-generaux/bandeau-home/c,0,0,1733,718-cr,993,382-q,80-af15f9.jpg|http://www.104.fr/cache/media/visuels-generaux/bandeau-home/c,0,0,1733,718-cr,993,382-18029b.jpg" 
      data-src="http://www.104.fr/cache/media/visuels-generaux/bandeau-home/c,0,0,1733,718-cr,1350,560-q,80-8f5db0.jpg|http://www.104.fr/cache/media/visuels-generaux/bandeau-home/c,0,0,1733,718-cr,1350,560-84eebf.jpg" 
      alt=""/>

                       
              

       

      Attachments

        Activity

          People

            Unassigned Unassigned
            sara Sara Aubry
            Clara Wiatrowski Clara Wiatrowski
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: