crawler/DATA/LOG/yacy118.log
2025-03-26 09:12:37 +09:00


I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/films/babb52f89b5488d00cb76a924d7e06eb/XPxkbzNVy36iDfcGUxaqpxFC6LJ0tI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=christensen-benjamin, STACKING TIME = 1, PARSING TIME = 86
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/films/61e9ee86f43dcf3bfecafe032299b9f3/WU2TTrhW3pd1lKQ9tk4zH4b20JSv40_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=litvak-anatole
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ucevem_26NP5 (1735154897508106240)]} 0 2
I 2022/06/09 11:04:20 Fulltext indexing: ucevem_26NP5 https://www.criterion.com/shop/browse?director=litvak-anatole
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 1186 words in URL https://www.criterion.com/shop/browse?director=litvak-anatole [ucevem_26NP5]
Description: Anatole Litvak films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12412 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=christensen-benjamin
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uXv-Sm_26NP5 (1735154897574166528)]} 0 2
I 2022/06/09 11:04:20 Fulltext indexing: uXv-Sm_26NP5 https://www.criterion.com/shop/browse/list?director=christensen-benjamin
I 2022/06/09 11:04:20 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=christensen-benjamin [uXv-Sm_26NP5]
Description: Benjamin Christensen films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12534 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 HTCACHE storing content of url https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you, 72518 bytes
I 2022/06/09 11:04:20 REJECTED https://www.screendaily.com/reviews/sorry-we-missed-you-cannes-review/5139521.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.theguardian.com/film/2019/may/16/sorry-we-missed-you-review-ken-loach - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://lwlies.com/festivals/sorry-we-missed-you-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://criterion-production.s3.amazonaws.com/HACTDUVL5NkTIZL8hjkJFG5LNjesp3.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-chang-pedro-almodovar-ken-loach-jessica-hausner-20190518-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.independent.co.uk/arts-entertainment/films/reviews/sorry-we-missed-you-cannes-film-festival-review-ken-loach-drama-a8917766.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://film.avclub.com/more-zombies-and-a-new-downer-from-a-past-cannes-winne-1834839786 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.festival-cannes.com/en/festival/films/sorry-we-missed-you - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you, STACKING TIME = 5, PARSING TIME = 9
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://cine-vue.com/2019/05/cannes-2019-sorry-we-missed-you-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.timeout.com/london/film/sorry-we-missed-you - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/sorry-we-missed-you-ken-loach-gig-economy-drama-kris-hitchen-debbie-honeywood-newcastle - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.hollywoodreporter.com/review/sorry-we-missed-you-cannes-2019-1211221 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://cineuropa.org/en/newsdetail/372672/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/embed/jLlVDpWSn0c?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.telegraph.co.uk/films/0/sorry-missed-review-kenloach-insightful-clear-eyed/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you
I 2022/06/09 11:04:20 Fulltext indexing: uS9gIG_26NP5 https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uS9gIG_26NP5 (1735154897645469696)]} 0 1
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 499 words in URL https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you [uS9gIG_26NP5]
Description: Ken Loachs Sorry We Missed You | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5988 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard, 66668 bytes
I 2022/06/09 11:04:21 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 446, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard, STACKING TIME = 3, PARSING TIME = 8
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/de1ffcb56beae0169f9e7b23ae7b9516.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1780-c489370ce256ad8a36a096bca4ed7bb4/9aNOPIxaT7oPmfJpfNfA6Bnm1Subqd_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ce8d7645f17327c4196bf6d407d80d9c.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/725e2cf2cb8ad819db5ae2aa50fbe3b8/nGEJ6EsfxvGN30AgK99Ylco5wtBGmf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/52b1d3054682fba202f2679beffb971a/QubSQmCC5J7OT6oFVavxjM711ui3DR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard
I 2022/06/09 11:04:21 Fulltext indexing: uMCee3_26NP5 https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard
I 2022/06/09 11:04:21 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uMCee3_26NP5 (1735154897919148032)]} 0 2
I 2022/06/09 11:04:21 SWITCHBOARD *Indexed 267 words in URL https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard [uMCee3_26NP5]
Description: Eclipse Series 4: Raymond Bernard | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4351 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=siegel-don, 225197 bytes
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=siegel-don, STACKING TIME = 1, PARSING TIME = 21
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1859-5a1d456ec7733e9205b555242f30548d/QydMAqke3toeG4h4BOBYwNcOHf6z3W_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/1fff3ac634ee3a07e07e071a03276f29/aYkEYStkCF5KUvvjpd3gN3fn5XYZP6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/fd68e938462eb582cb0de5a34be73d61/AJyDlyL04mfIBYIkBs53hojPoQLt0A_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=siegel-don
I 2022/06/09 11:04:21 Fulltext indexing: uSmRHm_26NP5 https://www.criterion.com/shop/browse/list?director=siegel-don
I 2022/06/09 11:04:21 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uSmRHm_26NP5 (1735154898085871616)]} 0 2
I 2022/06/09 11:04:21 SWITCHBOARD *Indexed 1201 words in URL https://www.criterion.com/shop/browse/list?director=siegel-don [uSmRHm_26NP5]
Description: Don Siegel films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12530 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:21 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 450, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:21 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 450, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys, 121014 bytes
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 60 LINKS FROM https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys, STACKING TIME = 9, PARSING TIME = 12
I 2022/06/09 11:04:21 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/3ed41f79deffcb3052099b02c9660e9b/zQOZgJoUsBEgi8arpM5w8aT22vGo6W_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/f8bf41c3e8d2266f423881ceb3159429/58bZDer5maXJjg6GDgD8Tyrr6ZZAuT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/images/3766-ff6aa079e27077432f53409f08cb425d/kaurismakicowboys_1484_006_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/8f12ceb5a2e46f5f1550942e055ef1af/5yl46GfrudlcteVtODCZveKlbIlys1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo, 224768 bytes
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo, STACKING TIME = 1, PARSING TIME = 54
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/8d41805aabbf9c68049033a9e54fc4ca/5HBkbTpi2BcdDfPwUmTIH76T5jR9jA_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/df4a3828538e371bb514327fb5be561f/URBYiTnt4otEoXtvtr2zbHA5ykxgkB_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@e6edfd3[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wa(7.7.3):C23:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772640046}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wb(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, 
java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772645671}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wc(7.7.3):C17:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650026}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wd(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650982}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_we(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772656193}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772661758}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:21 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:21 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 452, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book, 74910 bytes
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 81 LINKS FROM https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book, STACKING TIME = 5, PARSING TIME = 14
I 2022/06/09 11:04:21 REJECTED https://www.telegraph.co.uk/films/2018/05/12/jean-luc-godard-facetime-press-conference-cannes-filming-boring/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.theglobeandmail.com/arts/film/article-cannes-diary-jean-luc-godard-says-goodbye-to-language-once-and-for/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://time.com/5275703/cannes-review-jean-luc-godard-the-image-book/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.villagevoice.com/2018/05/24/a-tale-of-many-godards/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.screendaily.com/reviews/the-image-book-cannes-review/5129209.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://criterion-production.s3.amazonaws.com/xpJzxC3KGLNkBjQu3UYoFw6YU2GTJj.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://film.avclub.com/jean-luc-godard-returns-to-cannes-to-make-a-dunce-out-o-1825979305 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://www.middleeasteye.net/in-depth/features/cannes-2018-middle-east-takes-home-jury-prize-1960489906 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-5-changless-change-jean-luc-godard-and-jia-zhangke - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://variety.com/2018/film/reviews/the-image-book-review-jean-luc-godard-1202807089/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://thefilmstage.com/reviews/cannes-review-jean-luc-godards-the-image-book-displays-an-infuriating-stimulating-love-of-cinema/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-cannes-day-four/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://desistfilm.com/cannes-2018-le-livre-dimage-by-jean-luc-godard/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/embed/FAk3c9OM8ZQ?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://variety.com/2018/film/global/jean-luc-godard-to-adapt-the-image-book-into-traveling-exhibit-star-in-a-vendredi-robinson-exclusive-1202805535/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354276 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://www.indiewire.com/2018/05/the-image-book-review-jean-luc-godard-cannes-1201963343/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/embed/TWFmQbrAYqE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.timeout.com/london/film/the-image-book - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.slantmagazine.com/film/review/the-image-book - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://filmmakermagazine.com/105332-cannes-2018-dispatch-3-the-image-book/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.theguardian.com/film/2018/may/11/the-image-book-review-jean-luc-godard-cannes-2018 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://lundi.am/Vent-d-ouest-JL-Godard - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.hollywoodreporter.com/review/image-book-review-1111185 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.festival-cannes.com/en/festival/films/le-livre-d-image - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.pastemagazine.com/articles/2018/05/le-livre-dimage-the-image-book.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys
I 2022/06/09 11:04:21 REJECTED https://www.thewrap.com/the-image-book-film-review-once-again-jean-luc-godard-messes-with-viewers-heads/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.rogerebert.com/cannes/cannes-2018-the-image-book-cold-war - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://mubi.com/notebook/posts/the-chamber-piece-an-interview-with-fabrice-aragno - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 Fulltext indexing: tyx3TG_26NP5 https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys
I 2022/06/09 11:04:21 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tyx3TG_26NP5 (1735154898797854720)]} 0 12
I 2022/06/09 11:04:21 SWITCHBOARD *Indexed 957 words in URL https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys [tyx3TG_26NP5]
Description: Eclipse Series 29: Aki Kaurismäki's Leningrad Cowboys | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 13086 bytes |
LinkStorageTime: 17 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo
I 2022/06/09 11:04:22 Fulltext indexing: uHi4Lm_26NP5 https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uHi4Lm_26NP5 (1735154898871255040)]} 0 4
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo [uHi4Lm_26NP5]
Description: Bernardo Bertolucci films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12537 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:22 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:22 HTCACHE storing content of url https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro, 73320 bytes
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tyYgsG_26NP5 (1735154898942558208)]} 0 2
I 2022/06/09 11:04:22 Fulltext indexing: tyYgsG_26NP5 https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 517 words in URL https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book [tyYgsG_26NP5]
Description: Jean-Luc Godard's The Image Book | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6336 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:22 SWITCHBOARD CRAWL: ADDED 78 LINKS FROM https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro, STACKING TIME = 5, PARSING TIME = 14
I 2022/06/09 11:04:22 REJECTED https://thefilmstage.com/reviews/cannes-review-alice-rohrwachers-talents-fully-bloom-with-the-masterful-lazzaro-felice/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://lwlies.com/festivals/happy-lazzaro-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/embed/30KW3i3bxEo?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.rogerebert.com/cannes/cannes-2018-3-faces-happy-as-lazzaro - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://theplaylist.net/lazzaro-felice-alice-rohrwacher-eview-20180516/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://variety.com/2018/film/reviews/happy-as-lazzaro-review-1202808832/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://desistfilm.com/cannes-2018-happy-as-lazzaro-by-alice-rohrwarcher/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.villagevoice.com/2018/05/17/lazarus-come-forth-on-alice-rohrwachers-cannes-stunner-lazzaro-felice/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.hollywoodreporter.com/review/happy-as-lazzaro-lazzaro-felice-review-1111486 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.thewrap.com/happy-as-lazzaro-film-review-alice-rohrwacher-charts-the-course-of-a-holy-fool/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354354 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.theguardian.com/film/2018/may/14/happy-as-lazzaro-review-cannes-alice-rohrwacher-wonders-tobacco-sharecroppers - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://icsfilm.org/reviews/cannes-2018-review-lazzaro-felice-alice-rohrwacher/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://www.indiewire.com/2018/05/happy-as-lazzaro-review-alice-rohrwacher-cannes-2018-1201964121/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://www.anothergaze.com/cannes-review-alice-rohrwachers-happy-lazzaro-lazzaro-felice-feminist/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://filmmakermagazine.com/105338-cannes-2018-dispatch-4-shoplifters-girl-happy-as-lazzaro/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.screendaily.com/reviews/happy-as-lazzaro-cannes-review/5129305.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.timeout.com/london/film/happy-as-lazzaro - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://criterion-production.s3.amazonaws.com/5jKpSEOvYO6bPwgjJzPQpHH09IjISm.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.jigsawlounge.co.uk/film/reviews/cannes2018/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.festival-cannes.com/en/festival/films/lazzaro-felice - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-7-two-gentle-competitors-and-war-s-dirty-work - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://film.avclub.com/spike-lee-teams-up-with-jordan-peele-for-the-funny-poi-1826042384 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://cannes-ratings.herokuapp.com/Cannes2018 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://www.indiewire.com/2018/05/netflix-cannes-happy-as-lazarro-girl-1201966537/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[txjKrG_26NP5 (1735154898985549824)]} 0 1
I 2022/06/09 11:04:22 Fulltext indexing: txjKrG_26NP5 https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 503 words in URL https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro [txjKrG_26NP5]
Description: Alice Rohrwacher's Happy as Lazzaro | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5793 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:22 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 445, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:22 HTCACHE storing content of url https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell, 76851 bytes
I 2022/06/09 11:04:22 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell, STACKING TIME = 1, PARSING TIME = 11
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-white-heaven-in-hell?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/8ce45228e2f3a463d6c77bb4be8fa192/EaTlysvdcJsvJJ3tP1pUrBI2UN2cEl_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/17177cec9104c935b7ee57a6bf36c178.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.amazon.com/dp/B01M5LJOEO - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://itunes.apple.com/us/movie/id1170771456?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/9daffdb90cdc6bee986834ed46660b66.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/a70ff6a4b7f88021a257d0be6981d057.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell
I 2022/06/09 11:04:22 Fulltext indexing: tordge_26NP5 https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tordge_26NP5 (1735154899239305216)]} 0 6
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 344 words in URL https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell [tordge_26NP5]
Description: Lone Wolf and Cub: White Heaven in Hell (1974) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3800 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:22 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 442, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:22 HTCACHE storing content of url https://www.criterion.com/current/posts/905-the-last-picture-show, 82679 bytes
I 2022/06/09 11:04:22 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/905-the-last-picture-show, STACKING TIME = 1, PARSING TIME = 51
I 2022/06/09 11:04:22 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 437, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/905-the-last-picture-show
I 2022/06/09 11:04:22 Fulltext indexing: tbPo1G_26NP5 https://www.criterion.com/current/posts/905-the-last-picture-show
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tbPo1G_26NP5 (1735154899703824384)]} 0 2
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 498 words in URL https://www.criterion.com/current/posts/905-the-last-picture-show [tbPo1G_26NP5]
Description: The Last Picture Show | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6570 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:22 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=forsyth-bill, 224164 bytes
I 2022/06/09 11:04:22 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=forsyth-bill, STACKING TIME = 1, PARSING TIME = 23
I 2022/06/09 11:04:22 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/404d9791922b336d3015f1500ee014eb/tG4CJ9c3PbmNziiX4KClVc9Pj6MsDU_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=forsyth-bill
I 2022/06/09 11:04:23 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[thvE3m_26NP5 (1735154899898859520)]} 0 2
I 2022/06/09 11:04:23 Fulltext indexing: thvE3m_26NP5 https://www.criterion.com/shop/browse/list?director=forsyth-bill
I 2022/06/09 11:04:23 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=forsyth-bill [thvE3m_26NP5]
Description: Bill Forsyth films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12475 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:23 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 440, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:23 HTCACHE storing content of url https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde, 66944 bytes
I 2022/06/09 11:04:23 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde, STACKING TIME = 2, PARSING TIME = 6
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/images/4369-a829ab08178ba2ab415dda3775dfdbf2/img_current_1188_094_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde
I 2022/06/09 11:04:23 Fulltext indexing: s-lmzG_26NP5 https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde
I 2022/06/09 11:04:23 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ozerov-yuri, 224265 bytes
I 2022/06/09 11:04:23 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[s-lmzG_26NP5 (1735154900279492608)]} 0 7
I 2022/06/09 11:04:23 SWITCHBOARD *Indexed 771 words in URL https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde [s-lmzG_26NP5]
Description: Fanfan la Tulipe: En Garde! | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 10350 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:23 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 438, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:23 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=ozerov-yuri, STACKING TIME = 1, PARSING TIME = 31
I 2022/06/09 11:04:23 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=ozerov-yuri
I 2022/06/09 11:04:23 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tQd4Jm_26NP5 (1735154900428390400)]} 0 2
I 2022/06/09 11:04:23 Fulltext indexing: tQd4Jm_26NP5 https://www.criterion.com/shop/browse/list?director=ozerov-yuri
I 2022/06/09 11:04:23 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=ozerov-yuri [tQd4Jm_26NP5]
Description: Juri Ozerov films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12571 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:23 HTCACHE storing content of url https://www.criterion.com/current/author/703-jyoti-mistry, 49912 bytes
I 2022/06/09 11:04:23 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/703-jyoti-mistry, STACKING TIME = 1, PARSING TIME = 4
I 2022/06/09 11:04:23 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/author/703-jyoti-mistry
I 2022/06/09 11:04:23 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[s2Ej1G_26NP5 (1735154900456701952)]} 0 1
I 2022/06/09 11:04:23 Fulltext indexing: s2Ej1G_26NP5 https://www.criterion.com/current/author/703-jyoti-mistry
I 2022/06/09 11:04:23 SWITCHBOARD *Indexed 132 words in URL https://www.criterion.com/current/author/703-jyoti-mistry [s2Ej1G_26NP5]
Description: Jyoti Mistry | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1606 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:23 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 433, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:23 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 433, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:23 HTCACHE storing content of url https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film, 87179 bytes
I 2022/06/09 11:04:23 SWITCHBOARD CRAWL: ADDED 63 LINKS FROM https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film, STACKING TIME = 2, PARSING TIME = 8
I 2022/06/09 11:04:23 REJECTED https://criterion-production.s3.amazonaws.com/z03m9YUmrPCKLCSTxvg3l8J3XIUtDL.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7513-/AlQgKHMrEYtBYqY3vbFGVgCDASqo9R_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://criterion-production.s3.amazonaws.com/Trpy7wGy1YAlukthn4k7bSwTCWGYZN.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7003-/LPfLZBD0q2OFWw19DtZJgFLtn232KC_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://criterion-production.s3.amazonaws.com/JHSMIJKcny3omzCBbyMLDY41iArADH.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/films/d625b0e7f179fa73f1d10d4ff66873a6/KwcufB3P2S9l5e6g7eQitniSmR32hr_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://criterion-production.s3.amazonaws.com/uomtCBcQtaGxubA9zAv30rjKvmUbLO.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 HTCACHE storing content of url https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel, 72235 bytes
I 2022/06/09 11:04:24 SWITCHBOARD CRAWL: ADDED 76 LINKS FROM https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel, STACKING TIME = 5, PARSING TIME = 10
I 2022/06/09 11:04:24 REJECTED https://film.avclub.com/mads-mikkelsen-endures-a-cold-crucible-but-its-cold-wa-1825957205 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.villagevoice.com/2018/05/14/music-madness-and-memory-at-cannes-part-two-cold-war-sorry-angel-and-the-mysteries-of-love/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-12-a-generational-romance-and-closing-fragments - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://lwlies.com/festivals/sorry-angel-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.slantmagazine.com/house/article/cannes-film-review-yomeddine-leto-sorry-angel - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.festival-cannes.com/en/festival/films/plaire-aimer-et-courir-vite - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://criterion-production.s3.amazonaws.com/iBOHAxDZL0mZ3DmRrcjkupNQRbGaSu.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.rogerebert.com/cannes/cannes-2018-leto-sorry-angel-border - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.telegraph.co.uk/films/2018/05/12/sorry-angel-review-lovely-bittersweet-gay-romance-drippingly/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://thefilmstage.com/reviews/cannes-review-christophe-honores-sorry-angel-is-a-rote-but-practiced-age-gap-romance/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/embed/aEclOo9XHHY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.screendaily.com/reviews/sorry-angel-cannes-review/5129026.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.thewrap.com/sorry-angel-film-review-aids-drama-explores-quiet-places/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://variety.com/2018/film/global/christophe-honore-on-sorry-angel-in-france-were-blessed-as-filmmakers-1202807288/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354138 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.theguardian.com/film/2018/may/10/sorry-angel-apology-not-accepted-for-tedious-age-gap-gay-romance - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://cine-vue.com/2018/05/cannes-2018-sorry-angel-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.hollywoodreporter.com/review/sorry-angel-review-1109656 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.indiewire.com/2018/05/sorry-angel-review-christophe-honore-cannes-2018-1201962227/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://icsfilm.org/reviews/cannes-2018-review-sorry-angel-christophe-honore/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/sorry-angel-christophe-honore-vincent-lacoste-gay-life-90s - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.vanityfair.com/hollywood/2018/05/sorry-angel-christophe-honore-cannes-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://variety.com/2018/film/reviews/sorry-angel-review-plaire-aimer-et-courir-vite-1202805122/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.timeout.com/london/film/sorry-angel - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film
I 2022/06/09 11:04:24 Fulltext indexing: smkItG_26NP5 https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film
I 2022/06/09 11:04:24 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 427, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[smkItG_26NP5 (1735154901082701824)]} 0 13
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 1280 words in URL https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film [smkItG_26NP5]
Description: Beasts of No Nation: A Different Kind of African War Film | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 17028 bytes |
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel
I 2022/06/09 11:04:24 Fulltext indexing: sUkDeG_26NP5 https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[sUkDeG_26NP5 (1735154901115207680)]} 0 1
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 430 words in URL https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel [sUkDeG_26NP5]
Description: Christophe Honoré’s Sorry Angel | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4766 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:24 HTCACHE storing content of url https://www.criterion.com/current/author/97-james-harvey, 49717 bytes
I 2022/06/09 11:04:24 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/97-james-harvey, STACKING TIME = 1, PARSING TIME = 3
I 2022/06/09 11:04:24 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/current/author/97-james-harvey
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[sO73bG_26NP5 (1735154901325971456)]} 0 1
I 2022/06/09 11:04:24 Fulltext indexing: sO73bG_26NP5 https://www.criterion.com/current/author/97-james-harvey
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 142 words in URL https://www.criterion.com/current/author/97-james-harvey [sO73bG_26NP5]
Description: James Harvey | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2225 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:24 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 424, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:24 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 424, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:24 HTCACHE storing content of url https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork, 72835 bytes
I 2022/06/09 11:04:24 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:04:24 REJECTED http://www.bam.org/taipeistory - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/films/b8aa5bf1a2e514e781a4974edeb7f07a/khYYYxHBAKOhkTFow0fq2YBsI73A0F_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.nytimes.com/2017/03/16/movies/taipei-story-review.html?_r=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/images/8116-d54635daa17b8f8473f68a57c2177797/taipei_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.villagevoice.com/film/past-and-future-tug-at-an-unstable-present-in-a-restored-masterwork-by-edward-yang-9769270 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[r64W1G_26NP5 (1735154901727576064)]} 0 1
I 2022/06/09 11:04:24 Fulltext indexing: r64W1G_26NP5 https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 320 words in URL https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork [r64W1G_26NP5]
Description: Critics Hail a Newly Restored Taiwanese Masterwork | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3754 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:24 HTCACHE storing content of url https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things, 79796 bytes
I 2022/06/09 11:04:24 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things, STACKING TIME = 2, PARSING TIME = 7
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/images/5281-09e07817d94e0955561f0ba6b656cab3/28005id_146_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/films/5f30d2a6f02704c28b2b31a9331e1f7c/9th6Iqsdh4VJpysPfdoOhLkU6YRLB9_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 418, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things
I 2022/06/09 11:04:24 Fulltext indexing: rwbkAG_26NP5 https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[rwbkAG_26NP5 (1735154901898493952)]} 0 3
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 1020 words in URL https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things [rwbkAG_26NP5]
Description: Y tu mamá también: Dirty Happy Things | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14336 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:25 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 418, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:25 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 418, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:25 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=coppola, 224180 bytes
I 2022/06/09 11:04:25 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:04:25 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:04:25 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=coppola, STACKING TIME = 5, PARSING TIME = 39
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/films/28b1b85a6f4119f34d6165c5256037f6/CGCMTOMzrmzXe1qXr6Mnpt4jXgsgi0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 422, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:25 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=coppola
I 2022/06/09 11:04:25 Fulltext indexing: rgZMmm_26NP5 https://www.criterion.com/shop/browse/list?director=coppola
I 2022/06/09 11:04:25 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[rgZMmm_26NP5 (1735154902752034816)]} 0 6
I 2022/06/09 11:04:25 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=coppola [rgZMmm_26NP5]
Description: Francis Ford Coppola films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12489 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:25 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=audiard-jacques, 224159 bytes
I 2022/06/09 11:04:25 HTCACHE storing content of url https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike, 70793 bytes
I 2022/06/09 11:04:25 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:25 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=audiard-jacques, STACKING TIME = 0, PARSING TIME = 36
I 2022/06/09 11:04:25 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/films/8d1c34bd6af4fa432ec28aebc0ad55d6/FMt4h5XnG7oyewPgkLZPxdrXDUPq0v_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 426, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/images/5073-6d8164bf2a1164be530153e3404dc049/KWAB_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/films/ed560ce125d981a74da5f0b112c643c4/sG8YTAGNjz7HsWG1RS1uUi54VYccwx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike, STACKING TIME = 5, PARSING TIME = 21
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.youtube.com/embed/jhVq-RmbA34?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=audiard-jacques
I 2022/06/09 11:04:25 Fulltext indexing: rNGwYm_26NP5 https://www.criterion.com/shop/browse/list?director=audiard-jacques
I 2022/06/09 11:04:25 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[rNGwYm_26NP5 (1735154903011033088)]} 0 8
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse/list?director=audiard-jacques [rNGwYm_26NP5]
Description: Jacques Audiard films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12457 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike
I 2022/06/09 11:04:26 Fulltext indexing: qW-f6G_26NP5 https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[qW-f6G_26NP5 (1735154903082336256)]} 0 1
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 303 words in URL https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike [qW-f6G_26NP5]
Description: The Dardennes and The Kid with a Bike | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3468 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:26 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 426, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:26 HTCACHE storing content of url https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks, 67721 bytes
I 2022/06/09 11:04:26 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks, STACKING TIME = 1, PARSING TIME = 9
I 2022/06/09 11:04:26 REJECTED https://www.filmstruck.com/us/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://player.vimeo.com/video/243748992 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks
I 2022/06/09 11:04:26 Fulltext indexing: pZIkoG_26NP5 https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[pZIkoG_26NP5 (1735154903244865536)]} 0 1
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 261 words in URL https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks [pZIkoG_26NP5]
Description: Victor Sjöström’s Stirring Flashbacks | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3314 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:26 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ferreri-marco, 224232 bytes
I 2022/06/09 11:04:26 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=ferreri-marco, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:04:26 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/films/898808a9204021a47c57b59173c72e75/ttGzJdB52PJ2tfhbXB89WxPhDBytPI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=ferreri-marco
I 2022/06/09 11:04:26 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 426, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:26 Fulltext indexing: paAX9m_26NP5 https://www.criterion.com/shop/browse/list?director=ferreri-marco
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[paAX9m_26NP5 (1735154903452483584)]} 0 3
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=ferreri-marco [paAX9m_26NP5]
Description: Marco Ferreri films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12497 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:26 HTCACHE storing content of url https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota, 68006 bytes
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/embed/HoqCsMUoN1c?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota, STACKING TIME = 6, PARSING TIME = 12
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED http://www.heightstheater.com/film/the-phantom-carriage/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/images/6322-6a8d8311976bfaf0f3648d49a8b6dfb4/phantom_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota
I 2022/06/09 11:04:26 Fulltext indexing: pNR6yG_26NP5 https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[pNR6yG_26NP5 (1735154903546855424)]} 0 1
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 288 words in URL https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota [pNR6yG_26NP5]
Description: Sjöström Haunts in Minnesota | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3323 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:26 HostQueue forcing crawl-delay of 199 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 425, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 51)) = 199
I 2022/06/09 11:04:26 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ismail-usmar, 224768 bytes
I 2022/06/09 11:04:26 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=ismail-usmar, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:04:26 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/films/40e3648637e296fc80b9a4d526f35951/uWUFo8TqJ9uMAr8esMMCqoDE4sgpMx_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1961-3b7b159a9ae3d500459e38e69c96a917/9P9MeXzolFQyY5OhK2XcwZ02e0Y0ZC_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:26 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 425, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=ismail-usmar
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ovy1Lm_26NP5 (1735154904025006080)]} 0 2
I 2022/06/09 11:04:26 Fulltext indexing: ovy1Lm_26NP5 https://www.criterion.com/shop/browse/list?director=ismail-usmar
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=ismail-usmar [ovy1Lm_26NP5]
Description: Usmar Ismail films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12513 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:26 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:27 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:27 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@1ee4a635[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wa(7.7.3):C23:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772640046}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wb(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, 
java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772645671}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wc(7.7.3):C17:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650026}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wd(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650982}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_we(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772656193}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772661758}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wg(7.7.3):C15:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772665682}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wh(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, 
java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772665845}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wi(7.7.3):C6:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772666996}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:27 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:27 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 425, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=cazals-felipe, 224237 bytes
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=cazals-felipe, STACKING TIME = 1, PARSING TIME = 37
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/0edac3ad6fa837bdd9abe77bea6012f3/h4SRYVXnUjLhqLaGqxuhcGeEhix8Yq_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 426, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:27 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=cazals-felipe
I 2022/06/09 11:04:27 Fulltext indexing: n2_llm_26NP5 https://www.criterion.com/shop/browse/list?director=cazals-felipe
I 2022/06/09 11:04:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[n2_llm_26NP5 (1735154904577605632)]} 0 5
I 2022/06/09 11:04:27 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=cazals-felipe [n2_llm_26NP5]
Description: Felipe Cazals films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12500 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=jarman-derek, 224149 bytes
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=jarman-derek, STACKING TIME = 2, PARSING TIME = 40
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/f20e8f5bdc458777cda90775598ef89c/aiexRhqINL31ogh7SulZjNkvJQHnRD_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=jarman-derek
I 2022/06/09 11:04:27 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nrKyrm_26NP5 (1735154904777883648)]} 0 2
I 2022/06/09 11:04:27 Fulltext indexing: nrKyrm_26NP5 https://www.criterion.com/shop/browse/list?director=jarman-derek
I 2022/06/09 11:04:27 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=jarman-derek [nrKyrm_26NP5]
Description: Derek Jarman films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12473 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons, 76815 bytes
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/films/27977-douce, 71365 bytes
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons, STACKING TIME = 1, PARSING TIME = 18
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/3c9f09ce0317fdbf2199438da624ef26/wQrhudyvLgRCg2bvWKGyePtmhI6yh6_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://itunes.apple.com/us/movie/id1170340182?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.amazon.com/dp/B01MQFA5CN - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/7c02624be1ea9c11cf46bd17cf27e9aa.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/87903afea7478207e48d1481cc4abe56.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-baby-cart-in-the-land-of-demons?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/e0d98d9b85c97e837770345f98c9f025.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/27977-douce, STACKING TIME = 1, PARSING TIME = 107
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/douce?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1816-e188e2102b63387dbe23fc67edb6beea/DWwbQG5RL4lfG3HYvO9uUZ78amems4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/9541-7926a5f63159cd1b4241ec268ed9d6c9/douce_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/106b07b3cc434c73b4ae16e8ed87669d.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f7d00ab6307c1ec81109bf1425893316.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/bbf16d8bcccc2dc396438789b2f31af7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/460b59ca750bcf35c4b72400db769627.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/54383fec44d7e8782575c44af765b4b3/zvkyPLbQ24L9VjH6jP1r4Q0jCYbVMY_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/9518-74f3b68e17d1c132c4fb7dd0555570cc/Current_29404id_015_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons
I 2022/06/09 11:04:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nZ4dNe_26NP5 (1735154904990744576)]} 0 1
I 2022/06/09 11:04:27 Fulltext indexing: nZ4dNe_26NP5 https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons
I 2022/06/09 11:04:27 SWITCHBOARD *Indexed 342 words in URL https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons [nZ4dNe_26NP5]
Description: Lone Wolf and Cub: Baby Cart in the Land of Demons (1973) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3804 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:27 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/films/27977-douce
I 2022/06/09 11:04:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nSOfce_26NP5 (1735154905009618944)]} 0 1
I 2022/06/09 11:04:27 Fulltext indexing: nSOfce_26NP5 https://www.criterion.com/films/27977-douce
I 2022/06/09 11:04:27 SWITCHBOARD *Indexed 269 words in URL https://www.criterion.com/films/27977-douce [nSOfce_26NP5]
Description: Douce (1943) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3106 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:27 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/current/posts/1170-bergman-and-i, 85838 bytes
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/1170-bergman-and-i, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7820-/YKRO4oMLJouJhzJ0UVxMAsXpd2O4sF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7799-/BcSOGzRAmQVrNktlZ6a8juCv8YVP7g_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7821-/DIxDSXUzZ7yk0yAo3JyIs4aQdkSJqq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/4446-87cf72e6b02a2e66949e31ff7289fc20/bergmanisland_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/tout_image/7812-/cfFE90t4MLOwAR88lqDIdl2lvUw6SX_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/1170-bergman-and-i
I 2022/06/09 11:04:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nPg55G_26NP5 (1735154905126010880)]} 0 2
I 2022/06/09 11:04:28 Fulltext indexing: nPg55G_26NP5 https://www.criterion.com/current/posts/1170-bergman-and-i
I 2022/06/09 11:04:28 SWITCHBOARD *Indexed 477 words in URL https://www.criterion.com/current/posts/1170-bergman-and-i [nPg55G_26NP5]
Description: Bergman and I | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6238 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:28 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 428, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:28 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 428, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:28 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=voss-kurt, 224171 bytes
I 2022/06/09 11:04:28 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=voss-kurt, STACKING TIME = 1, PARSING TIME = 39
I 2022/06/09 11:04:28 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://s3.amazonaws.com/criterion-production/films/2131124bf11dd19cde56a791c8fc54f9/o5Y9AGkM9iZWMr9AQm46QYzEAvubcV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=voss-kurt
I 2022/06/09 11:04:28 Fulltext indexing: nLywem_26NP5 https://www.criterion.com/shop/browse/list?director=voss-kurt
I 2022/06/09 11:04:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nLywem_26NP5 (1735154905735233536)]} 0 2
I 2022/06/09 11:04:28 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=voss-kurt [nLywem_26NP5]
Description: Kurt Voss films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12487 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:28 HostQueue forcing crawl-delay of 187 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 63)) = 187
I 2022/06/09 11:04:28 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mann-anthony, 224159 bytes
I 2022/06/09 11:04:28 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=mann-anthony, STACKING TIME = 2, PARSING TIME = 31
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://s3.amazonaws.com/criterion-production/films/27347e78f7beca764a3920161b531e11/9TTjP15bKoZruNTlx8uvvB6oPAMC2O_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 432, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:28 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=mann-anthony
I 2022/06/09 11:04:28 Fulltext indexing: m7bKdm_26NP5 https://www.criterion.com/shop/browse/list?director=mann-anthony
I 2022/06/09 11:04:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[m7bKdm_26NP5 (1735154906116915200)]} 0 2
I 2022/06/09 11:04:28 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=mann-anthony [m7bKdm_26NP5]
Description: Anthony Mann films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12465 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=holland-agnieszka, 224213 bytes
I 2022/06/09 11:04:29 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=holland-agnieszka, STACKING TIME = 2, PARSING TIME = 27
I 2022/06/09 11:04:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/films/1975c824e3f44554d1755f66ac8e9901/op5e76062WLo9yZJeiOlU7gNFEjL5m_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 434, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=berger-ludwig, 224196 bytes
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife, 95467 bytes
I 2022/06/09 11:04:29 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=holland-agnieszka
I 2022/06/09 11:04:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=berger-ludwig, STACKING TIME = 12, PARSING TIME = 39
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/films/41132bea28bb3bbea12d52e78c20b378/kl6ebsg1AK3m1ejL3zvPJa17lTmxcW_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 Fulltext indexing: m2CQem_26NP5 https://www.criterion.com/shop/browse/list?director=holland-agnieszka
I 2022/06/09 11:04:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[m2CQem_26NP5 (1735154906512228352)]} 0 6
I 2022/06/09 11:04:29 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=holland-agnieszka [m2CQem_26NP5]
Description: Agnieszka Holland films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12487 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:29 SWITCHBOARD CRAWL: ADDED 71 LINKS FROM https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife, STACKING TIME = 2, PARSING TIME = 32
I 2022/06/09 11:04:29 REJECTED https://theplaylist.net/carey-mulligan-wildlife-sundance-review-20180121/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://thefilmstage.com/reviews/sundance-review-wildlife-is-a-remarkably-assured-directorial-debut-for-paul-dano/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/embed/DpGk2oebiDY - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.avclub.com/laura-dern-digs-deep-in-the-most-powerful-and-disturbin-1822326316 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/embed/8yFxapmKLdM - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED http://www.sundance.org/projects/wildlife - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.thewrap.com/wildlife-review-paul-danos-directorial-debut-austere-portrait-family-crisis/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.theguardian.com/film/2018/jan/22/wildlife-review-carey-mulligan-paul-dano-directorial-debut-sundance - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED http://www.vulture.com/2018/01/wildlife-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.screendaily.com/reviews/wildlife-sundance-review/5125766.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-sundance-day-four/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.hollywoodreporter.com/review/wildlife-review-1076443 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.rogerebert.com/sundance/sundance-2018-wildlife - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED http://variety.com/2018/film/reviews/wildlife-review-carey-mulligan-1202671259/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED http://filmmakermagazine.com/104656-film-is-about-making-magic-with-these-kind-of-challenges-dp-diego-garcia-on-wildlife/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/images/9525-3f1277f09f93bbce17dddd1309aedb17/wildlife01242018_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-sundance-wildlife-paul-dano-20180120-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.cityweekly.net/BuzzBlog/archives/2018/01/24/sundance-film-festival-2018-day-6-capsules - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED http://www.indiewire.com/2018/01/wildlife-review-paul-dano-carey-mulligan-jake-gyllenhaal-sundance-2018-1201919723/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 435, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:29 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=berger-ludwig
I 2022/06/09 11:04:29 Fulltext indexing: muBRVm_26NP5 https://www.criterion.com/shop/browse/list?director=berger-ludwig
I 2022/06/09 11:04:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[muBRVm_26NP5 (1735154906775420928)]} 0 11
I 2022/06/09 11:04:29 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse/list?director=berger-ludwig [muBRVm_26NP5]
Description: Ludwig Berger films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12478 bytes |
LinkStorageTime: 18 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo, 57032 bytes
I 2022/06/09 11:04:29 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife
I 2022/06/09 11:04:29 Fulltext indexing: mWwnsG_26NP5 https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife
I 2022/06/09 11:04:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[mWwnsG_26NP5 (1735154906879229952)]} 0 3
I 2022/06/09 11:04:29 SWITCHBOARD *Indexed 605 words in URL https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife [mWwnsG_26NP5]
Description: Sundance 2018: Paul Dano's Wildlife | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7444 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=pichel-irving, 224720 bytes
I 2022/06/09 11:04:29 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 433, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 8)) = 243
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/current/author/628-terry-southern, 48869 bytes
I 2022/06/09 11:04:29 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 430, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
I 2022/06/09 11:04:30 HeapReader generating index for /root/yacy/DATA/HTCACHE/file.array/YpLgnuY9YMRB.20220609110430002.blob, 0 MB. Please wait.
I 2022/06/09 11:04:30 HeapReader finished index generation for /root/yacy/DATA/HTCACHE/file.array/YpLgnuY9YMRB.20220609110430002.blob, 0 entries, 0 gaps.
I 2022/06/09 11:04:30 Heap initializing heap /root/yacy/DATA/HTCACHE/file.array/YpLgnuY9YMRB.20220609110430002.blob
I 2022/06/09 11:04:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 SWITCHBOARD CRAWL: ADDED 39 LINKS FROM https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo, STACKING TIME = 2, PARSING TIME = 5
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo
I 2022/06/09 11:04:30 Fulltext indexing: mRRRZG_26NP5 https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo
I 2022/06/09 11:04:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[mRRRZG_26NP5 (1735154907310194688)]} 0 5
I 2022/06/09 11:04:30 SWITCHBOARD *Indexed 99 words in URL https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo [mRRRZG_26NP5]
Description: A Gondry Tribute to Jean Vigo | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1355 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:30 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1865-2b74f037f454df1d78013f06dc4aaea4/0TUeLtsha8fzPrVMeQ8rNOnpUVmvME_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://s3.amazonaws.com/criterion-production/films/1f4199efee0716b73e643f44cffd628f/FsFD7z9JPS8zGJwwhpboY5pnNOOmIc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=pichel-irving, STACKING TIME = 3, PARSING TIME = 44
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/628-terry-southern, STACKING TIME = 1, PARSING TIME = 11
I 2022/06/09 11:04:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=pichel-irving
I 2022/06/09 11:04:30 Fulltext indexing: mVARsm_26NP5 https://www.criterion.com/shop/browse/list?director=pichel-irving
I 2022/06/09 11:04:30 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 430, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[mVARsm_26NP5 (1735154907506278400)]} 0 12
I 2022/06/09 11:04:30 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=pichel-irving [mVARsm_26NP5]
Description: Irving Pichel films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12504 bytes |
LinkStorageTime: 23 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:30 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/current/author/628-terry-southern
I 2022/06/09 11:04:30 Fulltext indexing: lpTHqG_26NP5 https://www.criterion.com/current/author/628-terry-southern
I 2022/06/09 11:04:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lpTHqG_26NP5 (1735154907527249920)]} 0 0
I 2022/06/09 11:04:30 SWITCHBOARD *Indexed 112 words in URL https://www.criterion.com/current/author/628-terry-southern [lpTHqG_26NP5]
Description: Terry Southern | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1343 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:30 YACY rulebasedUpdateInfo: not an automatic update selected
I 2022/06/09 11:04:30 RESOURCE OBSERVER resources ok
I 2022/06/09 11:04:30 SWITCHBOARD postprocessing deactivated: field process_sxt is not enabled
I 2022/06/09 11:04:30 SWITCHBOARD postprocessing deactivated: no enough ram (420048312), needed 536870912, to force change field postprocessing.minimum_ram
I 2022/06/09 11:04:30 SWITCHBOARD postprocessing deactivated: constraints violated
I 2022/06/09 11:04:30 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=armstrong-gillian, 224199 bytes
I 2022/06/09 11:04:30 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 439, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 4)) = 247
I 2022/06/09 11:04:30 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kenton-erle-c, 224183 bytes
I 2022/06/09 11:04:30 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=armstrong-gillian, STACKING TIME = 1, PARSING TIME = 33
I 2022/06/09 11:04:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://s3.amazonaws.com/criterion-production/films/1076892f9021ddb7c859fd3c8e320e2a/Ci81X9eJjj5UeZxOsh0yqJlCk9CGOm_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kenton-erle-c, STACKING TIME = 3, PARSING TIME = 43
I 2022/06/09 11:04:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/films/9de0eda5b161e218aec5e45c9f71bc46/CIZxKmCjJsPRkHDp0FKzMMDIX4n72l_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=armstrong-gillian
I 2022/06/09 11:04:31 Fulltext indexing: lR4q4m_26NP5 https://www.criterion.com/shop/browse/list?director=armstrong-gillian
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lR4q4m_26NP5 (1735154908351430656)]} 0 4
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=armstrong-gillian [lR4q4m_26NP5]
Description: Gillian Armstrong films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12474 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=kenton-erle-c
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lZg9rm_26NP5 (1735154908420636672)]} 0 2
I 2022/06/09 11:04:31 Fulltext indexing: lZg9rm_26NP5 https://www.criterion.com/shop/browse/list?director=kenton-erle-c
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=kenton-erle-c [lZg9rm_26NP5]
Description: Erle C. Kenton films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12467 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:31 HTCACHE storing content of url https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau, 71186 bytes
I 2022/06/09 11:04:31 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau, STACKING TIME = 2, PARSING TIME = 60
I 2022/06/09 11:04:31 REJECTED https://www.theguardian.com/film/2019/may/15/bacurau-review-brazil-outback-western-cannes - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://lwlies.com/festivals/bacarau-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 437, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 7)) = 244
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.festival-cannes.com/en/festival/films/bacurau - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/embed/Hr49Ayyb3zs?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.telegraph.co.uk/films/0/bacurau-review-bloodsoaked-brazilian-sci-fi-western-shades-john/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://criterion-production.s3.amazonaws.com/LuFa9XKpt7y280rU9YIY3VdsTEwMsM.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-3-when-push-comes-to-shove - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-chang-notebook-bacurau-les-miserables-deerskin-20190516-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.hollywoodreporter.com/review/bacurau-review-1211067 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://thefilmstage.com/reviews/cannes-review-bacurau-is-a-john-carpenter-inspired-politically-fueled-revenge-fantasy/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 25 words in URL https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau
I 2022/06/09 11:04:31 Fulltext indexing: lGV5yG_26NP5 https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lGV5yG_26NP5 (1735154908558000128)]} 0 2
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 473 words in URL https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau [lGV5yG_26NP5]
Description: Mendonça and Dornelles’s Bacurau | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5403 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:31 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 437, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:31 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kerrigan-lodge, 224160 bytes
I 2022/06/09 11:04:31 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kerrigan-lodge, STACKING TIME = 2, PARSING TIME = 33
I 2022/06/09 11:04:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/films/072341219cb469ae34678a376c4cd241/Zj2dWjjyBYisz6T1eembGfuvePjxGp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 437, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=kerrigan-lodge
I 2022/06/09 11:04:31 Fulltext indexing: k873Wm_26NP5 https://www.criterion.com/shop/browse/list?director=kerrigan-lodge
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[k873Wm_26NP5 (1735154909134716928)]} 0 3
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse/list?director=kerrigan-lodge [k873Wm_26NP5]
Description: Lodge Kerrigan films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12460 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:31 HTCACHE storing content of url https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole, 72068 bytes
I 2022/06/09 11:04:31 SWITCHBOARD CRAWL: ADDED 64 LINKS FROM https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole, STACKING TIME = 2, PARSING TIME = 7
I 2022/06/09 11:04:31 REJECTED https://www.festival-cannes.com/en/films/beanpole - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://criterion-production.s3.amazonaws.com/n8sCcCzSAzLL1KiugbHE7WM5fJ7SHf.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://variety.com/2019/film/festivals/filmmaker-kantemir-balagov-talks-about-his-cannes-un-certain-regard-drama-beanpole-1203216225/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.indiewire.com/2019/05/beanpole-review-cannes-2019-1202141983/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.rogerebert.com/cannes/cannes-2019-for-sama-beanpole - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/embed/-2K0_PfthrY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://film.avclub.com/postwar-drama-and-an-unnerving-spin-on-a-sci-fi-classic-1834865079 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.hollywoodreporter.com/review/beanpole-review-1211204 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-3-when-push-comes-to-shove - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.ioncinema.com/reviews/kantemir-balagov-beanpole-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://variety.com/2019/film/markets-festivals/beanpole-review-1203215728/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 REJECTED https://www.screendaily.com/reviews/beanpole-cannes-review/5139505.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole
I 2022/06/09 11:04:31 Fulltext indexing: k4x7HG_26NP5 https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[k4x7HG_26NP5 (1735154909267886080)]} 0 1
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 522 words in URL https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole [k4x7HG_26NP5]
Description: Kantemir Balagov’s Beanpole | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6179 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:31 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 437, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:32 HTCACHE storing content of url https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse, 71911 bytes
I 2022/06/09 11:04:32 SWITCHBOARD CRAWL: ADDED 65 LINKS FROM https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse, STACKING TIME = 1, PARSING TIME = 53
I 2022/06/09 11:04:32 REJECTED https://www.rogerebert.com/cannes/cannes-2019-the-lighthouse-lux-aeterna - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.vanityfair.com/hollywood/2019/05/robert-pattinson-the-lighthouse-movie-review-cannes - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/posts/7823-tribeca-2022 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/posts/7819-early-summer-reading - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://lwlies.com/festivals/the-lighthouse-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/series/did-you-see-this - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://twitter.com/A24/status/1130602426946543616 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://variety.com/2019/film/reviews/the-lighthouse-review-robert-pattinson-willem-dafoe-1203220127/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-6-teenage-martyrs-and-lighthouse-keepers - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.quinzaine-realisateurs.com/en/film/the-lighthouse/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/category/1-on-film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/series/cannes-2019 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.thedailybeast.com/robert-pattinson-loses-his-damn-mind-in-cannes-film-festivals-the-lighthouse - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://thefilmstage.com/reviews/the-light-house-cannes-review-robert-pattinson-willem-dafoe/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://theplaylist.net/lighthouse-cannes-review-20190519/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/posts/7822-irma-vep-revamp - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-production.s3.amazonaws.com/6LxWp33V8LLIofe5AHi4VLIOD0FDMX.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/category/20-the-daily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/author/654-david-hudson - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/posts/7818-american-neorealism-now - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.telegraph.co.uk/films/0/lighthouse-review-film-will-make-head-soul-ring/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse
I 2022/06/09 11:04:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kYbziG_26NP5 (1735154909454532608)]} 0 2
I 2022/06/09 11:04:32 Fulltext indexing: kYbziG_26NP5 https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse
I 2022/06/09 11:04:32 SWITCHBOARD *Indexed 559 words in URL https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse [kYbziG_26NP5]
Description: Robert Eggers’s The Lighthouse | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6375 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:32 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 435, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 14)) = 237
I 2022/06/09 11:04:32 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nakahira-ko, 224182 bytes
I 2022/06/09 11:04:32 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=nakahira-ko, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/films/1d775c19a4731003ec45c846f2134507/OJ4jbjYEb96JR4ixXoUpfsFPa3Yvmb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:32 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 435, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:32 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=nakahira-ko
I 2022/06/09 11:04:32 Fulltext indexing: kX7U9m_26NP5 https://www.criterion.com/shop/browse/list?director=nakahira-ko
I 2022/06/09 11:04:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kX7U9m_26NP5 (1735154909934780416)]} 0 4
I 2022/06/09 11:04:32 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=nakahira-ko [kX7U9m_26NP5]
Description: Kô Nakahira films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12475 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:32 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:32 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@5b6ac81a[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:32 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:32 HTCACHE storing content of url https://www.criterion.com/current/author/856-robert-daniels, 50068 bytes
I 2022/06/09 11:04:32 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/856-robert-daniels, STACKING TIME = 0, PARSING TIME = 4
I 2022/06/09 11:04:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kVg7QG_26NP5 (1735154910045929472)]} 0 1
I 2022/06/09 11:04:32 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/current/author/856-robert-daniels
I 2022/06/09 11:04:32 Fulltext indexing: kVg7QG_26NP5 https://www.criterion.com/current/author/856-robert-daniels
I 2022/06/09 11:04:32 SWITCHBOARD *Indexed 135 words in URL https://www.criterion.com/current/author/856-robert-daniels [kVg7QG_26NP5]
Description: Robert Daniels | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1981 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:32 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 433, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:32 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo, 224175 bytes
I 2022/06/09 11:04:32 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 436, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo, STACKING TIME = 1, PARSING TIME = 30
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/755cc14a06822abce0859f66f77ba87d/4ojfoQo0qFPOFyenaCA774vHWYBMWt_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo
I 2022/06/09 11:04:33 Fulltext indexing: kWH4_m_26NP5 https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo
I 2022/06/09 11:04:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kWH4_m_26NP5 (1735154910451728384)]} 0 2
I 2022/06/09 11:04:33 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo [kWH4_m_26NP5]
Description: Nobuo Nakagawa films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12491 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:33 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 436, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:33 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bruckman-clyde, 224211 bytes
I 2022/06/09 11:04:33 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 439, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
I 2022/06/09 11:04:33 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=gremillon-jean, 225904 bytes
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=bruckman-clyde, STACKING TIME = 1, PARSING TIME = 144
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/3a4a52811b630a9836c1b10cb2c55a38/1DZVBE8PnMfkggyvh5s9f7K2TSAiF0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=gremillon-jean, STACKING TIME = 1, PARSING TIME = 68
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1810-937c264a8f71a4e7b7e56fd9bb1f6573/RH8phccQiFbDuedTfBmuDM9ZgEXY4E_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/2246578d818804de9a010ec2f6761940/LS7nGzD3CGCHvbfnW7CFToCn83HJRf_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/1819692b0157948bf277f85e34504c85/HU1NSd7jljSoSS3ka7bypfce5Wx7X4_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/9f4c81fc85a20a9ca4af74f218536096/4MH6hVt4vxfz8BFDRrOc5UO5HC841g_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bruckman-clyde
I 2022/06/09 11:04:33 Fulltext indexing: kUor1m_26NP5 https://www.criterion.com/shop/browse/list?director=bruckman-clyde
I 2022/06/09 11:04:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kUor1m_26NP5 (1735154911115476992)]} 0 3
I 2022/06/09 11:04:33 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=bruckman-clyde [kUor1m_26NP5]
Description: Clyde Bruckman films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12484 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:33 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 440, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 9)) = 242
I 2022/06/09 11:04:33 HTCACHE storing content of url https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law, 71327 bytes
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law, STACKING TIME = 5, PARSING TIME = 14
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/embed/9NR--aUs7gY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/b069abe928385f97336491824e25923b/MjNYU96ZyBqDUuGvnpoqXLFz6MVKPS_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/f01fd5cd8ceeda0a6b33f78e25f81d98/dYeHQUY0SAGvBWP5y5tDXGWAYDhgrh_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/images/5009-0fef25298fac51d7cbd5e4fc73c07983/Mailer_Episode_1_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=gremillon-jean
I 2022/06/09 11:04:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kUQ9Xm_26NP5 (1735154911315755008)]} 0 2
I 2022/06/09 11:04:33 Fulltext indexing: kUQ9Xm_26NP5 https://www.criterion.com/shop/browse/list?director=gremillon-jean
I 2022/06/09 11:04:33 SWITCHBOARD *Indexed 1211 words in URL https://www.criterion.com/shop/browse/list?director=gremillon-jean [kUQ9Xm_26NP5]
Description: Jean Grémillon films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12622 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:33 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law
I 2022/06/09 11:04:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[jn7tkG_26NP5 (1735154911333580800)]} 0 1
I 2022/06/09 11:04:33 Fulltext indexing: jn7tkG_26NP5 https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law
I 2022/06/09 11:04:33 SWITCHBOARD *Indexed 275 words in URL https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law [jn7tkG_26NP5]
Description: Pennebaker on Mailer: Wild 90 and Beyond the Law | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3231 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:33 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=menzies-william-cameron, 224247 bytes
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=menzies-william-cameron, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/8acc91247738186c941d26d985dd25d1/Tq53ScFvzrnFOqKLNLZfExM0t52C7k_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=menzies-william-cameron
I 2022/06/09 11:04:34 Fulltext indexing: kPW78m_26NP5 https://www.criterion.com/shop/browse/list?director=menzies-william-cameron
I 2022/06/09 11:04:34 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 442, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kPW78m_26NP5 (1735154911474089984)]} 0 5
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=menzies-william-cameron [kPW78m_26NP5]
Description: William Cameron Menzies films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12495 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:34 HTCACHE storing content of url https://www.criterion.com/current/author/179-pauline-kael, 54697 bytes
I 2022/06/09 11:04:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/current/author/179-pauline-kael, STACKING TIME = 2, PARSING TIME = 4
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[i91qcG_26NP5 (1735154911712116736)]} 0 1
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/current/author/179-pauline-kael
I 2022/06/09 11:04:34 Fulltext indexing: i91qcG_26NP5 https://www.criterion.com/current/author/179-pauline-kael
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 185 words in URL https://www.criterion.com/current/author/179-pauline-kael [i91qcG_26NP5]
Description: Pauline Kael | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2270 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:34 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 439, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:34 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=turell-saul-j, 225976 bytes
I 2022/06/09 11:04:34 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=turell-saul-j, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/films/d77d4a38d63c784669e245e92055766d/Z8aTrBp2gBHeTKrc8NwYihqR92yy9L_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/films/69a755ba1d29f769584b674d4114ac40/gGzssRokuyhN2VwbxmWfgZJVIq2EM7_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 442, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 11 words in URL https://www.criterion.com/shop/browse/list?director=turell-saul-j
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[jV-1sm_26NP5 (1735154912029835264)]} 0 2
I 2022/06/09 11:04:34 Fulltext indexing: jV-1sm_26NP5 https://www.criterion.com/shop/browse/list?director=turell-saul-j
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 1210 words in URL https://www.criterion.com/shop/browse/list?director=turell-saul-j [jV-1sm_26NP5]
Description: Saul J. Turell films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12647 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:34 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=rees-dee, 224126 bytes
I 2022/06/09 11:04:34 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=rees-dee, STACKING TIME = 0, PARSING TIME = 84
I 2022/06/09 11:04:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/films/a68a36ba70947ac3704098a3860aa0b8/wHzhBwq8bLT9UrBKTI9PpiwWDV3YAM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 442, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=rees-dee
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[i9ZO_m_26NP5 (1735154912336019456)]} 0 2
I 2022/06/09 11:04:34 Fulltext indexing: i9ZO_m_26NP5 https://www.criterion.com/shop/browse/list?director=rees-dee
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=rees-dee [i9ZO_m_26NP5]
Description: Dee Rees films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12452 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:34 HTCACHE storing content of url https://www.criterion.com/current/author/404-hope-parrish, 48381 bytes
I 2022/06/09 11:04:34 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/404-hope-parrish, STACKING TIME = 0, PARSING TIME = 4
I 2022/06/09 11:04:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/current/author/404-hope-parrish
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[iecx0G_26NP5 (1735154912428294144)]} 0 1
I 2022/06/09 11:04:34 Fulltext indexing: iecx0G_26NP5 https://www.criterion.com/current/author/404-hope-parrish
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 120 words in URL https://www.criterion.com/current/author/404-hope-parrish [iecx0G_26NP5]
Description: Hope Parrish | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1421 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:35 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 439, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:35 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=fukunaga-cary-joji, 224197 bytes
I 2022/06/09 11:04:35 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=fukunaga-cary-joji, STACKING TIME = 0, PARSING TIME = 22
I 2022/06/09 11:04:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/d625b0e7f179fa73f1d10d4ff66873a6/KwcufB3P2S9l5e6g7eQitniSmR32hr_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 HTCACHE storing content of url https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list, 55104 bytes
I 2022/06/09 11:04:35 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=fukunaga-cary-joji
I 2022/06/09 11:04:35 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list, STACKING TIME = 1, PARSING TIME = 67
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/1522135761424c32d477da9851f016ff/33GcylWNQvIKPneqIDDpcJnFPoUtSg_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/a8c39a413f3134b2940ede42c30c02d3/qL6ZEBtqgDUf0j76xJ6GFEpaGQYXl3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/ea8616829a288b5b6d680c9f6b66ba59/03UzOLZzQogXDtQOTkIp8BbLpZWGYM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/d2c4d40b1ce44f03b1c60a2ae9829ded/EGvQdEyNez1O8QmzlhTj1a1gyGcKHg_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[iySfDm_26NP5 (1735154912798441472)]} 0 2
I 2022/06/09 11:04:35 Fulltext indexing: iySfDm_26NP5 https://www.criterion.com/shop/browse?director=fukunaga-cary-joji
I 2022/06/09 11:04:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/22a10f46e2d950d3ab907e62e119bd61/QMb2egiiChRALGyT7ZrOLX40p36N5I_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/6813575ce7945498b15effc2cef1777a/Kcg04nsDzd1F6UYUmtIJ4pn5qRcDCz_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/bed1dc8df02842d6a75325665e718ebd/da8xTBLVhcfx0KQXSyOOMImKRe6s2r_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 403, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:35 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse?director=fukunaga-cary-joji [iySfDm_26NP5]
Description: Cary Joji Fukunaga films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12425 bytes |
LinkStorageTime: 17 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:35 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ic4nAm_26NP5 (1735154912814170112)]} 0 1
I 2022/06/09 11:04:35 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list
I 2022/06/09 11:04:35 Fulltext indexing: ic4nAm_26NP5 https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list
I 2022/06/09 11:04:35 SWITCHBOARD *Indexed 157 words in URL https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list [ic4nAm_26NP5]
Description: Volker Schlöndorff | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1707 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:35 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 403, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:35 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 403, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:35 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=makavejev-dusan, 227186 bytes
I 2022/06/09 11:04:35 SWITCHBOARD CRAWL: ADDED 48 LINKS FROM https://www.criterion.com/shop/browse/list?director=makavejev-dusan, STACKING TIME = 1, PARSING TIME = 44
I 2022/06/09 11:04:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/5eb3b02cc20d9090e506fdd942b92cab/WsP71MgPQ5GmNEtT7QghC3MI3epzGp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/c8e739d705dfcfc85250cd8e7964cac5/Z2weqqbJY8cvVdxGqDCakEj4IjlDMF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/561ccce505c4622a24bd32dfae8565e5/G3vBQac0x5LaFJBmoSIKMWRIRLhO2a_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/31ec952d32145bfe242a4c1b187021fa/bNa4o4AivhUBPdacbpJN6IKxha7994_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1794-050c8e7aa2d3ba1ddbd296d108239cf1/shvY9H0K9PWixHcIAuuAUuKL7GcP2V_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/688737ce839fa08fa7fda02cbabcbb27/IHT6Dhx73vSSKCSX8RLh2ThYArDcGb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 SWITCHBOARD Excluded 11 words in URL https://www.criterion.com/shop/browse/list?director=makavejev-dusan
I 2022/06/09 11:04:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[iBitwm_26NP5 (1735154913560756224)]} 0 3
I 2022/06/09 11:04:36 Fulltext indexing: iBitwm_26NP5 https://www.criterion.com/shop/browse/list?director=makavejev-dusan
I 2022/06/09 11:04:36 SWITCHBOARD *Indexed 1228 words in URL https://www.criterion.com/shop/browse/list?director=makavejev-dusan [iBitwm_26NP5]
Description: Dušan Makavejev films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12802 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:36 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=edgren-gustaf, 224803 bytes
I 2022/06/09 11:04:36 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 462, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 5)) = 245
I 2022/06/09 11:04:36 HTCACHE storing content of url https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life, 119555 bytes
I 2022/06/09 11:04:36 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=edgren-gustaf, STACKING TIME = 1, PARSING TIME = 125
I 2022/06/09 11:04:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/films/9ae2fdb4db278ca13d1d6f9f72c7f9d1/7AOCgcEUHNbk2QLpyr837sD4mNACMd_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 SWITCHBOARD CRAWL: ADDED 75 LINKS FROM https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life, STACKING TIME = 6, PARSING TIME = 20
I 2022/06/09 11:04:36 REJECTED http://www.vulture.com/2018/01/review-private-life-is-a-dazzling-comedy-about-families.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://variety.com/2018/film/reviews/private-life-review-sundance-paul-giamatti-kathryn-hahn-1202668747/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.ioncinema.com/reviews/private-life-tamara-jenkins-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://beta.latimes.com/entertainment/movies/la-et-mn-sundance-tamara-jenkins-private-life-20180119-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.youtube.com/embed/_CBQzRTPHRo - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.timeout.com/us/film/private-life - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.pastemagazine.com/articles/2018/01/private-life.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://uproxx.com/filmdrunk/private-life-movie-review-sundance/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.cityweekly.net/BuzzBlog/archives/2018/01/19/sundance-film-festival-2018-day-1-capsules - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-sundance-diary-justin-chang-20180120-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://www.indiewire.com/2018/01/sundance-2018-private-life-tamara-jenkins-netflix-1201918180/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://www.indiewire.com/2018/01/private-life-review-tamara-jenkins-paul-giamatti-kathryn-hahn-sundance-2018-1201919179/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.rogerebert.com/sundance/sundance-2018-private-life - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.thedailybeast.com/private-life-the-perfect-sundance-opening-night-film-11-years-in-the-making - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://theplaylist.net/private-life-sundance-review-20180119/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-sundance-day-two/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/images/9484-bdac577c62fb0f754068f8b5a2e823d4/privatelife01192018_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.avclub.com/paul-giamatti-and-kathryn-hahn-try-to-get-pregnant-in-t-1822250381 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://www.sundance.org/projects/private-life - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED http://flavorwire.com/612744/the-best-and-worst-movies-of-the-2018-sundance-film-festival/8 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.rollingstone.com/movies/news/sundance-2018-blindspotting-private-life-hits-fest-sweet-spots-w515630 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.hollywoodreporter.com/review/private-life-review-1075835 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://thefilmstage.com/reviews/sundance-review-private-life-finds-hardship-honesty-and-humor-in-infertility/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=edgren-gustaf
I 2022/06/09 11:04:36 Fulltext indexing: hd5_fm_26NP5 https://www.criterion.com/shop/browse/list?director=edgren-gustaf
I 2022/06/09 11:04:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[hd5_fm_26NP5 (1735154913828143104)]} 0 5
I 2022/06/09 11:04:36 HTCACHE storing content of url https://www.criterion.com/current/author/405-marie-nyrer-d, 48881 bytes
I 2022/06/09 11:04:36 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 416, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:36 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/405-marie-nyrer-d, STACKING TIME = 10, PARSING TIME = 6
I 2022/06/09 11:04:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 SWITCHBOARD *Indexed 1199 words in URL https://www.criterion.com/shop/browse/list?director=edgren-gustaf [hd5_fm_26NP5]
Description: Gustaf Edgren films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12522 bytes |
LinkStorageTime: 245 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:36 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life
I 2022/06/09 11:04:36 Fulltext indexing: hdPeQG_26NP5 https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life
I 2022/06/09 11:04:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[hdPeQG_26NP5 (1735154914092384256)]} 0 4
I 2022/06/09 11:04:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[hX3j6G_26NP5 (1735154914099724288)]} 0 0
I 2022/06/09 11:04:36 SWITCHBOARD *Indexed 875 words in URL https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life [hdPeQG_26NP5]
Description: Sundance 2018: Tamara Jenkins’s Private Life | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12053 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:36 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/current/author/405-marie-nyrer-d
I 2022/06/09 11:04:36 Fulltext indexing: hX3j6G_26NP5 https://www.criterion.com/current/author/405-marie-nyrer-d
I 2022/06/09 11:04:36 SWITCHBOARD *Indexed 118 words in URL https://www.criterion.com/current/author/405-marie-nyrer-d [hX3j6G_26NP5]
Description: Marie Nyreröd | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1416 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:36 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 416, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:36 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 416, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:36 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=morris-errol, 226428 bytes
I 2022/06/09 11:04:37 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bay-michael, 224667 bytes
I 2022/06/09 11:04:37 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=morris-errol, STACKING TIME = 1, PARSING TIME = 26
I 2022/06/09 11:04:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1776-78eb25c559a866198d58680c9874465e/N2awPkaE0xOiPEj26tjI9yv2eLLXZM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/cc08b14f22e5cb4d4abc35e2bd1e76eb/duZpzWtzx94ixvPnyavkOi9YOZqWR6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/3217a6e471ba2f9239cbe6cb398aa02f/dgNduEohOof1NohG0fGVRpPOyb322m_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/939ec46490d500823654f65419a8db04/f57BMlw9kdr7YF0EiuQtIM1i67rFHU_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/5199593d0fcdff78d678ad5ac1745fa9/vZ1yJIDRTUcAEmQs90SEEQ7tQvHCqu_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 446, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:37 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bay-michael, STACKING TIME = 1, PARSING TIME = 103
I 2022/06/09 11:04:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/07f4eeb745b2223e67ad82c6dc2e3ed3/o6c4ES95CWTBBINB8ahqL4niihMuLT_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/7858f806be01113ec3f26840d2ec0cab/zcQiVXuZFCisdXVhLeyD96Hxr6Vqp8_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=morris-errol
I 2022/06/09 11:04:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[hWtCam_26NP5 (1735154914750889984)]} 0 2
I 2022/06/09 11:04:37 Fulltext indexing: hWtCam_26NP5 https://www.criterion.com/shop/browse/list?director=morris-errol
I 2022/06/09 11:04:37 SWITCHBOARD *Indexed 1214 words in URL https://www.criterion.com/shop/browse/list?director=morris-errol [hWtCam_26NP5]
Description: Errol Morris films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12699 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:37 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bay-michael
I 2022/06/09 11:04:37 Fulltext indexing: g5RLVm_26NP5 https://www.criterion.com/shop/browse/list?director=bay-michael
I 2022/06/09 11:04:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[g5RLVm_26NP5 (1735154914815901696)]} 0 2
I 2022/06/09 11:04:37 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=bay-michael [g5RLVm_26NP5]
Description: Michael Bay films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12503 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:37 HTCACHE storing content of url https://www.criterion.com/films/1430, 76513 bytes
I 2022/06/09 11:04:37 SWITCHBOARD CRAWL: ADDED 63 LINKS FROM https://www.criterion.com/films/1430, STACKING TIME = 1, PARSING TIME = 8
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/971-7c4c45c3fdeb5631e89f5bd69b34fd70/Dzc8TKpXR0nl2nAvmoZBjShpANSY3t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/966-b448a73623a3104f16704d630fdc4d4a/tAsvl7EI5McyRgmZlFekuGnwgQ7p0o_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/posts/1158-c313ab31d62f8c87337a018fca5b9d64/IMAMURA_Rayns_still_original.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/the-insect-woman?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/7740c3bbca66b23dc22b0d7950193f24.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9594df5f105bf7ba4cfc780077d38c50.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/98fca346151a9909ce777d3c7bcf4e14/NiN005MtaZLuhUJZOq7z7jOk0wRd02_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/955f6f5f61b98e8a440adbaff6544904/Pg6ZGiA0S3eXXGhlqVdh9PGdQtPvUw_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/dd75421ca3130652d533fe8683dbda57/gu0f7I6dfxl8kO9uUrhIaVX2xcduMR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/2d39dc6357f78246124d9b889cfee438.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/feabbe7bae1e5ba2f447871de1c77dc0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/1cc4a3cbedc3d14be75648a0c88c3ebe.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/a0ee213bcd52a61768546b6f50b49e93/oUj4Rhj3eltZ2KJezVL8MqVJVtmf8S_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1855-d1af5d0a5e3d807a317f3cc4e9c52f38/vdPxhXhBYsJOygUcZ3XqUkDiS3dZOl_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ec6c0c425195715ec06d674b8f7973e5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/films/1430
I 2022/06/09 11:04:37 Fulltext indexing: gtsbye_26NP5 https://www.criterion.com/films/1430
I 2022/06/09 11:04:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[gtsbye_26NP5 (1735154914917613568)]} 0 2
I 2022/06/09 11:04:37 SWITCHBOARD *Indexed 348 words in URL https://www.criterion.com/films/1430 [gtsbye_26NP5]
Description: The Insect Woman (1963) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4037 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:37 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:37 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:37 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:37 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:37 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@107d2a8c[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wl(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM 
Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772677667}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:37 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:37 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=whelan-tim, 224199 bytes
I 2022/06/09 11:04:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=whelan-tim, STACKING TIME = 3, PARSING TIME = 35
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/41132bea28bb3bbea12d52e78c20b378/kl6ebsg1AK3m1ejL3zvPJa17lTmxcW_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:37 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=whelan-tim
I 2022/06/09 11:04:37 Fulltext indexing: gokr_m_26NP5 https://www.criterion.com/shop/browse/list?director=whelan-tim
I 2022/06/09 11:04:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[gokr_m_26NP5 (1735154915423027200)]} 0 4
I 2022/06/09 11:04:37 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=whelan-tim [gokr_m_26NP5]
Description: Tim Whelan films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12495 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:37 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 462, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=tanaka-tokuzo, 225536 bytes
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 51 LINKS FROM https://www.criterion.com/shop/browse?director=tanaka-tokuzo, STACKING TIME = 7, PARSING TIME = 90
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/554a88f3e461b3af84a8d7c74395c982/ITGgOs4mQf6gDsezMZFForMyh7w2oR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=jr, 224620 bytes
I 2022/06/09 11:04:38 HostQueue forcing crawl-delay of 250 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 0)) = 250
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/53029c4265e2623dc5d2e1437fdb0a15/natIW8grGAtqgN1rxflDsqZ0woLxK2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/584243c99fe050be8794d19464fd5cc6/xyjrEVJ3oamY6KDEMHoxNgk8t2lRpc_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=jr, STACKING TIME = 1, PARSING TIME = 47
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/afeea27c388b3547969c9f509df11cb5/3wuQsvYjsHTc4015Rl3wQBQsb7T9kt_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1958-60bad70091eaab48ce960c49b1f07d94/zQHE891LazZ0Sw7jE3PnOSXsdP3yew_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=tanaka-tokuzo
I 2022/06/09 11:04:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[gdLEnm_26NP5 (1735154915879157760)]} 0 2
I 2022/06/09 11:04:38 Fulltext indexing: gdLEnm_26NP5 https://www.criterion.com/shop/browse?director=tanaka-tokuzo
I 2022/06/09 11:04:38 SWITCHBOARD *Indexed 1203 words in URL https://www.criterion.com/shop/browse?director=tanaka-tokuzo [gdLEnm_26NP5]
Description: Tokuzo Tanaka films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12543 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy, 72322 bytes
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 51 LINKS FROM https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy, STACKING TIME = 1, PARSING TIME = 15
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1934-9121909235b34c704ecc53b432150a94/dZNZ1nCXJbEhZeNY4YNXuFLwJPn0yD_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/1901628472b08db9f6af1a6fcb778ef5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/7cb52be2fd1572e3a5b5276e84d487fa/TECVVTeHMMxaHWxosd91eIX8qS2hMK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/7b27c9719b8fe36e334fa0ba43910a0e/GwB3ma4IYcYsr5CZ4ZizQW0i0mxyxc_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/065858072a69f217ae517f2cc87a2c68/0T7IhsxzrE4rW48efa7kEHGrSe77j0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e8457d67a289fb50aad38901ea42f732.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/412a7c9e10c6bb26f1d1190e87cc6014.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1852-f20e8f5bdc458777cda90775598ef89c/G2jPTXWEYclcKFXC7LYWoNEj5D7V7l_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie, 71411 bytes
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie, STACKING TIME = 4, PARSING TIME = 13
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 465, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:38 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=jr
I 2022/06/09 11:04:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[gFLNjm_26NP5 (1735154916080484352)]} 0 2
I 2022/06/09 11:04:38 Fulltext indexing: gFLNjm_26NP5 https://www.criterion.com/shop/browse/list?director=jr
I 2022/06/09 11:04:38 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=jr [gFLNjm_26NP5]
Description: JR films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12462 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:38 SWITCHBOARD Excluded 22 words in URL https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy
I 2022/06/09 11:04:38 Fulltext indexing: f9jsz3_26NP5 https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy
I 2022/06/09 11:04:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[f9jsz3_26NP5 (1735154916127670272)]} 0 4
I 2022/06/09 11:04:38 SWITCHBOARD *Indexed 448 words in URL https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy [f9jsz3_26NP5]
Description: Roberto Rossellini’s War Trilogy | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7665 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:38 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie
I 2022/06/09 11:04:38 Fulltext indexing: fr0NEG_26NP5 https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie
I 2022/06/09 11:04:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[fr0NEG_26NP5 (1735154916183244800)]} 0 2
I 2022/06/09 11:04:38 SWITCHBOARD *Indexed 668 words in URL https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie [fr0NEG_26NP5]
Description: The Discreet Charm of the Bourgeosie | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 9166 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:38 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 465, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=carra-lucille, 224170 bytes
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/current/posts/7516-previewing-venice-2021, 78718 bytes
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=carra-lucille, STACKING TIME = 3, PARSING TIME = 71
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 453, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/9a029e3b8977bb7f3cb8290bfbd4f9a4/vXIQATcZON4HV9vCqpFDM8ZTMrR42c_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 69 LINKS FROM https://www.criterion.com/current/posts/7516-previewing-venice-2021, STACKING TIME = 1, PARSING TIME = 32
I 2022/06/09 11:04:38 REJECTED https://www.indiewire.com/2021/08/the-hand-of-god-paolo-sorrentino-interview-1234659825/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/news/bong-joon-ho-venice-film-festival-covid-19-netflix-1235053563/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.vanityfair.com/hollywood/2021/08/awards-insider-first-look-jane-campion-power-of-the-dog - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.vanityfair.com/hollywood/2021/08/awards-insider-maggie-gyllenhaal-lost-daughter-first-look - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://news.yahoo.com/dune-long-considered-unadaptable-screenwriters-190014204.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.labiennale.org/en/news/jamie-lee-curtis-golden-lion-lifetime-achievement - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.labiennale.org/en/cinema/2021/lineup/venezia-78-competition/madres-paralelas - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/spotlight/venice-barbera-oscar-1235051867/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.vulture.com/2021/08/pablo-larran-interview-on-spencer-and-biopics.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.labiennale.org/en/news/roberto-benigni-golden-lion-lifetime-achievement - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.theguardian.com/lifeandstyle/2018/oct/06/maggie-gyllenhaal-elena-ferrante-film-book - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://criterion-production.s3.amazonaws.com/h1c8DOClXrEBKiQVLBn1deSMKCAjRQ.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://deadline.com/2021/08/timothee-chalamet-interview-dune-venice-film-festival-1234824699/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.hollywoodreporter.com/movies/movie-news/edgar-wright-last-night-in-soho-baby-driver-2-1235005922/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://www.indiewire.com/2021/08/the-card-counter-paul-schrader-interview-1234660264/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/spotlight/how-venice-topper-barbera-earned-varietys-intl-achievement-in-film-award-1235051861/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/spotlight/venice-barbera-biennale-1235051970/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/spotlight/jamie-lee-curtis-talks-halloween-venice-golden-lion-1235050354/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=carra-lucille
I 2022/06/09 11:04:39 Fulltext indexing: fQHXDm_26NP5 https://www.criterion.com/shop/browse/list?director=carra-lucille
I 2022/06/09 11:04:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[fQHXDm_26NP5 (1735154916723261440)]} 0 6
I 2022/06/09 11:04:39 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=carra-lucille [fQHXDm_26NP5]
Description: Lucille Carra films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12468 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:39 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/7516-previewing-venice-2021
I 2022/06/09 11:04:39 Fulltext indexing: fGGLtG_26NP5 https://www.criterion.com/current/posts/7516-previewing-venice-2021
I 2022/06/09 11:04:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[fGGLtG_26NP5 (1735154916788273152)]} 0 2
I 2022/06/09 11:04:39 SWITCHBOARD *Indexed 821 words in URL https://www.criterion.com/current/posts/7516-previewing-venice-2021 [fGGLtG_26NP5]
Description: Previewing Venice 2021 | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 11559 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:39 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 453, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:39 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 453, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:39 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mann-michael, 224122 bytes
I 2022/06/09 11:04:39 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=mann-michael, STACKING TIME = 2, PARSING TIME = 32
I 2022/06/09 11:04:39 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/films/5fdd7999dcd7824705792d1d95ee538f/r9qKHNq3ldPSJH2wwBckeNlDucBRMr_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 462, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:39 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=mann-michael
I 2022/06/09 11:04:39 Fulltext indexing: e0aRXm_26NP5 https://www.criterion.com/shop/browse/list?director=mann-michael
I 2022/06/09 11:04:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[e0aRXm_26NP5 (1735154917446778880)]} 0 2
I 2022/06/09 11:04:39 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=mann-michael [e0aRXm_26NP5]
Description: Michael Mann films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12447 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:39 HTCACHE storing content of url https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights, 86551 bytes
I 2022/06/09 11:04:39 SWITCHBOARD CRAWL: ADDED 97 LINKS FROM https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights, STACKING TIME = 6, PARSING TIME = 10
I 2022/06/09 11:04:39 REJECTED https://variety.com/2019/film/festivals/i-lost-my-body-director-jeremy-clapin-critics-week-breakout-1203221101/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/les-heros-ne-meurent-jamais - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/land-ashes-review-1212491 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-production.s3.amazonaws.com/9pQeSejRjwTNYTNNInqj87g0cdDUzN.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/ahetzkSwUdw?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://variety.com/2019/film/reviews/litigante-review-1203216875/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-4-struggling-for-justice-overcoming-grief - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://cineuropa.org/en/newsdetail/373051/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/wL8G7NVhk50?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/posts/7819-early-summer-reading - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/X-PIoLQ8OsU?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/spKShkRlFgc?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/nuestras-madres - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/you-deserve-a-lover-cannes-review/5139688.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/vivarium - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/the-unknown-saint - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/ikki-illa-meint - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/heroes-don-t-die-les-heros-ne-meurent-jamais-review-1210499 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://cineuropa.org/en/newsdetail/372503/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/our-mothers-cannes-review/5139788.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/posts/7822-irma-vep-revamp - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://twitter.com/midmarauder/status/1125844260706705408 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/tu-merites-un-amour - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://variety.com/2019/film/reviews/vivarium-review-jesse-eisenberg-imogen-poots-1203219403/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/dwelling-in-the-fuchun-mountains-cannes-review/5139844.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/AXyZmuR_mBA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/she-runs - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/you-deserve-a-lover-1212183 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/posts/7823-tribeca-2022 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/WodOCZtv1EY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/abou-leila - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/vivarium-cannes-review/5139622.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/series/did-you-see-this - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/i-lost-my-body-cannes-review/5139271.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/litigante - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/category/1-on-film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/series/cannes-2019 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/dwelling-in-the-fushun-mountains - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/unknown-saint-alaa-eddine-aljem-moroccan-buried-loot-comedy - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/hvitur-hvitur-dagur - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://womenandhollywood.com/cannes-2019-women-directors-meet-sofia-quiros-ceniza-negra/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/jai-perdu-mon-corps - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/O7oNgj0H788?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/ceniza-negra - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://player.vimeo.com/video/336205899 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/category/20-the-daily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/i-lost-my-body-jai-perdu-mon-corps-review-1210449 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/author/654-david-hudson - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/posts/7818-american-neorealism-now - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/news/winners-of-the-58supthsup-semaine-de-la-critique - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://variety.com/2019/film/reviews/heroes-dont-die-review-1203221561/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/b5doRU9tQ78?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/vivarium-review-1211969 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:39 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights
I 2022/06/09 11:04:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[eshkTG_26NP5 (1735154917631328256)]} 0 2
I 2022/06/09 11:04:39 Fulltext indexing: eshkTG_26NP5 https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights
I 2022/06/09 11:04:39 SWITCHBOARD *Indexed 883 words in URL https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights [eshkTG_26NP5]
Description: Critics Week Awards and Highlights | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 11269 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:39 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 459, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:40 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ivory-james, 224711 bytes
I 2022/06/09 11:04:40 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=ivory-james, STACKING TIME = 0, PARSING TIME = 24
I 2022/06/09 11:04:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/2a684a78d6b7c2e6aabb3341e08f0cf4/49RLuNCJcOFrAKVB1w00pKqmEQmXOs_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/5a82e5535265fb02f88a49f6b2fe730c/JbuzGZpZS8bG3GgdRDT5pEXIFKeAoa_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 481, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:40 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=ivory-james
I 2022/06/09 11:04:40 Fulltext indexing: e0JBOm_26NP5 https://www.criterion.com/shop/browse/list?director=ivory-james
I 2022/06/09 11:04:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[e0JBOm_26NP5 (1735154918013009920)]} 0 3
I 2022/06/09 11:04:40 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=ivory-james [e0JBOm_26NP5]
Description: James Ivory films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12521 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:40 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=manchevski-milcho, 224199 bytes
I 2022/06/09 11:04:40 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=manchevski-milcho, STACKING TIME = 1, PARSING TIME = 23
I 2022/06/09 11:04:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/f50d7a7d8ec2588c4bd6e4db00a8120f/jgAIXDTNR403btDmNLF1s6QJ7VILif_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 491, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:40 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=manchevski-milcho
I 2022/06/09 11:04:40 Fulltext indexing: d2YYkm_26NP5 https://www.criterion.com/shop/browse/list?director=manchevski-milcho
I 2022/06/09 11:04:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[d2YYkm_26NP5 (1735154918247890944)]} 0 2
I 2022/06/09 11:04:40 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=manchevski-milcho [d2YYkm_26NP5]
Description: Milcho Manchevski films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12485 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:40 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin, 224219 bytes
I 2022/06/09 11:04:40 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin, STACKING TIME = 1, PARSING TIME = 22
I 2022/06/09 11:04:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/6790c92a200a349fa2918b59929a5b7c/SxJFUmRYI29BBqY4Im7uNTWIYo2pJi_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin
I 2022/06/09 11:04:40 Fulltext indexing: d18l5m_26NP5 https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin
I 2022/06/09 11:04:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[d18l5m_26NP5 (1735154918436634624)]} 0 2
I 2022/06/09 11:04:40 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin [d18l5m_26NP5]
Description: Irvin S. Yeaworth Jr. films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12498 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:40 HTCACHE storing content of url https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest, 71114 bytes
I 2022/06/09 11:04:40 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest, STACKING TIME = 2, PARSING TIME = 6
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/3ed41f79deffcb3052099b02c9660e9b/zQOZgJoUsBEgi8arpM5w8aT22vGo6W_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/f8bf41c3e8d2266f423881ceb3159429/58bZDer5maXJjg6GDgD8Tyrr6ZZAuT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/8f12ceb5a2e46f5f1550942e055ef1af/5yl46GfrudlcteVtODCZveKlbIlys1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:40 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:04:40 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:04:40 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest
I 2022/06/09 11:04:40 Fulltext indexing: dxNXWG_26NP5 https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest
I 2022/06/09 11:04:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[dxNXWG_26NP5 (1735154918501646336)]} 0 8
I 2022/06/09 11:04:40 SWITCHBOARD *Indexed 296 words in URL https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest [dxNXWG_26NP5]
Description: Janus Films Acquires Kaurismäki's Latest | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3248 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:40 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:40 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:41 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=fincher-david, 224767 bytes
I 2022/06/09 11:04:41 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=fincher-david, STACKING TIME = 0, PARSING TIME = 23
I 2022/06/09 11:04:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/films/693744d52bb74cb5166725421bb473e6/d121BfwKuez4Xs7tpGnThzfqDXpCgK_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/films/999405f5b0b718043c15d1183d04bede/6ZXWpPhvznaU1VM6grQfjigdBN06Pi_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lee-bruce, 224739 bytes
I 2022/06/09 11:04:41 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=lee-bruce, STACKING TIME = 1, PARSING TIME = 39
I 2022/06/09 11:04:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1957-f90d4c48a2f932ffe7df386499f9477e/73k4EkSiXEfsdi097fieFBGdb39vlg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/films/30b0ea18473faf0c3bf0486b76b0b761/sMo9K1Z55wY1dYmHiRgWpISyYJbV5S_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=fincher-david
I 2022/06/09 11:04:41 Fulltext indexing: dRZgkm_26NP5 https://www.criterion.com/shop/browse/list?director=fincher-david
I 2022/06/09 11:04:41 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[dRZgkm_26NP5 (1735154919033274368)]} 0 15
I 2022/06/09 11:04:41 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=fincher-david [dRZgkm_26NP5]
Description: David Fincher films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12544 bytes |
LinkStorageTime: 18 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:41 HTCACHE storing content of url https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle, 95335 bytes
I 2022/06/09 11:04:41 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle, STACKING TIME = 6, PARSING TIME = 14
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=lee-bruce
I 2022/06/09 11:04:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[dCJyGm_26NP5 (1735154919186366464)]} 0 2
I 2022/06/09 11:04:41 Fulltext indexing: dCJyGm_26NP5 https://www.criterion.com/shop/browse/list?director=lee-bruce
I 2022/06/09 11:04:41 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=lee-bruce [dCJyGm_26NP5]
Description: Bruce Lee films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12508 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:41 HostQueue forcing crawl-delay of 230 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 20)) = 230
I 2022/06/09 11:04:41 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle
I 2022/06/09 11:04:41 Fulltext indexing: cupuTG_26NP5 https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle
I 2022/06/09 11:04:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[cupuTG_26NP5 (1735154919357284352)]} 0 9
I 2022/06/09 11:04:41 SWITCHBOARD *Indexed 1387 words in URL https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle [cupuTG_26NP5]
Description: Eclipse Series 2:The Documentaries of Louis Malle | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 23010 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:41 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:41 HTCACHE storing content of url https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii, 72384 bytes
I 2022/06/09 11:04:41 SWITCHBOARD CRAWL: ADDED 73 LINKS FROM https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii, STACKING TIME = 6, PARSING TIME = 11
I 2022/06/09 11:04:41 REJECTED http://variety.com/2018/film/asia/asako-i-ii-review-netetemo-sametemo-1202809972/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.hollywoodreporter.com/review/asako-i-ii-netemo-sametemo-film-review-cannes-2018-1111789 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.festival-cannes.com/en/festival/films/netemo-sametemo - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://criticsroundup.com/film/happy-hour/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/embed/6baCO63Y6ZM?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://thefilmstage.com/reviews/cannes-review-ryusuke-hamaguchis-happy-hour-follow-up-asako-i-ii-is-a-romance-lacking-in-passion/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.theguardian.com/film/2018/may/15/asako-i-ii-review-japanese-romcom-flips-gaze-ryusuke-hamaguchi - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED http://lwlies.com/festivals/asako-ii-first-look-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED http://www.indiewire.com/2018/05/asako-i-ii-review-ryusuke-hamaguchi-1201964358/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-8-transportive-doublings-and-divisive-titles - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://mubi.com/notebook/posts/hidden-in-reality-ryusuke-hamaguchi-and-asako-i-ii - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://criterion-production.s3.amazonaws.com/5PxkwlKNwsssK3VKZVW3lu0nTjQvOO.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-cannes-day-eight/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.rogerebert.com/cannes/cannes-2018-the-house-that-jack-built-at-war - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.screendaily.com/reviews/asako-i-and-ii-cannes-review/5129364.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/embed/7dZIy-0o9ZQ?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://icsfilm.org/reviews/cannes-2018-review-asako-i-ii-ryusuke-hamaguchi/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.filmcomment.com/blog/interview-ryusuke-hamaguchi/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://film.avclub.com/spike-lee-teams-up-with-jordan-peele-for-the-funny-poi-1826042384 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/asako-i-ii-mournful-hamaguchi-ryusuke-mournful-drama - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 REJECTED https://www.thewrap.com/asako-i-ii-film-review-leisurely-japanese-drama-explores-nature-of-love/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:41 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii
I 2022/06/09 11:04:41 Fulltext indexing: bghe0G_26NP5 https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii
I 2022/06/09 11:04:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[bghe0G_26NP5 (1735154919754694656)]} 0 2
I 2022/06/09 11:04:41 SWITCHBOARD *Indexed 427 words in URL https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii [bghe0G_26NP5]
Description: Ryusuke Hamaguchi's Asako I & II | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4890 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=eisenstein-sergei, 226536 bytes
I 2022/06/09 11:04:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 492, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/aed45dbeaf63414624b16890eb458dea/fSaXGJq2BBkowhHXw2FJft0UoNIdnI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1838-ee328a31205b114ef125fd81b54b5cd0/VZGhEsbGQY3luUNqMc64IKmXGoRe9U_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=eisenstein-sergei, STACKING TIME = 3, PARSING TIME = 41
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/19c902a73e243b9293de4c717430f639/H1rWEdJtowN7Xh9vSnOvPU9I6y0AgT_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/b653a1545863d441cf9b8a8bc50946b8/SXcj1Zf8bWoyaoUEuzlFAc4gPNGwYm_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv, 69293 bytes
I 2022/06/09 11:04:42 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv, STACKING TIME = 1, PARSING TIME = 17
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/KuWudzfbRgcPhJ0rR2bxTuLmg9FckZXz83m8qPOg.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1987-beb71f216e96d1ff2d0f8231f5b8b975/44LVkvftLRcr5paF4enJfBFTe5mI2c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/547f1ff5b611dda3dcfb1f50cb05e5ad/dzsA6F9rZ81DMybNgmhZiptrofQ0el_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/IzTEOB8bn3M7E71d8AYa2bsODWbSqLS0zK860sB3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/lwUiAurIs3naTuH7TldsyolVQs4eCXVbdsavoGc8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/e8d4cd2c5dd1b2541a3c8325a6d1805f/W5gKGPvtkm5evOJ9devGYL935KMAdy_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/3cC32fOwUa9qSR0GQo7IG9efAeSLIuyCdOEvd2z3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/hrONj5485vAB6fFCtShQlMCxwLVF5P3pchVEPSxy.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/IG65jmiJyizSSz9EUnzO6d6CSWXUVnkZHW2Ddnk7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/cd4ae6bdbaea9fd9c9aff1d69f924bc4/5wErYoFwVfkciAfnpRbFIhPqv7tIC5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/l47N2CPqlXDTfmssaxGdQ40vxlyb18Dqez62eSYb.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/AuV1S0FxEyXHpP7VrGPLiPwjE8L0uODZsucREqz8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=eisenstein-sergei
I 2022/06/09 11:04:42 Fulltext indexing: bzm91m_26NP5 https://www.criterion.com/shop/browse/list?director=eisenstein-sergei
I 2022/06/09 11:04:42 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[bzm91m_26NP5 (1735154920032567296)]} 0 3
I 2022/06/09 11:04:42 SWITCHBOARD *Indexed 1217 words in URL https://www.criterion.com/shop/browse/list?director=eisenstein-sergei [bzm91m_26NP5]
Description: Sergei Eisenstein films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12728 bytes |
LinkStorageTime: 14 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:42 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv
I 2022/06/09 11:04:42 Fulltext indexing: bVK-Le_26NP5 https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv
I 2022/06/09 11:04:42 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[bVK-Le_26NP5 (1735154920049344512)]} 0 2
I 2022/06/09 11:04:42 SWITCHBOARD *Indexed 302 words in URL https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv [bVK-Le_26NP5]
Description: Once Upon a Time in China IV (1993) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2922 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=yates-peter, 224188 bytes
I 2022/06/09 11:04:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:42 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nicholson-jack, 224757 bytes
I 2022/06/09 11:04:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/4b7b707e5d1ca031e64894a0f7664f56/Ih7c7WkDI5YcytMafpe83E30nVygJD_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=yates-peter, STACKING TIME = 3, PARSING TIME = 130
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody, 127274 bytes
I 2022/06/09 11:04:43 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:43 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@1dd3358e[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wl(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM 
Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772677667}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wm(7.7.3):C12:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680783}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wn(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680861}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wo(7.7.3):C6:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772682978}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:43 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:43 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=nicholson-jack, STACKING TIME = 1, PARSING TIME = 129
I 2022/06/09 11:04:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 498, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody, STACKING TIME = 9, PARSING TIME = 24
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1864-79581df36332f7a1f027b81311c9e0f9/j8fKXLhpRU7m0dGwebBdwZLgrXaO26_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/films/5dab1d4b720b8dba2951246a6e579875/SfyfBq0uCWQjg71QHpsDa6kWq59vXR_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=yates-peter
I 2022/06/09 11:04:43 Fulltext indexing: ag4sKm_26NP5 https://www.criterion.com/shop/browse/list?director=yates-peter
I 2022/06/09 11:04:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ag4sKm_26NP5 (1735154921021374464)]} 0 4
I 2022/06/09 11:04:43 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=yates-peter [ag4sKm_26NP5]
Description: Peter Yates films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12473 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:43 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=nicholson-jack
I 2022/06/09 11:04:43 Fulltext indexing: aVkQcm_26NP5 https://www.criterion.com/shop/browse/list?director=nicholson-jack
I 2022/06/09 11:04:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[aVkQcm_26NP5 (1735154921095823360)]} 0 2
I 2022/06/09 11:04:43 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=nicholson-jack [aVkQcm_26NP5]
Description: Jack Nicholson films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12527 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:43 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=nyreroed-marie, 224129 bytes
I 2022/06/09 11:04:43 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 498, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:43 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody
I 2022/06/09 11:04:43 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=nyreroed-marie, STACKING TIME = 1, PARSING TIME = 106
I 2022/06/09 11:04:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/films/4caa477448c9fe2ee28f80df08f4d89b/NmCRkRglJzsgL3AKwNshj7ENlgQZIN_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:43 Fulltext indexing: aBJHDG_26NP5 https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody
I 2022/06/09 11:04:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[aBJHDG_26NP5 (1735154921353773056)]} 0 9
I 2022/06/09 11:04:43 SWITCHBOARD *Indexed 946 words in URL https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody [aBJHDG_26NP5]
Description: La Jetée: Unchained Melody | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14618 bytes |
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:43 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=nyreroed-marie
I 2022/06/09 11:04:43 Fulltext indexing: Z8DN1m_26NP5 https://www.criterion.com/shop/browse?director=nyreroed-marie
I 2022/06/09 11:04:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Z8DN1m_26NP5 (1735154921437659136)]} 0 4
I 2022/06/09 11:04:43 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse?director=nyreroed-marie [Z8DN1m_26NP5]
Description: Marie Nyreröd films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12398 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:43 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 498, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:43 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 498, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:43 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=misumi-kenji, 230595 bytes
I 2022/06/09 11:04:43 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=freeland-thornton, 224738 bytes
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=freeland-thornton, STACKING TIME = 2, PARSING TIME = 34
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/d0606bfda6ec74c0019bd85bbe973ae0/6DSu10XLoj9GjtPJZGAxaXPs113oQD_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 HostQueue forcing crawl-delay of 250 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 6)) = 250
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/shop/browse/list?director=misumi-kenji, STACKING TIME = 2, PARSING TIME = 151
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/bdccef4372055a6d7eba1d9e48d671e0/KbfBzW7rIf0GLzZ6mfOiZKBiAPrKFM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/06aae5d548f169e22473a1560b8af40b/C9uzPZ2an3M9AoDXjTI7aMH8KBOYDU_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/4bc990101d88c836ce459146bb0409c8/0RtHB019kbCekdVj2WNYnd3nxS9laR_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/8fa83fea9e1e866a11e31a00dbe58c97/L5ozj9f0PX4PQ864pbUt7PYTLfOySF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/3c9f09ce0317fdbf2199438da624ef26/wQrhudyvLgRCg2bvWKGyePtmhI6yh6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/235f87e883adc39c3c5c392aab084c4a/e4p57PTsISfbCCMcxSYyifYr5EEYhr_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/20189bedc7f5a0720311b6fb5413302f/tjxCpBXiJPhyzAIuy6ma0sBtZKpi9w_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/ca9546c2ceb1b830a1e5f190805fb6fd/21FyOwvaD7hVXctaQrX9RVy5muIn6D_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/24eeb2dee1e42ab06aaed3c486f00939/CnXoyCADKyul3CulQ9MSBa4MeB9mZX_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 HTCACHE storing content of url https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale, 78066 bytes
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale, STACKING TIME = 1, PARSING TIME = 15
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=freeland-thornton
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Zj20Cm_26NP5 (1735154922159079424)]} 0 2
I 2022/06/09 11:04:44 Fulltext indexing: Zj20Cm_26NP5 https://www.criterion.com/shop/browse/list?director=freeland-thornton
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=freeland-thornton [Zj20Cm_26NP5]
Description: Thornton Freeland films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12506 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=misumi-kenji
I 2022/06/09 11:04:44 Fulltext indexing: Z4Lgxm_26NP5 https://www.criterion.com/shop/browse/list?director=misumi-kenji
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Z4Lgxm_26NP5 (1735154922245062656)]} 0 7
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 1246 words in URL https://www.criterion.com/shop/browse/list?director=misumi-kenji [Z4Lgxm_26NP5]
Description: Kenji Misumi films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 13097 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:44 HTCACHE storing content of url https://www.criterion.com/films/28018-the-man-in-grey, 71256 bytes
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale
I 2022/06/09 11:04:44 HostQueue forcing crawl-delay of 251 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 502, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 0)) = 251
I 2022/06/09 11:04:44 Fulltext indexing: Zcv07G_26NP5 https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 61 LINKS FROM https://www.criterion.com/films/28018-the-man-in-grey, STACKING TIME = 73, PARSING TIME = 13
I 2022/06/09 11:04:44 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=klos-elmar, 224205 bytes
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ee3210abb4d599864dbe91e0ece052be.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/b13056ccb124d15b8d7516bac7576c8e/VduGrRfHEEl6sOgrkY5looQFWyjKvX_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Zcv07G_26NP5 (1735154922382426112)]} 0 19
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 594 words in URL https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale [Zcv07G_26NP5]
Description: A Tribute: A Canterbury Tale | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7802 bytes |
LinkStorageTime: 26 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/images/3913-13e0f359ffe35a4c4e0598e2e9db3246/madonnaof7moons_1432_003_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/d4d406ae48cd0fbfc78216e2efb5bf43/NEeX7phTkN3xjrd1deXb0RmokIXDG3_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/74be83402097d3f4c5d9e7331de31471/43YJwgdfANxgafSJNaTDUwGxR8Gait_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/4e7a58bca1bc1539f053cc08e0e1ca82.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9dd9777e1aa8821e6a74e257ccfd7348.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://itunes.apple.com/us/movie/id811526103?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1812-2bdeb818c107e95738f59894990c22b2/oTM1jWGYaWx6KHPFFGsXiyVmbdpCPi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/the-man-in-grey?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.amazon.com/dp/B00JP33FH8 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/bf29c35e8cf957012e14fd777d65aa52.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/27426a8ea03aee693162a016f3af1fb9/p7llBxrsJUQX2Ov6G91uPDpYYs0fWN_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/films/28018-the-man-in-grey
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZPBnPe_26NP5 (1735154922482040832)]} 0 1
I 2022/06/09 11:04:44 Fulltext indexing: ZPBnPe_26NP5 https://www.criterion.com/films/28018-the-man-in-grey
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 289 words in URL https://www.criterion.com/films/28018-the-man-in-grey [ZPBnPe_26NP5]
Description: The Man in Grey (1943) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3045 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/17a09b267d7df228c099117fcc503b0b/HEN7Igx0rZ7xS24SClFcPTBRs9HxSL_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=klos-elmar, STACKING TIME = 3, PARSING TIME = 43
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=klos-elmar
I 2022/06/09 11:04:44 Fulltext indexing: ZjADWm_26NP5 https://www.criterion.com/shop/browse/list?director=klos-elmar
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZjADWm_26NP5 (1735154922581655552)]} 0 2
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=klos-elmar [ZjADWm_26NP5]
Description: Elmar Klos films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12502 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:44 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 10)) = 246
I 2022/06/09 11:04:44 HTCACHE storing content of url https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx, 77603 bytes
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 68 LINKS FROM https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx, STACKING TIME = 2, PARSING TIME = 17
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/5639adc405d0157ff9fa02b1dedb6653.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/c438eda5cb94c4b6b8daa783749e4f2b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/24eeb2dee1e42ab06aaed3c486f00939/CnXoyCADKyul3CulQ9MSBa4MeB9mZX_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-baby-cart-at-the-river-styx?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.amazon.com/dp/B01M6EAKVM - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://itunes.apple.com/us/movie/id1169371082?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/c177fa8a7f4dfbc5dd12bf23b2332471.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/9c35734b41649852c32c085956e55337.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx
I 2022/06/09 11:04:44 Fulltext indexing: ZJnYce_26NP5 https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZJnYce_26NP5 (1735154922894131200)]} 0 1
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 343 words in URL https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx [ZJnYce_26NP5]
Description: Lone Wolf and Cub: Baby Cart at the River Styx (1972) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3869 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:44 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 10)) = 245
I 2022/06/09 11:04:45 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 11)) = 244
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello, 226510 bytes
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/224ad3784ebfb34f98b1af628337f3da/gf5q2Dxvw2rDGLoNCNOnF3L53EUKqK_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/c3671b7d05dd992c80de898da6f724a8/iKAAnLTwUhBFo0X62zBb8ijm258Sey_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio, 224779 bytes
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello, STACKING TIME = 5, PARSING TIME = 29
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/27426a8ea03aee693162a016f3af1fb9/p7llBxrsJUQX2Ov6G91uPDpYYs0fWN_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1803-6a8b76d7af61cbee31aced4a8191a85a/zKIrZpwHhlb5ETRsoB0UujdI9OwYFz_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/b59efc718fc3309a4eb76255310280b8/MpKeZ33lims6VNmUQPeUZ06u1HCgd5_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 13)) = 243
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio, STACKING TIME = 1, PARSING TIME = 59
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1976-ece4132f4abef8c4e7beb0a0edffc9a8/y26UyQwNxt4FguJSgQIZWpCNlLsjHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/89594ab78e17a9778dc78f275076d760/kADie75znXN9EHJ7qBhLrNwch9t918_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello
I 2022/06/09 11:04:45 Fulltext indexing: ZJj0am_26NP5 https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello
I 2022/06/09 11:04:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZJj0am_26NP5 (1735154923591434240)]} 0 3
I 2022/06/09 11:04:45 SWITCHBOARD *Indexed 1210 words in URL https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello [ZJj0am_26NP5]
Description: Raffaello Matarazzo films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12707 bytes |
LinkStorageTime: 90 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/current/posts/663-trafic-watching-the-wheels, 80500 bytes
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/663-trafic-watching-the-wheels, STACKING TIME = 4, PARSING TIME = 18
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/images/4352-1cf1f70c7926a0f997ce964c45e36f81/img_current_545_007_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 10)) = 246
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters, 64277 bytes
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters, STACKING TIME = 8, PARSING TIME = 11
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://player.vimeo.com/video/321824302 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/66ba5fe958ad045a4f5b5ebe54570c97/A3LCcKjo5itwNBka6dKwl6td7sSpB0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio
I 2022/06/09 11:04:45 Fulltext indexing: ZFuk9m_26NP5 https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio
I 2022/06/09 11:04:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZFuk9m_26NP5 (1735154923820023808)]} 0 2
I 2022/06/09 11:04:45 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio [ZFuk9m_26NP5]
Description: Emilio Gómez Muriel films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12526 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:45 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/663-trafic-watching-the-wheels
I 2022/06/09 11:04:45 Fulltext indexing: Y_9D5G_26NP5 https://www.criterion.com/current/posts/663-trafic-watching-the-wheels
I 2022/06/09 11:04:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Y_9D5G_26NP5 (1735154923917541376)]} 0 7
I 2022/06/09 11:04:45 SWITCHBOARD *Indexed 1063 words in URL https://www.criterion.com/current/posts/663-trafic-watching-the-wheels [Y_9D5G_26NP5]
Description: Trafic: Watching the Wheels | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14877 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:45 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters
I 2022/06/09 11:04:45 Fulltext indexing: YrB28G_26NP5 https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters
I 2022/06/09 11:04:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[YrB28G_26NP5 (1735154923940610048)]} 0 1
I 2022/06/09 11:04:45 SWITCHBOARD *Indexed 317 words in URL https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters [YrB28G_26NP5]
Description: Robert Zemeckis Looks Back on His Debut-Film Jitters | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3721 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction, 77676 bytes
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 77 LINKS FROM https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction, STACKING TIME = 3, PARSING TIME = 7
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/38-modell-av-dramatiska-teatern-skala-1-50?locale=en-US - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/6fede1f031c07b843ffa8965d47043f3/9QWkE37UXlpfhZrTIsaZHdWmooGJ1a_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/262-glasskivor-tio-stycken-till-laterna-magica - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_filmprojector.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/74A-papperskorg?locale=en-US - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/311-filmpris-golden-globe-award-for-hostsonaten-1978 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/254-fotografi-john-bryson-fotografi - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_JAWS.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/bukowskis/status/4454678676 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/270-schackpjaser-31-stycken - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/261-laterna-magica-lapierre-paris-ca-1870%20 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman-auction-wastebasket.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_header.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/4caa477448c9fe2ee28f80df08f4d89b/NmCRkRglJzsgL3AKwNshj7ENlgQZIN_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/33-sprattelgubbe?locale=en-US - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_chess_set.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/95d0fe890da5c6008298dc39ca2195b4/oTvnw5EnwHQLpwttOLxX4yAOWY7o0j_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_5.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/263-filmprojektor-1920-tal-e-marland-ab-stockholm - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/261-laterna-magica-lapierre-paris-ca-1870?locale=en-US - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://news.bbc.co.uk/2/hi/europe/8280740.stm - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_theatre_two-up.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_jumping_jack.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 501, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:46 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction
I 2022/06/09 11:04:46 Fulltext indexing: YrBCbG_26NP5 https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction
I 2022/06/09 11:04:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[YrBCbG_26NP5 (1735154924016107520)]} 0 2
I 2022/06/09 11:04:46 SWITCHBOARD *Indexed 336 words in URL https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction [YrBCbG_26NP5]
Description: Ingmar Bergman’s Belongings Sold at Auction | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3975 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:46 HTCACHE storing content of url https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white, 73772 bytes
I 2022/06/09 11:04:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 496, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:46 SWITCHBOARD CRAWL: ADDED 80 LINKS FROM https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white, STACKING TIME = 5, PARSING TIME = 8
I 2022/06/09 11:04:46 REJECTED https://www.screendaily.com/reviews/ash-is-purest-white-cannes-review/5129220.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.thewrap.com/ash-purest-white-film-review-characters-growing-pains-china/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://film.avclub.com/jean-luc-godard-returns-to-cannes-to-make-a-dunce-out-o-1825979305 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-cannes-diary-ash-is-purest-white-cold-war-20180512-htmlstory.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-5-changless-change-jean-luc-godard-and-jia-zhangke - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-cannes-day-four/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://variety.com/2018/film/asia/jia-zhangke-making-his-most-expensive-indie-film-ash-is-purest-white-1202805661/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://thefilmstage.com/reviews/cannes-review-with-ash-is-purest-white-jia-zhangke-stages-another-exceptional-platform-for-zhao-tao/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/ash-purest-white-jia-zhangke-zhao-tao-magisterial-mob-critique - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.theguardian.com/film/2018/may/11/ash-is-purest-white-review-chinese-gangsters-girlfriend-saga-burns-bright - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://deadline.com/2018/05/cannes-buzz-films-girls-of-the-sun-and-ash-is-purest-white-set-for-us-distribution-by-cohen-media-group-1202394690/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://criterion-production.s3.amazonaws.com/WK9FBxqSEA7Dlce2yWlaEbLstGf5jx.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.hollywoodreporter.com/review/ash-is-purest-white-cannes-2018-1111288 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://lwlies.com/festivals/ash-is-purest-white-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://www.indiewire.com/2018/05/ash-is-purest-white-review-jia-zhangke-cannes-2018-1201963491/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://cine-vue.com/2018/05/cannes-2018-ash-is-purest-white-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://www.vulture.com/2018/05/ash-is-purest-white-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://icsfilm.org/reviews/cannes-2018-review-ash-purest-white-jia-zhangke/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.filmcomment.com/blog/film-week-ash-purest-white/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.ioncinema.com/reviews/jia-zhangke-ash-is-purest-white-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.youtube.com/embed/Xr7B-GhQaTM?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://desistfilm.com/cannes-2018-ash-is-purest-white-by-jia-zhang-ke/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://variety.com/2018/film/reviews/ash-is-purest-white-review-1202802929/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.festival-cannes.com/en/festival/films/jiang-hu-er-nv - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.vanityfair.com/hollywood/2018/05/the-angel-ash-is-purest-white-cannes-movie-reviews - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.rogerebert.com/cannes/cannes-2018-ash-is-the-purest-white-girls-of-the-sun-girl - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://theplaylist.net/ash-purest-white-cannes-review-20180515/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354268 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 SWITCHBOARD Excluded 23 words in URL https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white
I 2022/06/09 11:04:46 Fulltext indexing: YqqKcG_26NP5 https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white
I 2022/06/09 11:04:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[YqqKcG_26NP5 (1735154924372623360)]} 0 2
I 2022/06/09 11:04:46 SWITCHBOARD *Indexed 482 words in URL https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white [YqqKcG_26NP5]
Description: Jia Zhangke’s Ash Is Purest White | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5647 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 496, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:46 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1, 224282 bytes
I 2022/06/09 11:04:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:46 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1, STACKING TIME = 7, PARSING TIME = 29
I 2022/06/09 11:04:46 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/films/bed1dc8df02842d6a75325665e718ebd/da8xTBLVhcfx0KQXSyOOMImKRe6s2r_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1
I 2022/06/09 11:04:46 Fulltext indexing: YZXxhm_26NP5 https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1
I 2022/06/09 11:04:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[YZXxhm_26NP5 (1735154924884328448)]} 0 3
I 2022/06/09 11:04:46 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1 [YZXxhm_26NP5]
Description: Margarethe von Trotta films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12501 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:47 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:47 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu, 224188 bytes
I 2022/06/09 11:04:47 HTCACHE storing content of url https://www.criterion.com/current/posts/1231-mayerling-star-crossed, 84641 bytes
I 2022/06/09 11:04:47 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu, STACKING TIME = 1, PARSING TIME = 25
I 2022/06/09 11:04:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/726755430bd298a5aa424f68a792bcea/aQ0KQhoip19olkpwhmNbrAfMY6qhAB_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/1231-mayerling-star-crossed, STACKING TIME = 2, PARSING TIME = 81
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/babb52f89b5488d00cb76a924d7e06eb/XPxkbzNVy36iDfcGUxaqpxFC6LJ0tI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/images/4467-3a447d1c16dc2d86db906fc2a056e122/current_553_014_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu
I 2022/06/09 11:04:47 Fulltext indexing: X87vrm_26NP5 https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu
I 2022/06/09 11:04:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[X87vrm_26NP5 (1735154925276495872)]} 0 6
I 2022/06/09 11:04:47 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu [X87vrm_26NP5]
Description: Mathieu Kassovitz films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12488 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:47 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 491, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:47 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/1231-mayerling-star-crossed
I 2022/06/09 11:04:47 Fulltext indexing: X10F5G_26NP5 https://www.criterion.com/current/posts/1231-mayerling-star-crossed
I 2022/06/09 11:04:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[X10F5G_26NP5 (1735154925323681792)]} 0 2
I 2022/06/09 11:04:47 SWITCHBOARD *Indexed 527 words in URL https://www.criterion.com/current/posts/1231-mayerling-star-crossed [X10F5G_26NP5]
Description: Mayerling: Star-Crossed | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6195 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:47 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=tennyson-pen, 224751 bytes
I 2022/06/09 11:04:47 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=tennyson-pen, STACKING TIME = 2, PARSING TIME = 23
I 2022/06/09 11:04:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/5b83c252b600e87cb7d663f2b1d1ac8d/F8v9OxoNple8ycZ2VntXSIFAXCg8pJ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 490, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:47 HTCACHE storing content of url https://www.criterion.com/films/354, 79512 bytes
I 2022/06/09 11:04:47 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=tennyson-pen
I 2022/06/09 11:04:47 SWITCHBOARD CRAWL: ADDED 72 LINKS FROM https://www.criterion.com/films/354, STACKING TIME = 3, PARSING TIME = 19
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/b653a1545863d441cf9b8a8bc50946b8/SXcj1Zf8bWoyaoUEuzlFAc4gPNGwYm_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/ivan-the-terrible-part-ii?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1838-ee328a31205b114ef125fd81b54b5cd0/VZGhEsbGQY3luUNqMc64IKmXGoRe9U_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/88b2556fa888690aab792e7454a6fe26.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 Fulltext indexing: XyrSom_26NP5 https://www.criterion.com/shop/browse/list?director=tennyson-pen
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/82523d90468b398c9e487fdf969d36d6.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/53a8103ba77901a31cb565c8bf2c7338.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[XyrSom_26NP5 (1735154925698023424)]} 0 4
I 2022/06/09 11:04:47 SWITCHBOARD *Indexed 1199 words in URL https://www.criterion.com/shop/browse/list?director=tennyson-pen [XyrSom_26NP5]
Description: Pen Tennyson films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12523 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/6918d33a17ccf9cdc1667c362121d593.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/0870d8add1b5719448d1f445679e503a.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/1068-194f3f49c166adeecb9d66968442e517/jGwTz0i0KByW1oG0AH1KWNjVFgbkXE_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/posts/1131-e236cb8c87ec5b809c2301982224d1ec/IVAN_rosenbaum_still_1_original.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/55b413192f2d6d855cffb078b25156f4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/b1039bd526df1e78680c71b08daa9071.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e27c982cba14c95724fdb5c647e63ded.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/76cd8b50aedb0c256bf117124487f494.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/19c902a73e243b9293de4c717430f639/H1rWEdJtowN7Xh9vSnOvPU9I6y0AgT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/aed45dbeaf63414624b16890eb458dea/fSaXGJq2BBkowhHXw2FJft0UoNIdnI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/b49b46aa1dacc0e1530a24cf23c764b4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/32d0f143682b34732d0fdd3ed8e0e7bb/fBklqTm33Jg32mLL79K5I4cZHSt9cE_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/c651b027da04c8a0a1553975e3098af2.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5626-/ai8UvErjxFGByr17RCrqAnwWw7xvRj_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/76b270cd5220e804477fd41e9c907d3b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/films/354
I 2022/06/09 11:04:47 Fulltext indexing: XC-kYe_26NP5 https://www.criterion.com/films/354
I 2022/06/09 11:04:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[XC-kYe_26NP5 (1735154925746257920)]} 0 3
I 2022/06/09 11:04:47 SWITCHBOARD *Indexed 371 words in URL https://www.criterion.com/films/354 [XC-kYe_26NP5]
Description: Ivan the Terrible, Part II (1958) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4329 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:47 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:48 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:48 HTCACHE storing content of url https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine, 64340 bytes
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine, STACKING TIME = 8, PARSING TIME = 20
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://player.vimeo.com/video/271504473 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/films/78a86bc12fbed832f0b341609a22fa52/lun1ptGstEhxOhmc24pORDXEVkN2Ve_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine
I 2022/06/09 11:04:48 Fulltext indexing: WHtrsG_26NP5 https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[WHtrsG_26NP5 (1735154926169882624)]} 0 2
I 2022/06/09 11:04:48 SWITCHBOARD *Indexed 348 words in URL https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine [WHtrsG_26NP5]
Description: The Hope That Fueled Bowling for Columbine | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3913 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:48 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:48 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=brook-peter, 224668 bytes
I 2022/06/09 11:04:48 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 9)) = 241
I 2022/06/09 11:04:48 HTCACHE storing content of url https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law, 85182 bytes
I 2022/06/09 11:04:48 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=brook-peter, STACKING TIME = 3, PARSING TIME = 94
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1865-2b74f037f454df1d78013f06dc4aaea4/0TUeLtsha8fzPrVMeQ8rNOnpUVmvME_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/films/3c8e34ba4a541897737232a90611f947/uMJyxowOApuQ9O1hh2wLe26pqL14z1_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law, STACKING TIME = 5, PARSING TIME = 20
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://criterion-production.s3.amazonaws.com/wkjM3lyFlBq2Xwm4CbquxntstEQKIL.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://criterion-production.s3.amazonaws.com/VpkypwbMpTd43ooIA0Vvm3oz1Tum8Y.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://criterion-production.s3.amazonaws.com/A5OPNRaWvdavhX6UYpXnvGUGMTzAeL.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/films/d3fa0bd2e5949b9e3c861222fb594d95/ASnRN4Kj6AdEv8RJTDrHKhIve2ZFQY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6095-/4Fs4E8gQdXRg8bDqotCILcvvsJDWRu_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@417da17a[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wl(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM 
Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772677667}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wm(7.7.3):C12:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680783}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wn(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680861}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wo(7.7.3):C6:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772682978}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wp(7.7.3):C22:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772688229}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:48 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=brook-peter
I 2022/06/09 11:04:48 Fulltext indexing: W0frrm_26NP5 https://www.criterion.com/shop/browse/list?director=brook-peter
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[W0frrm_26NP5 (1735154926541078528)]} 0 7
I 2022/06/09 11:04:48 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse/list?director=brook-peter [W0frrm_26NP5]
Description: Peter Brook films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12492 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:48 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law
I 2022/06/09 11:04:48 Fulltext indexing: WDcdcG_26NP5 https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[WDcdcG_26NP5 (1735154926636498944)]} 0 4
I 2022/06/09 11:04:48 SWITCHBOARD *Indexed 1045 words in URL https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law [WDcdcG_26NP5]
Description: A Dry White Season: Justice Against the Law | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 17143 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:48 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 483, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:48 HTCACHE storing content of url https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season, 68433 bytes
I 2022/06/09 11:04:48 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:04:48 REJECTED https://player.vimeo.com/video/304438092 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/28473-werner-herzog-eats-his-shoe
I 2022/06/09 11:04:48 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
I 2022/06/09 11:04:48 REJECTED https://www.criterion.com/films/28473-werner-herzog-eats-his-shoe - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/28473-werner-herzog-eats-his-shoe aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:04:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/films/d3fa0bd2e5949b9e3c861222fb594d95/ASnRN4Kj6AdEv8RJTDrHKhIve2ZFQY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season
I 2022/06/09 11:04:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[V9xhxe_26NP5 (1735154926815805440)]} 0 11
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[WBRjMG_26NP5 (1735154926836776960)]} 0 1
I 2022/06/09 11:04:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 Fulltext indexing: WBRjMG_26NP5 https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season
I 2022/06/09 11:04:48 SWITCHBOARD *Indexed 312 words in URL https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season [WBRjMG_26NP5]
Description: Euzhan Palcy Remembers Brando's Nerves on the Set of A Dry White Season | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3724 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:48 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 480, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:49 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 480, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:49 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 480, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:49 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lo, 225147 bytes
I 2022/06/09 11:04:49 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://s3.amazonaws.com/criterion-production/films/028d5306fb6147b73970b738eb19a93a/HQcvC6MhrRZyx3VN4DHa77hZjWiOjk_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=lo, STACKING TIME = 3, PARSING TIME = 33
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://s3.amazonaws.com/criterion-production/films/e8d4cd2c5dd1b2541a3c8325a6d1805f/W5gKGPvtkm5evOJ9devGYL935KMAdy_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1957-f90d4c48a2f932ffe7df386499f9477e/73k4EkSiXEfsdi097fieFBGdb39vlg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 HostQueue forcing crawl-delay of 233 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 483, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 17)) = 233
I 2022/06/09 11:04:49 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=lo
I 2022/06/09 11:04:49 Fulltext indexing: V8su2m_26NP5 https://www.criterion.com/shop/browse/list?director=lo
I 2022/06/09 11:04:49 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[V8su2m_26NP5 (1735154927844458496)]} 0 3
I 2022/06/09 11:04:49 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=lo [V8su2m_26NP5]
Description: Lo Wei films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12502 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:49 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=zinnemann-fred, 224766 bytes
I 2022/06/09 11:04:49 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 486, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:49 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=zinnemann-fred, STACKING TIME = 4, PARSING TIME = 69
I 2022/06/09 11:04:49 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1976-ece4132f4abef8c4e7beb0a0edffc9a8/y26UyQwNxt4FguJSgQIZWpCNlLsjHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/89594ab78e17a9778dc78f275076d760/kADie75znXN9EHJ7qBhLrNwch9t918_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=chukhrai-grigori, 224800 bytes
I 2022/06/09 11:04:50 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 490, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:50 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=chukhrai-grigori, STACKING TIME = 1, PARSING TIME = 45
I 2022/06/09 11:04:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/f89499b3cbca9503a0cf83ecba01142f/LlDUqJCmbsiL8xBf409dwhXIFKOKBt_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=zinnemann-fred
I 2022/06/09 11:04:50 Fulltext indexing: Vc_HQm_26NP5 https://www.criterion.com/shop/browse/list?director=zinnemann-fred
I 2022/06/09 11:04:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Vc_HQm_26NP5 (1735154928327852032)]} 0 2
I 2022/06/09 11:04:50 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=zinnemann-fred [Vc_HQm_26NP5]
Description: Fred Zinnemann films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12520 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:50 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=chukhrai-grigori
I 2022/06/09 11:04:50 Fulltext indexing: Uu2nwm_26NP5 https://www.criterion.com/shop/browse/list?director=chukhrai-grigori
I 2022/06/09 11:04:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Uu2nwm_26NP5 (1735154928394960896)]} 0 2
I 2022/06/09 11:04:50 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=chukhrai-grigori [Uu2nwm_26NP5]
Description: Grigori Chukhrai films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12532 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:50 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 490, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:50 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ophuls-max, 225801 bytes
I 2022/06/09 11:04:50 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=ophuls-max, STACKING TIME = 2, PARSING TIME = 81
I 2022/06/09 11:04:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/e5c7435a9dbc4d547966c42139b17e05/CIDQTwh6cAejzHXtCFcPu4on6pTik2_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/2d9f65a009ae0df30f7268f5cad30602/hCVpEfIN7DST5IptZEPGxHXn1hTR9M_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/a536a3fb3af463edd2e64152b9661f5c/vwUO7GHWr8ltPBhvvDbO51GIRnqIDL_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/1b0975e10aabfa4e04861a3c490964d7/Lu59y3u3gBwDYBDmvQripKKH1K9bTA_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 496, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:50 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=ophuls-max
I 2022/06/09 11:04:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[UuT3nm_26NP5 (1735154928798662656)]} 0 2
I 2022/06/09 11:04:50 Fulltext indexing: UuT3nm_26NP5 https://www.criterion.com/shop/browse/list?director=ophuls-max
I 2022/06/09 11:04:50 SWITCHBOARD *Indexed 1206 words in URL https://www.criterion.com/shop/browse/list?director=ophuls-max [UuT3nm_26NP5]
Description: Max Ophuls films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12625 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:50 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=weill-claudia, 224184 bytes
I 2022/06/09 11:04:50 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=weill-claudia, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/bf7dd923004d069192e6cf732dd51e1f/x7ZTLqEHWNMZWFdmuX89ePaJ9aZNfN_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 HTCACHE storing content of url https://www.criterion.com/current/posts/2596-following-nolan-begins, 77618 bytes
I 2022/06/09 11:04:50 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 503, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 4)) = 247
I 2022/06/09 11:04:50 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2596-following-nolan-begins, STACKING TIME = 2, PARSING TIME = 221
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/images/5052-29798d52bd807cc071b0cb5bf35c99be/Following_Essay_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/872420cba7088d5f5f64157663c6c2c5/PWMlxSDrb4crZTImCxrDbFsxYZMb4k_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:50 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=weill-claudia
I 2022/06/09 11:04:50 Fulltext indexing: Unf76m_26NP5 https://www.criterion.com/shop/browse/list?director=weill-claudia
I 2022/06/09 11:04:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Unf76m_26NP5 (1735154929203412992)]} 0 2
I 2022/06/09 11:04:50 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=weill-claudia [Unf76m_26NP5]
Description: Claudia Weill films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12491 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/2596-following-nolan-begins
I 2022/06/09 11:04:51 Fulltext indexing: Uc4rLG_26NP5 https://www.criterion.com/current/posts/2596-following-nolan-begins
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Uc4rLG_26NP5 (1735154929276813312)]} 0 3
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 942 words in URL https://www.criterion.com/current/posts/2596-following-nolan-begins [Uc4rLG_26NP5]
Description: Following: Nolan Begins | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12627 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=pearce-leslie, 224219 bytes
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history, 160126 bytes
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=pearce-leslie, STACKING TIME = 1, PARSING TIME = 33
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/3a4a52811b630a9836c1b10cb2c55a38/1DZVBE8PnMfkggyvh5s9f7K2TSAiF0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 507, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 10)) = 243
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history, STACKING TIME = 6, PARSING TIME = 105
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films, 74997 bytes
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 50 LINKS FROM https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films, STACKING TIME = 2, PARSING TIME = 13
I 2022/06/09 11:04:51 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/3e0bdd8b1538100c61a1c69840258f0d.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1880-d84b68ccfdfdb7ef19d89946ab43b5cb/XxW9C5BcXk4DjYmbLD9UBavXz6xwa0_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/55540be3417fc8d3aca18151f48009d7/1wrYMIHOtZzcuJuYju1L7NYD14iLdR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/2e140cbe7177d67fd312f8acdea2a4d4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/4d12d0b6523c10060860b2695f32672f/KmC0p8LFp4dAfgyLalLkQRoZgakcY3_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/b5de3107dbc1cac694c7581d787b3cd8/0EwZ3HapM3kv8NAIbGAzViVdQBbJHS_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=pearce-leslie
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5d62080f96d3e0ca24d41d44c1375a37.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[UPFfrm_26NP5 (1735154929557831680)]} 0 2
I 2022/06/09 11:04:51 Fulltext indexing: UPFfrm_26NP5 https://www.criterion.com/shop/browse/list?director=pearce-leslie
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=pearce-leslie [UPFfrm_26NP5]
Description: Leslie Pearce films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12492 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:51 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 506, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 11)) = 242
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood, 75563 bytes
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 69 LINKS FROM https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood, STACKING TIME = 6, PARSING TIME = 9
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.nytimes.com/2019/05/21/movies/quentin-tarantino-once-upon-a-time-in-hollywood.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/once-upon-time-hollywood-quentin-tarantino-1960s-golden-age-meta-movie-charles-manson-sharon-tate-murders - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://criterion-production.s3.amazonaws.com/vDHqMO5u7IuFHEwOt7sBKVKcZtGu5g.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-8-tarantino-s-hollywood-elegy-and-bdsm-mourning - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.festival-cannes.com/en/festival/films/once-upon-a-time-in-hollywood - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED http://www.impawards.com/2019/once_upon_a_time_in_hollywood_ver5.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.telegraph.co.uk/films/0/upon-time-hollywood-review-tarantinos-ode-pre-manson-la-pure/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.rogerebert.com/cannes/cannes-2019-once-upon-a-time-in-hollywood - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED http://www.impawards.com/2019/once_upon_a_time_in_hollywood_ver6.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://time.com/5593402/once-upon-a-time-in-hollywood-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.esquire.com/entertainment/movies/a27458589/once-upon-a-time-in-hollywood-leonardo-dicaprio-brad-pitt-quentin-tarantino-interview/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://theplaylist.net/once-upon-time-in-hollywood-cannes-review-20190521/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://twitter.com/OnceInHollywood/status/1130414077484777472 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED http://www.impawards.com/2019/once_upon_a_time_in_hollywood_ver4.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.vulture.com/2019/05/once-upon-a-time-in-hollywood-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-quentin-tarantino-once-upon-a-time-in-hollywood-20190521-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/embed/ELeMaP8EPAA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history
I 2022/06/09 11:04:51 Fulltext indexing: UO6KaG_26NP5 https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[UO6KaG_26NP5 (1735154929796907008)]} 0 14
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 1343 words in URL https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history [UO6KaG_26NP5]
Description: Sweet Death: Veronika Voss Production History | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 21952 bytes |
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 23 words in URL https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[UAhyS3_26NP5 (1735154929838850048)]} 0 2
I 2022/06/09 11:04:51 Fulltext indexing: UAhyS3_26NP5 https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 414 words in URL https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films [UAhyS3_26NP5]
Description: André Gregory & Wallace Shawn: 3 Films | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 8508 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood
I 2022/06/09 11:04:51 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 502, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
I 2022/06/09 11:04:51 Fulltext indexing: T4pMJG_26NP5 https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[T4pMJG_26NP5 (1735154929912250368)]} 0 3
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 662 words in URL https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood [T4pMJG_26NP5]
Description: Quentin Tarantino’s Once Upon a Time . . . in Hollywood | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 8527 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=allen-lewis, 224135 bytes
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=allen-lewis, STACKING TIME = 2, PARSING TIME = 31
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/9a47380e81bad08a322032a14158be83/8ftdZ1FybsRNtjEkE0lr8zYDcdx8lQ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 501, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura, 70112 bytes
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=allen-lewis
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura, STACKING TIME = 1, PARSING TIME = 16
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/images/8015-990bc7db67bf5bf3c2bd807e835c9113/carmen_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 Fulltext indexing: TsLKim_26NP5 https://www.criterion.com/shop/browse/list?director=allen-lewis
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/b93ed3824aabf3784387991675dde82c/J2DKHtUkEGO5iYbKWnV7TIatSWH6x4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/embed/ao4pKJxhZQQ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[TsLKim_26NP5 (1735154930280300544)]} 0 12
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=allen-lewis [TsLKim_26NP5]
Description: Lewis Allen films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12453 bytes |
LinkStorageTime: 14 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:52 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura
I 2022/06/09 11:04:52 Fulltext indexing: Tk3xkG_26NP5 https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Tk3xkG_26NP5 (1735154930308612096)]} 0 1
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 294 words in URL https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura [Tk3xkG_26NP5]
Description: Laughing and Crying With Carmen Maura | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3411 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:52 HTCACHE storing content of url https://www.criterion.com/boxsets/204-eisenstein-the-sound-years, 69909 bytes
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/boxsets/204-eisenstein-the-sound-years, STACKING TIME = 4, PARSING TIME = 13
I 2022/06/09 11:04:52 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/be0e802e835ac927eaf3be41589e23e0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1838-ee328a31205b114ef125fd81b54b5cd0/VZGhEsbGQY3luUNqMc64IKmXGoRe9U_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/films/19c902a73e243b9293de4c717430f639/H1rWEdJtowN7Xh9vSnOvPU9I6y0AgT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/films/aed45dbeaf63414624b16890eb458dea/fSaXGJq2BBkowhHXw2FJft0UoNIdnI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/films/b653a1545863d441cf9b8a8bc50946b8/SXcj1Zf8bWoyaoUEuzlFAc4gPNGwYm_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/a4754e30730b14f21ea09489a09de732.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/76b270cd5220e804477fd41e9c907d3b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/boxsets/204-eisenstein-the-sound-years
I 2022/06/09 11:04:52 Fulltext indexing: TSqfP3_26NP5 https://www.criterion.com/boxsets/204-eisenstein-the-sound-years
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[TSqfP3_26NP5 (1735154930553978880)]} 0 2
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 321 words in URL https://www.criterion.com/boxsets/204-eisenstein-the-sound-years [TSqfP3_26NP5]
Description: Eisenstein: The Sound Years | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5861 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:52 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:52 HTCACHE storing content of url https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti, 109846 bytes
I 2022/06/09 11:04:52 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti, STACKING TIME = 1, PARSING TIME = 8
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/images/4415-66fa9593ad41744878169db79b52b613/294_017_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti
I 2022/06/09 11:04:52 HTCACHE storing content of url https://www.criterion.com/current/posts/2079-arigato, 65630 bytes
I 2022/06/09 11:04:52 Fulltext indexing: S5oIHG_26NP5 https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti
I 2022/06/09 11:04:52 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/2079-arigato, STACKING TIME = 5, PARSING TIME = 6
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7231-/HQynBbsBpEpKg7rUCoGXwscnLL6b2Q_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7494-/aR8GPv9YcXKRW9YgcQatuPCI686YSi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/tout_image/7325-/8uSYdbbznLaIIN2ALLVLIx23JCp33f_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7615-/9oqwSPz7K69O93w5M2OzoGcJVHwTYh_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[S5oIHG_26NP5 (1735154930904203264)]} 0 9
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 737 words in URL https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti [S5oIHG_26NP5]
Description: L’eclisse: Antonioni and Vitti | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 11017 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/films/e62f1e61b5f52c2aaeabaeceaf58b629/BenTqN2hpuF2PKWN8v0M0BZkEMiLAM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.youtube.com/embed/dUOR7M0HpAE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:52 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/2079-arigato
I 2022/06/09 11:04:52 Fulltext indexing: S3xUSG_26NP5 https://www.criterion.com/current/posts/2079-arigato
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[S3xUSG_26NP5 (1735154930932514816)]} 0 1
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 190 words in URL https://www.criterion.com/current/posts/2079-arigato [S3xUSG_26NP5]
Description: Arigato! | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2241 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:52 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:52 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bennet-spencer-g, 224746 bytes
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/current/posts/611-the-milky-way-easy-striders, 115423 bytes
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bennet-spencer-g, STACKING TIME = 1, PARSING TIME = 28
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/films/ed3e04be1a68ad7e7438a4a16d69a556/a268Pr0Wtb1F3INlbaTesMbAGJcxGg_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1874-1783a5b7ab030641aab494f13c026174/ahGfceaQ9idYzYJHUy4cVjH4aRnxZo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/611-the-milky-way-easy-striders, STACKING TIME = 1, PARSING TIME = 27
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 483, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 6)) = 244
I 2022/06/09 11:04:53 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bennet-spencer-g
I 2022/06/09 11:04:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Skcohm_26NP5 (1735154931578437632)]} 0 2
I 2022/06/09 11:04:53 Fulltext indexing: Skcohm_26NP5 https://www.criterion.com/shop/browse/list?director=bennet-spencer-g
I 2022/06/09 11:04:53 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse/list?director=bennet-spencer-g [Skcohm_26NP5]
Description: Spencer G. Bennet films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12516 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:53 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/611-the-milky-way-easy-striders
I 2022/06/09 11:04:53 Fulltext indexing: SXa_zG_26NP5 https://www.criterion.com/current/posts/611-the-milky-way-easy-striders
I 2022/06/09 11:04:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SXa_zG_26NP5 (1735154931672809472)]} 0 3
I 2022/06/09 11:04:53 SWITCHBOARD *Indexed 914 words in URL https://www.criterion.com/current/posts/611-the-milky-way-easy-striders [SXa_zG_26NP5]
Description: The Milky Way: Easy Striders | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 13063 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:53 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 483, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled, 79303 bytes
I 2022/06/09 11:04:53 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/images/5140-33579d22044380859d64f2d1e3034fb9/current_376_007_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled, STACKING TIME = 6, PARSING TIME = 17
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/films/3433fcaf58b444a93922cf7375557b22/KKqwnBZIwSyvSOCP8e96G1O9fbPe10_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled
I 2022/06/09 11:04:53 Fulltext indexing: SVm8mG_26NP5 https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled
I 2022/06/09 11:04:53 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SVm8mG_26NP5 (1735154932004159488)]} 0 5
I 2022/06/09 11:04:53 SWITCHBOARD *Indexed 946 words in URL https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled [SVm8mG_26NP5]
Description: The Life of Oharu: Not Reconciled | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14276 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:53 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@54dd07d7[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wl(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772677667}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wm(7.7.3):C12:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680783}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wn(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680861}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wo(7.7.3):C6:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772682978}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wp(7.7.3):C22:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772688229}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wq(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772693592}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:53 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard, 85227 bytes
I 2022/06/09 11:04:53 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 477, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 86 LINKS FROM https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard, STACKING TIME = 3, PARSING TIME = 14
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/festivals/cannes-fire-will-come-oliver-laxe-classicism-avant-guard-egos-1203223235/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.hollywoodreporter.com/review/climb-review-1211195 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/joan-arc-bruno-dumont-am-dram-trial - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/B38sjPKTm3o?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/chambre-212 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/festivals/karim-ainouz-cannes-un-certain-regard-the-invisible-life-1203223390/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://criterion-production.s3.amazonaws.com/IVsBX8S2Ze1yYGqZ0U1qgVqOhgerH6.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/reviews/a-brothers-love-review-1203215650/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://filmmakermagazine.com/107572-cannes-2019-dispatch-5-fire-will-come-tommaso/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.hollywoodreporter.com/review/a-magical-night-review-1212121 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://filmmakermagazine.com/107565-cannes-2019-dispatch-4-lux-aeterna-jeanne-young-ahmed/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.ioncinema.com/reviews/christophe-honore-chambre-212-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/festivals/brazils-invisible-life-of-euridice-gusmao-wins-cannes-un-certain-regard-award-1203225505/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/izUIhIj10HA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.rogerebert.com/cannes/cannes-2019-family-romance-llc-the-climb-the-invisible-life-of-euridice-gusmao - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.hollywoodreporter.com/review/fire-will-come-cannes-2019-1213080 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/GCGkZ92cpcg?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/jeanne - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/a-vida-invisivel-de-euridice-gusmao - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/liberte-1 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/8fBBGRT9ga0?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/IHW9ByMtfpI?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://filmmakermagazine.com/107547-cannes-2019-dispatch-3-little-joe-liberte/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/hCgbSE9tzEE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.marthabatalha.com/en/home/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/la-femme-de-mon-frere - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/reviews/cannes-film-review-the-invisible-life-of-euridice-gusmao-1203225913/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.hollywoodreporter.com/news/cannes-hidden-gem-invisible-life-captures-female-life-rio-de-janeiro-1211128 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.anothergaze.com/monia-chokris-la-femme-de-mon-frere-brothers-love-ventures-beyond-hellscape-self-cannes/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-11-vulgar-confessions - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.screendaily.com/reviews/joan-of-arc-cannes-review/5139228.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/the-climb - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/o-que-arde - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard
I 2022/06/09 11:04:53 Fulltext indexing: SSTLUG_26NP5 https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard
I 2022/06/09 11:04:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SSTLUG_26NP5 (1735154932195000320)]} 0 10
I 2022/06/09 11:04:53 SWITCHBOARD *Indexed 997 words in URL https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard [SSTLUG_26NP5]
Description: Melodrama, Debauchery, Comedy: Un Certain Regard | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 13211 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:53 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 477, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu, 71831 bytes
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/images/8294-0aaff10f6f76b338d30cf4327625c44a/916id_010_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://honolulumuseum.org/events/films/16237-kanal - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/films/ca7e507c22c7570c08f43f1504309516/9NsSbie0yQIE4RI7eKfX3o4TlX5zvP_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu
I 2022/06/09 11:04:54 Fulltext indexing: SRP3vG_26NP5 https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu
I 2022/06/09 11:04:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SRP3vG_26NP5 (1735154932441415680)]} 0 2
I 2022/06/09 11:04:54 SWITCHBOARD *Indexed 298 words in URL https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu [SRP3vG_26NP5]
Description: Andrzej Wajda in Honolulu | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3408 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:54 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 474, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:54 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 474, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bjoerkman-stig, 224222 bytes
I 2022/06/09 11:04:54 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=bjoerkman-stig, STACKING TIME = 1, PARSING TIME = 30
I 2022/06/09 11:04:54 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://s3.amazonaws.com/criterion-production/films/e732b8e6dcc290423dc3b347d90adb86/YS7c8q5YqFduubJtYRVVoVuWf774oA_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=bjoerkman-stig
I 2022/06/09 11:04:54 Fulltext indexing: SIDRJm_26NP5 https://www.criterion.com/shop/browse/list?director=bjoerkman-stig
I 2022/06/09 11:04:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SIDRJm_26NP5 (1735154932978286592)]} 0 3
I 2022/06/09 11:04:54 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=bjoerkman-stig [SIDRJm_26NP5]
Description: Stig Björkman films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12476 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b, 224747 bytes
I 2022/06/09 11:04:54 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:54 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1865-2b74f037f454df1d78013f06dc4aaea4/0TUeLtsha8fzPrVMeQ8rNOnpUVmvME_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://s3.amazonaws.com/criterion-production/films/1f4199efee0716b73e643f44cffd628f/FsFD7z9JPS8zGJwwhpboY5pnNOOmIc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 HostQueue forcing crawl-delay of 170 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 475, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 80)) = 170
I 2022/06/09 11:04:54 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b
I 2022/06/09 11:04:54 Fulltext indexing: Rzwf9m_26NP5 https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b
I 2022/06/09 11:04:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Rzwf9m_26NP5 (1735154933218410496)]} 0 2
I 2022/06/09 11:04:54 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b [Rzwf9m_26NP5]
Description: Ernest B. Schoedsack films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12511 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:54 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 475, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=wilde-cornel, 224168 bytes
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=wilde-cornel, STACKING TIME = 6, PARSING TIME = 20
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/040b9ada9b2e7f00bf67207c219e907d/SqsWAMsxDc8dEu6YmD98XbtlIJ2Qak_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=wilde-cornel
I 2022/06/09 11:04:55 Fulltext indexing: RvqSLm_26NP5 https://www.criterion.com/shop/browse/list?director=wilde-cornel
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RvqSLm_26NP5 (1735154933515157504)]} 0 2
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=wilde-cornel [RvqSLm_26NP5]
Description: Cornel Wilde films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12479 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:55 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 475, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:55 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=shinarbaev-ermek, 224692 bytes
I 2022/06/09 11:04:55 HTCACHE storing content of url https://www.criterion.com/films/3558, 72654 bytes
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=shinarbaev-ermek, STACKING TIME = 1, PARSING TIME = 85
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1767-0744687c884e8c056982d471b122dce3/cgKxO604g3phzPqpONSN5STBrfnV6y_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/db96c5947256e58c76b29162412c782b/fFzRqS2k4qQ7uYfoJgI45cQIQIScQ3_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/3558, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/a87bb06a2e40fbf073674fb0a669feaf/TAA69Jf5iXdiViDJjty3V3PIFzKjDp_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/homicide?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/726755430bd298a5aa424f68a792bcea/aQ0KQhoip19olkpwhmNbrAfMY6qhAB_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/609ba3283ee6e7a688a8f332948af460/UQs28Jv5Fz6nuVrbcLXq70jhwKzxXV_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/1227-/8bS0fqEqZNNhLc6lbEcIPk5Z9yyfcO_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/4e478bb135ededb77bf009fbb602208c.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/f4cd870c9d0136001f98d8ec2ac268d3/qPf9kR60ROOrUxfCNGNl7HiYLiR48u_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=shinarbaev-ermek
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Ro-ekm_26NP5 (1735154933834973184)]} 0 2
I 2022/06/09 11:04:55 Fulltext indexing: Ro-ekm_26NP5 https://www.criterion.com/shop/browse?director=shinarbaev-ermek
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse?director=shinarbaev-ermek [Ro-ekm_26NP5]
Description: Ermek Shinarbaev films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12470 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/films/3558
I 2022/06/09 11:04:55 Fulltext indexing: RfUkwe_26NP5 https://www.criterion.com/films/3558
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RfUkwe_26NP5 (1735154933855944704)]} 0 1
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 347 words in URL https://www.criterion.com/films/3558 [RfUkwe_26NP5]
Description: Homicide (1991) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4412 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:55 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 474, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:55 HTCACHE storing content of url https://www.criterion.com/current/posts/509-watching-sal, 70401 bytes
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/posts/509-watching-sal, STACKING TIME = 2, PARSING TIME = 5
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/images/4303-6645624b4a119533b1295cd1fa38e445/img_current_45_220_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/509-watching-sal
I 2022/06/09 11:04:55 Fulltext indexing: RFF_-G_26NP5 https://www.criterion.com/current/posts/509-watching-sal
I 2022/06/09 11:04:55 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=kaufman-boris, 224644 bytes
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RFF_-G_26NP5 (1735154934097117184)]} 0 6
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 694 words in URL https://www.criterion.com/current/posts/509-watching-sal [RFF_-G_26NP5]
Description: Watching Salò | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 9754 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=kaufman-boris, STACKING TIME = 1, PARSING TIME = 111
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 470, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
I 2022/06/09 11:04:55 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:04:55 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1843-6e179857a2b22f6fff1aee72814e6e1f/ucDdSNidzRCUS9xrmSXHu8tdlizbxG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/5ac9d89c2f87805b7beb1cf45f2fb262/HXRWSQehYCyfyKqH7wzDP2eO7dp7QE_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=kaufman-boris
I 2022/06/09 11:04:55 Fulltext indexing: RM-Tzm_26NP5 https://www.criterion.com/shop/browse?director=kaufman-boris
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RM-Tzm_26NP5 (1735154934363455488)]} 0 8
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse?director=kaufman-boris [RM-Tzm_26NP5]
Description: Boris Kaufman films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12482 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:55 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:56 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 470, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
I 2022/06/09 11:04:56 HTCACHE storing content of url https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary, 84962 bytes
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://criterion-production.s3.amazonaws.com/WUiA1p5V17eJIY6KvFiQxCXc80yBiX.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary, STACKING TIME = 2, PARSING TIME = 8
I 2022/06/09 11:04:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5751-/3e2CxdDLUQxmY46KEB1jbn00ytTKT9_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://criterion-production.s3.amazonaws.com/bbzzvn00I7zQcDvDxCS1eHVjRdrPUQ.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://criterion-production.s3.amazonaws.com/pYeegXJvMDTvetH6naGPkBhXJNgpRB.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/films/78a86bc12fbed832f0b341609a22fa52/lun1ptGstEhxOhmc24pORDXEVkN2Ve_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 6)) = 245
I 2022/06/09 11:04:56 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary
I 2022/06/09 11:04:56 Fulltext indexing: QbrssG_26NP5 https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary
I 2022/06/09 11:04:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[QbrssG_26NP5 (1735154934816440320)]} 0 19
I 2022/06/09 11:04:56 SWITCHBOARD *Indexed 1172 words in URL https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary [QbrssG_26NP5]
Description: Bowling for Columbine: By Any Means Necessary | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 16981 bytes |
LinkStorageTime: 28 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:56 HTCACHE storing content of url https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage, 130340 bytes
I 2022/06/09 11:04:56 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/images/3756-5f2f10174229f1dc5a0cbf043e8dfa68/phantomcarriage_415_005_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
I 2022/06/09 11:04:56 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage
I 2022/06/09 11:04:56 Fulltext indexing: P6kRKG_26NP5 https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage
I 2022/06/09 11:04:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[P6kRKG_26NP5 (1735154935231676416)]} 0 4
I 2022/06/09 11:04:56 SWITCHBOARD *Indexed 1024 words in URL https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage [P6kRKG_26NP5]
Description: Phantom Forms: The Phantom Carriage | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14987 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:56 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mishima-yukio, 224184 bytes
I 2022/06/09 11:04:56 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:56 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=mishima-yukio, STACKING TIME = 1, PARSING TIME = 62
I 2022/06/09 11:04:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/films/1afc31bda9087f75091fae936b5c1ca0/1HMg9MpF1yL5AAmyha7kzSbACv2zcV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:56 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=mishima-yukio
I 2022/06/09 11:04:56 Fulltext indexing: PxU5tm_26NP5 https://www.criterion.com/shop/browse/list?director=mishima-yukio
I 2022/06/09 11:04:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PxU5tm_26NP5 (1735154935480188928)]} 0 2
I 2022/06/09 11:04:56 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=mishima-yukio [PxU5tm_26NP5]
Description: Yukio Mishima films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12491 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:57 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=inoue-akira, 224692 bytes
I 2022/06/09 11:04:57 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:57 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=inoue-akira, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:04:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/films/c843e10ded9d547c8f1996012140b58b/KW0pFMBe2u60vjTtDjf2MiOMTfytam_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/608-ingmar-bergman-makes-a-movie
I 2022/06/09 11:04:57 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
I 2022/06/09 11:04:57 REJECTED https://www.criterion.com/films/608-ingmar-bergman-makes-a-movie - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/608-ingmar-bergman-makes-a-movie aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:04:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Pky7re_26NP5 (1735154935680466944)]} 0 3
I 2022/06/09 11:04:57 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=inoue-akira
I 2022/06/09 11:04:57 Fulltext indexing: Pk9dJm_26NP5 https://www.criterion.com/shop/browse/list?director=inoue-akira
I 2022/06/09 11:04:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Pk9dJm_26NP5 (1735154935693049856)]} 0 2
I 2022/06/09 11:04:57 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=inoue-akira [Pk9dJm_26NP5]
Description: Akira Inoue films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12480 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:57 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:57 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=sjoestroem-victor, 224191 bytes
I 2022/06/09 11:04:57 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=sjoestroem-victor, STACKING TIME = 1, PARSING TIME = 77
I 2022/06/09 11:04:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=sjoestroem-victor
I 2022/06/09 11:04:57 Fulltext indexing: Pk3vlm_26NP5 https://www.criterion.com/shop/browse?director=sjoestroem-victor
I 2022/06/09 11:04:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Pk3vlm_26NP5 (1735154936091508736)]} 0 2
I 2022/06/09 11:04:57 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse?director=sjoestroem-victor [Pk3vlm_26NP5]
Description: Victor Sjöström films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12412 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:57 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 470, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:57 HTCACHE storing content of url https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff, 71108 bytes
I 2022/06/09 11:04:57 SWITCHBOARD CRAWL: ADDED 64 LINKS FROM https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.tiff.net/the-review/tiff-2018-canadian-films/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.tiff.net/tiff/what-is-democracy/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.filmlinc.org/nyff2018/daily/julian-schnabel-at-eternitys-gate-closing-night-nyff56/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.tiff.net/tiff/anthropocene/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://criterion-production.s3.amazonaws.com/KHTH1OZ0QZyKcCiHHx9h8zlVsmTa1u.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.tiff.net/tiff/the-fall-of-the-american-empire/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.instagram.com/p/BezA0SAh69_/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.youtube.com/embed/k2zvPTGiJj4?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://gallery.mailchimp.com/bed63d3ce10ec9adba60ea410/files/690ef579-f1e5-4c6a-8520-d7a65adedb8e/TIFF_ANNOUNCES_THE_WORLD_PREMIERE_OF_XAVIER_DOLAN_S_THE_DEATH_AND_LIFE_OF_JOHN_F._DONOVAN.pdf - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 REJECTED https://www.instagram.com/p/Bey2fUjg_AC/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:57 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff
I 2022/06/09 11:04:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PVVaRG_26NP5 (1735154936227823616)]} 0 1
I 2022/06/09 11:04:57 Fulltext indexing: PVVaRG_26NP5 https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff
I 2022/06/09 11:04:57 SWITCHBOARD *Indexed 437 words in URL https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff [PVVaRG_26NP5]
Description: Schnabel at NYFF, Dolan at TIFF | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5326 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:57 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:58 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images, 111351 bytes
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=risi-dino, 224166 bytes
I 2022/06/09 11:04:58 SWITCHBOARD CRAWL: ADDED 68 LINKS FROM https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images, STACKING TIME = 6, PARSING TIME = 29
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/p69daGpYEMwv1oeYjTTbMyvhbeE23f.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/anqRb3iEvsM48b8ZZy95xyrpvn0SW0.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/3JOp4FzRo0Yj6GGNK9WFk9JvTM7ve9.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://vhx.imgix.net/criterionchannelchartersu/assets/fe816e81-c4da-463d-852b-3b50d46072e8-0f1dcdf9.jpg?auto=format%2Ccompress&fit=crop&h=360&q=70&w=640 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/vZ1FmsXmVr3Tl2QcaVDWbkxmlirHxz.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7433-/TaYxcGWDJ3jXRzpSOx9ehob2S9S4mm_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/iVrw87O9tVNu8oFk2NVCDxiVbcTEMF.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7153-/nV4ZAell01TtJQf3ioJDSHLKKtpjtk_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1979-b50ddf05e790d45f0d3452882a1c5104/Yz5hb3zePSR6H1BFKoUnt0dQHSxzSj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/lAjYL9Xq2bJ9vfVovFjyaSRseOXVaA.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/inspired-by-marlon-riggs?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=current - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
I 2022/06/09 11:04:58 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=risi-dino, STACKING TIME = 1, PARSING TIME = 67
I 2022/06/09 11:04:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/41d9df20fc245c4ecc12ef934533adc2/HrfTWPnllatTkm1Npqwr0o4Jslre1h_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images
I 2022/06/09 11:04:58 Fulltext indexing: PF73kG_26NP5 https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images
I 2022/06/09 11:04:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PF73kG_26NP5 (1735154937146376192)]} 0 12
I 2022/06/09 11:04:58 SWITCHBOARD *Indexed 1878 words in URL https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images [PF73kG_26NP5]
Description: The Signifyin Works of Marlon Riggs: Positive Images | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 37014 bytes |
LinkStorageTime: 14 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance, 76950 bytes
I 2022/06/09 11:04:58 HostQueue forcing crawl-delay of 248 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 467, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 6)) = 245
I 2022/06/09 11:04:58 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:58 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance, STACKING TIME = 2, PARSING TIME = 79
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/3b88fba9072b416d90a382e62b44a6b4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/18114be2187272d203efeda6bc250cf3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://itunes.apple.com/us/movie/id1168909411?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/ca9546c2ceb1b830a1e5f190805fb6fd/21FyOwvaD7hVXctaQrX9RVy5muIn6D_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-sword-of-vengeance?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://www.amazon.com/dp/B01M7ZWC9V - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/f936394d5e829494b0c15bbfa73d850e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:58 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=risi-dino
I 2022/06/09 11:04:58 Fulltext indexing: PLGzxm_26NP5 https://www.criterion.com/shop/browse/list?director=risi-dino
I 2022/06/09 11:04:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PLGzxm_26NP5 (1735154937361334272)]} 0 11
I 2022/06/09 11:04:58 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:58 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@8639c47e[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wt(7.7.3):c176:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772696001}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wu(7.7.3):C8:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772698712}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:58 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:58 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=risi-dino [PLGzxm_26NP5]
Description: Dino Risi films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12470 bytes |
LinkStorageTime: 20 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:58 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance
I 2022/06/09 11:04:58 Fulltext indexing: O3mESe_26NP5 https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance
I 2022/06/09 11:04:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[O3mESe_26NP5 (1735154937399083008)]} 0 2
I 2022/06/09 11:04:58 SWITCHBOARD *Indexed 331 words in URL https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance [O3mESe_26NP5]
Description: Lone Wolf and Cub: Sword of Vengeance (1972) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3782 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:58 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 467, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=drew-robert, 224233 bytes
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader, 68316 bytes
I 2022/06/09 11:04:59 LOADER CRAWLER Redirection detected ('HTTP/1.1 301 Moved Permanently') for URL https://www.criterion.com/explore/214-martin-scorsese-s-top-10
I 2022/06/09 11:04:59 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/current/top-10-lists/214-martin-scorsese-s-top-10
I 2022/06/09 11:04:59 REJECTED https://www.criterion.com/explore/214-martin-scorsese-s-top-10 - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/explore/214-martin-scorsese-s-top-10 aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:37:10Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:04:59 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=drew-robert, STACKING TIME = 2, PARSING TIME = 115
I 2022/06/09 11:04:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Oqv8Uk_26NP5 (1735154937629769728)]} 0 25
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/films/cd4e04ca68b6bfb60fdcbb7ea6b8386f/JtYzw2kwoN2ahEB7YqQiYXBBzRbTvp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader, STACKING TIME = 7, PARSING TIME = 34
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/meet-the-filmmakers-paul-schrader - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://player.vimeo.com/video/385346582 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=drew-robert
I 2022/06/09 11:04:59 Fulltext indexing: O-aGQm_26NP5 https://www.criterion.com/shop/browse/list?director=drew-robert
I 2022/06/09 11:04:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[O-aGQm_26NP5 (1735154937834242048)]} 0 2
I 2022/06/09 11:04:59 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=drew-robert [O-aGQm_26NP5]
Description: Robert Drew films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12479 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:59 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader
I 2022/06/09 11:04:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[O2N0WG_26NP5 (1735154937859407872)]} 0 1
I 2022/06/09 11:04:59 Fulltext indexing: O2N0WG_26NP5 https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader
I 2022/06/09 11:04:59 SWITCHBOARD *Indexed 301 words in URL https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader [O2N0WG_26NP5]
Description: Alex Ross Perry Pays a Visit to Great American Iconoclast Paul Schrader | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3758 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:59 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:59 HTCACHE storing content of url https://www.criterion.com/current/posts/1437-kap-into-darkness, 84865 bytes
I 2022/06/09 11:04:59 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/1437-kap-into-darkness, STACKING TIME = 2, PARSING TIME = 7
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/films/4513899b08a6c9c55a2705f66fe70452/kvSLd5hpDEUQl2H2vLmSiEIkfamyF5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/images/4553-dc25a80794d892dab17bfbd7ee3646a4/current_655_014_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/1437-kap-into-darkness
I 2022/06/09 11:04:59 Fulltext indexing: Oplu7G_26NP5 https://www.criterion.com/current/posts/1437-kap-into-darkness
I 2022/06/09 11:04:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Oplu7G_26NP5 (1735154938219069440)]} 0 3
I 2022/06/09 11:04:59 SWITCHBOARD *Indexed 527 words in URL https://www.criterion.com/current/posts/1437-kap-into-darkness [Oplu7G_26NP5]
Description: Kapò: Into Darkness | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6205 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:59 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:59 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:59 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=henzell-perry, 224166 bytes
I 2022/06/09 11:05:00 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/27829-silence
I 2022/06/09 11:05:00 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
I 2022/06/09 11:05:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[OIDepe_26NP5 (1735154938694074368)]} 0 0
I 2022/06/09 11:05:00 REJECTED https://www.criterion.com/films/27829-silence - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/27829-silence aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=henzell-perry, STACKING TIME = 2, PARSING TIME = 125
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/films/fbe03f271e488f76de9836e9624a7526/arb7E8i8LJCGfMweLGxCwCClH5YVAI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=moore-michael-1, 224192 bytes
I 2022/06/09 11:05:00 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse?director=moore-michael-1, STACKING TIME = 1, PARSING TIME = 74
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/films/78a86bc12fbed832f0b341609a22fa52/lun1ptGstEhxOhmc24pORDXEVkN2Ve_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=henzell-perry
I 2022/06/09 11:05:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Odhdim_26NP5 (1735154938872332288)]} 0 2
I 2022/06/09 11:05:00 Fulltext indexing: Odhdim_26NP5 https://www.criterion.com/shop/browse/list?director=henzell-perry
I 2022/06/09 11:05:00 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=henzell-perry [Odhdim_26NP5]
Description: Perry Henzell films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12461 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:00 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=moore-michael-1
I 2022/06/09 11:05:00 Fulltext indexing: OawGcm_26NP5 https://www.criterion.com/shop/browse?director=moore-michael-1
I 2022/06/09 11:05:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[OawGcm_26NP5 (1735154938936295424)]} 0 2
I 2022/06/09 11:05:00 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse?director=moore-michael-1 [OawGcm_26NP5]
Description: Michael Moore films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12430 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:00 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:05:00 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=jaglom-henry, 224731 bytes
I 2022/06/09 11:05:00 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=jaglom-henry, STACKING TIME = 1, PARSING TIME = 89
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/films/ebedf2335fa256b63d4e645e8c508f12/A4UT1QlIsS1FGC0qJQAS21FqwPqpNn_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1864-79581df36332f7a1f027b81311c9e0f9/j8fKXLhpRU7m0dGwebBdwZLgrXaO26_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=jaglom-henry
I 2022/06/09 11:05:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[OCZQCm_26NP5 (1735154939431223296)]} 0 2
I 2022/06/09 11:05:00 Fulltext indexing: OCZQCm_26NP5 https://www.criterion.com/shop/browse/list?director=jaglom-henry
I 2022/06/09 11:05:00 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse/list?director=jaglom-henry [OCZQCm_26NP5]
Description: Henry Jaglom films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12515 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:00 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nguyen-jon, 224200 bytes
I 2022/06/09 11:05:00 HTCACHE storing content of url https://www.criterion.com/current/posts/6396-competition-highs-and-lows, 92734 bytes
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=nguyen-jon, STACKING TIME = 1, PARSING TIME = 34
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 467, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/films/45ae4aaeb01b65d3788e09d553527a0d/O5urSQ3UPS5ELUhh25uHIS7M5ri1p3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 116 LINKS FROM https://www.criterion.com/current/posts/6396-competition-highs-and-lows, STACKING TIME = 12, PARSING TIME = 97
I 2022/06/09 11:05:00 REJECTED https://www.telegraph.co.uk/films/0/matthias-maxime-review-crisp-sweet-canadian-coming-of-age-tale/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://variety.com/2019/film/reviews/oh-mercy-review-1203223481/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://news.yahoo.com/bong-song-double-act-behind-koreas-cannes-victory-211807398.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.festival-cannes.com/en/festival/films/le-jeune-ahmed - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.locarnofestival.ch/pardo/pardo-live/today-at-festival/2019/05/Excellence_SONG.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://time.com/5592027/cannes-review-pedro-almodovar-pain-and-glory/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://criterion-production.s3.amazonaws.com/BlcZzr94BSUcAb10XEIg1TLf9mkCQW.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.hollywoodreporter.com/review/sibyl-review-1212046 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.filmcomment.com/blog/film-of-the-week-oh-mercy/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/embed/YcHB6eE3I1k?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/nuestras-madres - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.festival-cannes.com/en/festival/films/sibyl - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://film.avclub.com/robert-pattinson-and-willem-dafoe-get-lighthouse-fever-1834913555 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://cineuropa.org/en/newsdetail/373264/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://thefilmstage.com/reviews/frankie-review-cannes-isabelle-huppert-ira-sachs/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED http://www.anothergaze.com/celine-sciammas-portrait-de-la-jeune-fille-en-feu-portrait-lady-fire-explores-boundlessness-poetic-love-cannes-lesbian-feminist/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.filmcomment.com/blog/cannes-interview-adele-haenel/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/embed/iPNfFRZjgkE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.filmcomment.com/blog/cannes-interview-mati-diop/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.thedailybeast.com/young-ahmed-a-disturbing-portrait-of-an-islamic-terrorist-invades-cannes - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-mektoub-my-love-intermezzo-abdellatif-kechiche-20190523-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://twitter.com/jessicakiang/status/1131133907896815617 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://film.avclub.com/guessing-the-winners-and-picking-our-own-at-the-end-o-1835011696 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/it-must-be-heaven - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.nytimes.com/2019/05/25/movies/cannes-film-festival-winners-parasite.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/embed/uS-2B8Vl_fA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://variety.com/2019/film/reviews/frankie-review-isabelle-huppert-1203220585/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://filmmakermagazine.com/107581-cannes-2019-dispatch-6-parasite-once-upon-a-time-in-hollywood-mektoub-my-love-intermezzo/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.theguardian.com/film/2019/may/20/portrait-of-a-lady-on-fire-review-celine-sciamma - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED http://www.lefilmfrancais.com/cinema/142135/cannes2019-le-tableau-final-des-etoiles-de-la-critique-palmometre - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.instagram.com/p/BxrbBkFhvq4/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/mektoub-my-love-intermezzo - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/roubaix-une-lumiere - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.nytimes.com/2019/05/24/movies/cannes-almodovar-kechiche.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://twitter.com/CriterionDaily/status/1132302591977807872 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://lwlies.com/festivals/young-ahmed-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://twitter.com/CriterionDaily/status/1132318367287787521 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://variety.com/2019/film/reviews/cannes-film-review-matthias-maxime-1203223223/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/portrait-de-la-jeune-fille-en-feu - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/the-distance-between-us-and-the-sky - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://variety.com/2019/film/reviews/sibyl-review-1203225243/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/il-traditore - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/matthias-et-maxime - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.indiewire.com/2019/05/the-traitor-review-marco-bellocchio-cannes-1202144226/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.indiewire.com/2019/05/mektoub-my-love-intermezzo-unsimulated-sex-alcohol-report-abdellatif-kechiche-1202144998/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-13-trying-times - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/frankie - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://thefilmstage.com/reviews/cannes-review-with-parasite-bong-joon-ho-delivers-an-electrifying-assessment-of-social-stratification/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/embed/ssxK8FboFo4?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.screendaily.com/reviews/the-traitor-cannes-review/5139679.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=nguyen-jon
I 2022/06/09 11:05:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[NnlDam_26NP5 (1735154939833876480)]} 0 2
I 2022/06/09 11:05:01 Fulltext indexing: NnlDam_26NP5 https://www.criterion.com/shop/browse/list?director=nguyen-jon
I 2022/06/09 11:05:01 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=nguyen-jon [NnlDam_26NP5]
Description: Jon Nguyen films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12487 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:01 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 467, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:05:01 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/6396-competition-highs-and-lows
I 2022/06/09 11:05:01 Fulltext indexing: NWKbAG_26NP5 https://www.criterion.com/current/posts/6396-competition-highs-and-lows
I 2022/06/09 11:05:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[NWKbAG_26NP5 (1735154939941879808)]} 0 4
I 2022/06/09 11:05:01 SWITCHBOARD *Indexed 1291 words in URL https://www.criterion.com/current/posts/6396-competition-highs-and-lows [NWKbAG_26NP5]
Description: Competition Highs and Lows | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 18189 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:01 HTCACHE storing content of url https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril, 76601 bytes
I 2022/06/09 11:05:01 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/95246b965f81274a17549dcbe2b4f694.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/a73036c1a053afe9981265d73ad0222e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-baby-cart-in-peril?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/3de10aea9022f1adb4b52927aeadd2da.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://itunes.apple.com/us/movie/id1170328408?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.amazon.com/dp/B01MXHGDKK - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/c7f130a16b24d863d6a58d45b4220e8d/pYeWiMfBSfKQvp81wNDqhKe22Q1FBj_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril
I 2022/06/09 11:05:01 Fulltext indexing: NLL08e_26NP5 https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril
I 2022/06/09 11:05:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[NLL08e_26NP5 (1735154940066660352)]} 0 1
I 2022/06/09 11:05:01 SWITCHBOARD *Indexed 338 words in URL https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril [NLL08e_26NP5]
Description: Lone Wolf and Cub: Baby Cart in Peril (1972) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3782 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:01 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 466, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
I 2022/06/09 11:05:01 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=babenco-hector, 224795 bytes
I 2022/06/09 11:05:01 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 466, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:05:01 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=babenco-hector, STACKING TIME = 3, PARSING TIME = 197
I 2022/06/09 11:05:01 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1961-3b7b159a9ae3d500459e38e69c96a917/9P9MeXzolFQyY5OhK2XcwZ02e0Y0ZC_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/adc5dbcfd3feab7fb5ebe9ce4b103691/oTbeMEU4djNsFgw76dm1476k4EGUjO_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:01 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=babenco-hector
I 2022/06/09 11:05:01 Fulltext indexing: M-Eb9m_26NP5 https://www.criterion.com/shop/browse/list?director=babenco-hector
I 2022/06/09 11:05:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[M-Eb9m_26NP5 (1735154940651765760)]} 0 2
I 2022/06/09 11:05:01 SWITCHBOARD *Indexed 1204 words in URL https://www.criterion.com/shop/browse/list?director=babenco-hector [M-Eb9m_26NP5]
Description: Héctor Babenco films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12540 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:02 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 466, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:05:02 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=micheaux-oscar, 224777 bytes
I 2022/06/09 11:05:02 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=micheaux-oscar, STACKING TIME = 1, PARSING TIME = 32
I 2022/06/09 11:05:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/e8c19690806d171f7e3ef5c667737ca9/v6zAyHvnNeorIDi62bcCZ2NTMmOwHJ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=micheaux-oscar
I 2022/06/09 11:05:02 Fulltext indexing: Mp1dxm_26NP5 https://www.criterion.com/shop/browse/list?director=micheaux-oscar
I 2022/06/09 11:05:02 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
I 2022/06/09 11:05:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Mp1dxm_26NP5 (1735154941071196160)]} 0 3
I 2022/06/09 11:05:02 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse/list?director=micheaux-oscar [Mp1dxm_26NP5]
Description: Oscar Micheaux films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12529 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:02 HTCACHE storing content of url https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades, 76467 bytes
I 2022/06/09 11:05:02 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bogdanovich-peter, 224781 bytes
I 2022/06/09 11:05:02 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades, STACKING TIME = 14, PARSING TIME = 21
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/9412bc6d3169c80870d1fe3af4c95bc5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://itunes.apple.com/us/movie/id1169837282?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-baby-cart-to-hades?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/86bd8d74849a8d3e70dde7c17521c24b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/bdccef4372055a6d7eba1d9e48d671e0/KbfBzW7rIf0GLzZ6mfOiZKBiAPrKFM_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.amazon.com/dp/B01M6EAG9R - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/bd8e88e276dc6d474902734b8d0faf45.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades
I 2022/06/09 11:05:02 Fulltext indexing: MMTW2e_26NP5 https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades
I 2022/06/09 11:05:02 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bogdanovich-peter, STACKING TIME = 1, PARSING TIME = 38
I 2022/06/09 11:05:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/e3337688841b6e5c3e22796b3a889e02/feBBtCtzScE9oOvniyPVCbTsG34x2V_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1864-79581df36332f7a1f027b81311c9e0f9/j8fKXLhpRU7m0dGwebBdwZLgrXaO26_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[MMTW2e_26NP5 (1735154941329145856)]} 0 17
I 2022/06/09 11:05:02 SWITCHBOARD *Indexed 335 words in URL https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades [MMTW2e_26NP5]
Description: Lone Wolf and Cub: Baby Cart to Hades (1972) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3758 bytes |
LinkStorageTime: 19 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:02 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 504, robots.delay = 0, ((waitig = 252) - (timeSinceLastAccess = 11)) = 241
I 2022/06/09 11:05:02 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bogdanovich-peter
I 2022/06/09 11:05:02 Fulltext indexing: MhnEmm_26NP5 https://www.criterion.com/shop/browse/list?director=bogdanovich-peter
I 2022/06/09 11:05:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[MhnEmm_26NP5 (1735154941403594752)]} 0 2
I 2022/06/09 11:05:02 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse/list?director=bogdanovich-peter [MhnEmm_26NP5]
Description: Peter Bogdanovich films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12521 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:02 HTCACHE storing content of url https://www.criterion.com/films/29404-lettres-d-amour, 70654 bytes
I 2022/06/09 11:05:02 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/29404-lettres-d-amour, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1816-e188e2102b63387dbe23fc67edb6beea/DWwbQG5RL4lfG3HYvO9uUZ78amems4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9fbdd9a4cfc783daf7ad2275333b3a90.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/6524a1ed52ee915b2a7148ac59ab1881.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/45c628fc5070f6fee6f29ef34ca518e9.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f56a6a3cf85f93e5dc84efb56fd8100e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/09ef0ec590d8fcbf5ea01259ae3ba9b9/3OZ3L2WPoJxZgumbeOigOW0aKsXxdJ_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/29d1fe954024975e5cd88f6e316f6823.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/lettres-d-amour?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/84f60757a2466574aa9497b56c5b2ec1.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/dbd64381dc74a65e71a743b59a63b9c6.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/images/9518-74f3b68e17d1c132c4fb7dd0555570cc/Current_29404id_015_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:02 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/films/29404-lettres-d-amour
I 2022/06/09 11:05:02 Fulltext indexing: MHDG8e_26NP5 https://www.criterion.com/films/29404-lettres-d-amour
I 2022/06/09 11:05:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[MHDG8e_26NP5 (1735154941601775616)]} 0 2
I 2022/06/09 11:05:02 SWITCHBOARD *Indexed 270 words in URL https://www.criterion.com/films/29404-lettres-d-amour [MHDG8e_26NP5]
Description: Lettres d’amour (1942) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3061 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:02 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 484, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 8)) = 242
I 2022/06/09 11:05:03 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 484, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=adolphson-edvin, 224809 bytes
I 2022/06/09 11:05:03 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=adolphson-edvin, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:05:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/bb3d92ac28b94e8cbad1c169e15230eb/gxgY0bIbXgkuyFw4Zs4uwe1uD5PxnV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=adolphson-edvin
I 2022/06/09 11:05:03 Fulltext indexing: L7wf3m_26NP5 https://www.criterion.com/shop/browse/list?director=adolphson-edvin
I 2022/06/09 11:05:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[L7wf3m_26NP5 (1735154942056857600)]} 0 3
I 2022/06/09 11:05:03 SWITCHBOARD *Indexed 1198 words in URL https://www.criterion.com/shop/browse/list?director=adolphson-edvin [L7wf3m_26NP5]
Description: Edvin Adolphson films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12524 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lelouch-claude, 224291 bytes
I 2022/06/09 11:05:03 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 500, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:03 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=lelouch-claude, STACKING TIME = 1, PARSING TIME = 29
I 2022/06/09 11:05:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=lelouch-claude
I 2022/06/09 11:05:03 Fulltext indexing: L4Iihm_26NP5 https://www.criterion.com/shop/browse/list?director=lelouch-claude
I 2022/06/09 11:05:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[L4Iihm_26NP5 (1735154942343118848)]} 0 2
I 2022/06/09 11:05:03 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=lelouch-claude [L4Iihm_26NP5]
Description: Claude Lelouch films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12584 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:03 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 500, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:05:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=sjoeberg-alf, 225777 bytes
I 2022/06/09 11:05:03 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=sjoeberg-alf, STACKING TIME = 1, PARSING TIME = 21
I 2022/06/09 11:05:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/f05e3d8b8d48f95dff100aa47f9f61ed/98HE82POwakYBdneI0UmYxj3CmWRyN_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/493d910d7cfcce1d103f6354027d6d37/NT4wqOdozXSzODYv04w2NGpHYiqc23_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1777-0bf5a031bd21c3d35f64323b27f49d77/BYXas4KPZ66EXCahX6uErMaxE3Xs3e_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=sjoeberg-alf
I 2022/06/09 11:05:03 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:05:03 Fulltext indexing: Lfk-Dm_26NP5 https://www.criterion.com/shop/browse/list?director=sjoeberg-alf
I 2022/06/09 11:05:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Lfk-Dm_26NP5 (1735154942616797184)]} 0 4
I 2022/06/09 11:05:03 SWITCHBOARD *Indexed 1212 words in URL https://www.criterion.com/shop/browse/list?director=sjoeberg-alf [Lfk-Dm_26NP5]
Description: Alf Sjöberg films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12566 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:03 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 515, robots.delay = 0, ((waitig = 257) - (timeSinceLastAccess = 12)) = 245
I 2022/06/09 11:05:03 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:05:03 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@4cf853f4[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wt(7.7.3):c176:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772696001}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wu(7.7.3):C8:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772698712}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wv(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM 
Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772703827}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:05:03 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:05:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=smith-kevin, 224168 bytes
I 2022/06/09 11:05:03 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=smith-kevin, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:05:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/7219028d5161c33275211a71074f5f4a/ma1iHSdXi2xvx6cDYrynn8vKKNkODq_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=smith-kevin
I 2022/06/09 11:05:03 Fulltext indexing: LI70Dm_26NP5 https://www.criterion.com/shop/browse/list?director=smith-kevin
I 2022/06/09 11:05:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[LI70Dm_26NP5 (1735154942864261120)]} 0 4
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=smith-kevin [LI70Dm_26NP5]
Description: Kevin Smith films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12472 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:04 HostQueue forcing crawl-delay of 251 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 522, robots.delay = 0, ((waitig = 261) - (timeSinceLastAccess = 10)) = 251
I 2022/06/09 11:05:04 HTCACHE storing content of url https://www.criterion.com/films/27624-the-white-angel, 73839 bytes
I 2022/06/09 11:05:04 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/films/27624-the-white-angel, STACKING TIME = 1, PARSING TIME = 11
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/c5fe70d9457505b773db808a0aa1bbae.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/images/3719-34e86eb3abe14685658f5613efd3b146/matarazzo_nobodyschildren-4_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/the-white-angel?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/224ad3784ebfb34f98b1af628337f3da/gf5q2Dxvw2rDGLoNCNOnF3L53EUKqK_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/fd022ca340ea50b496b2555066d48d57.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f0ef8434f807ab5082a6a0b63f2111a4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1803-6a8b76d7af61cbee31aced4a8191a85a/zKIrZpwHhlb5ETRsoB0UujdI9OwYFz_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/b59efc718fc3309a4eb76255310280b8/MpKeZ33lims6VNmUQPeUZ06u1HCgd5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/c3671b7d05dd992c80de898da6f724a8/iKAAnLTwUhBFo0X62zBb8ijm258Sey_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/185ba19b70895755d4b0dce4cd86460c.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/572b757a5f9510a89da713901d2a6d39.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6475-/lZ5EyHLWra1qaKr6A01KvgMRozwhlx_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/e033d65a68d10236f332c06ce3725b59/5Pvpc7nsRpgo8lu3IQVlAeheME1g9w_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/6ab1e62fe3a341659643ec9abd877ef8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/films/27624-the-white-angel
I 2022/06/09 11:05:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[LD6nNe_26NP5 (1735154943084462080)]} 0 2
I 2022/06/09 11:05:04 Fulltext indexing: LD6nNe_26NP5 https://www.criterion.com/films/27624-the-white-angel
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 312 words in URL https://www.criterion.com/films/27624-the-white-angel [LD6nNe_26NP5]
Description: The White Angel (1955) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3438 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:04 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=clayton-jack, 224160 bytes
I 2022/06/09 11:05:04 HostQueue forcing crawl-delay of 253 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 525, robots.delay = 0, ((waitig = 262) - (timeSinceLastAccess = 9)) = 253
I 2022/06/09 11:05:04 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=clayton-jack, STACKING TIME = 1, PARSING TIME = 89
I 2022/06/09 11:05:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/4a2579a876453c44391878218c0ff019/BXl2PKj5T4lPvbzJpQb8AQ49rf6uBh_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=clayton-jack
I 2022/06/09 11:05:04 Fulltext indexing: LFVvMm_26NP5 https://www.criterion.com/shop/browse/list?director=clayton-jack
I 2022/06/09 11:05:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[LFVvMm_26NP5 (1735154943366529024)]} 0 2
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 1186 words in URL https://www.criterion.com/shop/browse/list?director=clayton-jack [LFVvMm_26NP5]
Description: Jack Clayton films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12460 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:04 HostQueue forcing crawl-delay of 252 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 525, robots.delay = 0, ((waitig = 262) - (timeSinceLastAccess = 10)) = 252
I 2022/06/09 11:05:04 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=macpherson-kenneth, 224746 bytes
I 2022/06/09 11:05:04 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=macpherson-kenneth, STACKING TIME = 1, PARSING TIME = 25
I 2022/06/09 11:05:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/64a7765172f162a06129a504d81dcc64/RKbm2Glxv6ZMeQjvZyFuk4mZkZanvo_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 HTCACHE storing content of url https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling, 65581 bytes
I 2022/06/09 11:05:04 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling, STACKING TIME = 5, PARSING TIME = 5
I 2022/06/09 11:05:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/images/4280-0b082dc4055ce557083e17481a378dd9/current_2_302_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 506, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 12)) = 241
I 2022/06/09 11:05:04 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=macpherson-kenneth
I 2022/06/09 11:05:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[K59zXm_26NP5 (1735154943846776832)]} 0 2
I 2022/06/09 11:05:04 Fulltext indexing: K59zXm_26NP5 https://www.criterion.com/shop/browse/list?director=macpherson-kenneth
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse/list?director=macpherson-kenneth [K59zXm_26NP5]
Description: Kenneth Macpherson films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12509 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:04 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling
I 2022/06/09 11:05:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[KWQVsG_26NP5 (1735154943872991232)]} 0 1
I 2022/06/09 11:05:04 Fulltext indexing: KWQVsG_26NP5 https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 438 words in URL https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling [KWQVsG_26NP5]
Description: The Hours and Times: Kurosawa and the Art of Epic Storytelling | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5422 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:05 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 506, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 10)) = 243
I 2022/06/09 11:05:05 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=pfleghar-michael, 224306 bytes
I 2022/06/09 11:05:05 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=pfleghar-michael, STACKING TIME = 1, PARSING TIME = 67
I 2022/06/09 11:05:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/my_criterion/4426-jonathan-keogh
I 2022/06/09 11:05:05 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/account/my-criterion
I 2022/06/09 11:05:05 REJECTED https://www.criterion.com/my_criterion/4426-jonathan-keogh - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/my_criterion/4426-jonathan-keogh to https://www.criterion.com/account/my-criterion placed on crawler queue for double-check
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[J8jGN6_26NP5 (1735154944212729856)]} 0 6
I 2022/06/09 11:05:05 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=pfleghar-michael
I 2022/06/09 11:05:05 Fulltext indexing: Ksx0Cm_26NP5 https://www.criterion.com/shop/browse/list?director=pfleghar-michael
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Ksx0Cm_26NP5 (1735154944287178752)]} 0 2
I 2022/06/09 11:05:05 SWITCHBOARD *Indexed 1186 words in URL https://www.criterion.com/shop/browse/list?director=pfleghar-michael [Ksx0Cm_26NP5]
Description: Michael Pfleghar films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12582 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:05 HostQueue forcing crawl-delay of 254 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 529, robots.delay = 0, ((waitig = 264) - (timeSinceLastAccess = 10)) = 254
I 2022/06/09 11:05:05 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=chabrol-claude, 224703 bytes
I 2022/06/09 11:05:05 HTCACHE storing content of url https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe, 73730 bytes
I 2022/06/09 11:05:05 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=chabrol-claude, STACKING TIME = 1, PARSING TIME = 98
I 2022/06/09 11:05:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/films/56ba3e5cf5c30c4ca37d11dbc92db30c/TIrcSypBNof95PXcIsPiciN0HD59y4_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/films/e2ec7ab6b825e43071466bfb89fe7fff/wOzbQfzoXkpgxILMzKBKRMOwT7xfb7_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
I 2022/06/09 11:05:05 SWITCHBOARD CRAWL: ADDED 65 LINKS FROM https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe, STACKING TIME = 1, PARSING TIME = 21
I 2022/06/09 11:05:05 REJECTED https://www.telegraph.co.uk/films/0/little-joe-ben-whishaw-falls-mutant-flower-chilly-sci-fi-fable/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-chang-pedro-almodovar-ken-loach-jessica-hausner-20190518-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.festival-cannes.com/en/festival/films/little-joe - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.indiewire.com/2019/05/little-joe-review-cannes-1202142527/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://variety.com/2019/film/news/jessica-hausner-little-joe-cannes-film-festival-interview-1203219442/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/posts/7823-tribeca-2022 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/little-joe-jessica-hausner-sci-fi-plant-horror-drama - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/embed/S7ihx84V1q4?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/posts/7819-early-summer-reading - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/series/did-you-see-this - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-production.s3.amazonaws.com/eHPF8Mm6sTdbJvdVijP8UW2JzbXnuB.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/embed/I4cdpfJ-k5A?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://film.avclub.com/postwar-drama-and-an-unnerving-spin-on-a-sci-fi-classic-1834865079 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://deadline.com/2019/05/jessica-hausner-little-joe-cannes-interview-news-1202610970/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/category/1-on-film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-5-haitian-zombis-insidious-plants-takashi-miike - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/series/cannes-2019 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/posts/7822-irma-vep-revamp - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/category/20-the-daily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/author/654-david-hudson - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/posts/7818-american-neorealism-now - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=chabrol-claude
I 2022/06/09 11:05:05 Fulltext indexing: KTPXcm_26NP5 https://www.criterion.com/shop/browse/list?director=chabrol-claude
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[KTPXcm_26NP5 (1735154944731774976)]} 0 2
I 2022/06/09 11:05:05 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=chabrol-claude [KTPXcm_26NP5]
Description: Claude Chabrol films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12508 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kDluwa_26NP5 (1735154944745406464)]} 0 0
I 2022/06/09 11:05:05 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/account/my-criterion
I 2022/06/09 11:05:05 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/login
I 2022/06/09 11:05:05 REJECTED https://www.criterion.com/account/my-criterion - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/account/my-criterion aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:34:07Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:05:05 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Jk4sYG_26NP5 (1735154944763232256)]} 0 1
I 2022/06/09 11:05:05 Fulltext indexing: Jk4sYG_26NP5 https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe
I 2022/06/09 11:05:05 SWITCHBOARD *Indexed 537 words in URL https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe [Jk4sYG_26NP5]
Description: Jessica Hausner’s Little Joe | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6523 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:05 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
I 2022/06/09 11:05:06 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/current/top-10-lists/216-paul-dano-s-top-10
I 2022/06/09 11:05:06 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/current/posts
I 2022/06/09 11:05:06 REJECTED https://www.criterion.com/current/top-10-lists/216-paul-dano-s-top-10 - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/current/top-10-lists/216-paul-dano-s-top-10 aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:34:26Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:05:06 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[JfDyUG_26NP5 (1735154945000210432)]} 0 2
I 2022/06/09 11:05:06 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 11)) = 248
I 2022/06/09 11:05:06 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
I 2022/06/09 11:05:06 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke, 224223 bytes
I 2022/06/09 11:05:06 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke, STACKING TIME = 1, PARSING TIME = 71
I 2022/06/09 11:05:06 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://s3.amazonaws.com/criterion-production/films/84f05f64e1f280532d29c921e32e79f9/TpWnFHWJR7iUJtg686Ei4gSe78oZWc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:06 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 12)) = 244
I 2022/06/09 11:05:06 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke
I 2022/06/09 11:05:06 Fulltext indexing: Iuksgm_26NP5 https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke
I 2022/06/09 11:05:06 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Iuksgm_26NP5 (1735154945803419648)]} 0 2
I 2022/06/09 11:05:06 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke [Iuksgm_26NP5]
Description: Teinosuke Kinugasa films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12509 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:06 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 10)) = 246
I 2022/06/09 11:05:07 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=to-johnnie, 224108 bytes
I 2022/06/09 11:05:07 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=to-johnnie, STACKING TIME = 1, PARSING TIME = 18
I 2022/06/09 11:05:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://s3.amazonaws.com/criterion-production/films/78f00702358370de10b7256ded97d10b/qh2QGOHZiI77jVyFWnv9ex9XhAUTy0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=to-johnnie
I 2022/06/09 11:05:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[IuaxXm_26NP5 (1735154946180907008)]} 0 2
I 2022/06/09 11:05:07 Fulltext indexing: IuaxXm_26NP5 https://www.criterion.com/shop/browse?director=to-johnnie
I 2022/06/09 11:05:07 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse?director=to-johnnie [IuaxXm_26NP5]
Description: Johnnie To films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12402 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:07 HTCACHE storing content of url https://www.criterion.com/current/author/208-david-chute, 50989 bytes
I 2022/06/09 11:05:07 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/author/208-david-chute, STACKING TIME = 1, PARSING TIME = 4
I 2022/06/09 11:05:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[H3dbaG_26NP5 (1735154946289958912)]} 0 48
I 2022/06/09 11:05:07 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/author/208-david-chute
I 2022/06/09 11:05:07 Fulltext indexing: H3dbaG_26NP5 https://www.criterion.com/current/author/208-david-chute
I 2022/06/09 11:05:07 SWITCHBOARD *Indexed 156 words in URL https://www.criterion.com/current/author/208-david-chute [H3dbaG_26NP5]
Description: David Chute | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1826 bytes |
LinkStorageTime: 49 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:07 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 502, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 8)) = 243
I 2022/06/09 11:05:07 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=allegret-marc, 224653 bytes
I 2022/06/09 11:05:07 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=allegret-marc, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:05:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://s3.amazonaws.com/criterion-production/films/28e497421d0e485cece5e9269c16af35/ZfUeJFt1aesMSRGbLbQqTuhBztrEGP_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1766-0926bfc8985e1333759badc8421feb20/CcaaWhABn32RGpmaY0QfWfQzx5pNgV_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:07 REJECTED https://www.criterion.com/current/posts/Braden%20King%20https:/twitter.com/bradenking/status/1478847388223692801 - no response body (http return code = 404)
I 2022/06/09 11:05:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[HX_aXG_26NP5 (1735154946424176640)]} 0 0
I 2022/06/09 11:05:07 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=allegret-marc
I 2022/06/09 11:05:07 Fulltext indexing: IJMy1m_26NP5 https://www.criterion.com/shop/browse/list?director=allegret-marc
I 2022/06/09 11:05:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[IJMy1m_26NP5 (1735154946470313984)]} 0 2
I 2022/06/09 11:05:07 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=allegret-marc [IJMy1m_26NP5]
Description: Marc Allégret films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12470 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:07 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 508, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 10)) = 244
I 2022/06/09 11:05:07 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 508, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 10)) = 244
I 2022/06/09 11:05:08 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 508, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 11)) = 243
I 2022/06/09 11:05:08 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=penn-arthur, 224275 bytes
I 2022/06/09 11:05:08 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=penn-arthur, STACKING TIME = 1, PARSING TIME = 32
I 2022/06/09 11:05:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi, 226453 bytes
I 2022/06/09 11:05:08 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 521, robots.delay = 0, ((waitig = 260) - (timeSinceLastAccess = 18)) = 242
I 2022/06/09 11:05:08 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi, STACKING TIME = 6, PARSING TIME = 55
I 2022/06/09 11:05:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/c02c4aa609824030e40216497743497f/dhf3ReLOhyiAk1XzOwejbjQYO9yibb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 HTCACHE storing content of url https://www.criterion.com/current/posts/1504-everlasting-process, 69206 bytes
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1791-33ac56c0ab6c46473fbb827201db6455/cl4pDTM4ZYZyfvJyUFlcDrISlIuq1Z_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/3cec2eb98c004b821d42cc6e14bfb0fc/MjYLPC6QAnciKmRDONHLk59AXcIvqR_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/e62f1e61b5f52c2aaeabaeceaf58b629/BenTqN2hpuF2PKWN8v0M0BZkEMiLAM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/8f2f7b3ac4527fdb2bc4d5dd0c4edc6a/REhK9I9DGXP9cIB0AMU5avMjepHy2I_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/1504-everlasting-process, STACKING TIME = 6, PARSING TIME = 19
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/f4a75fa9e4e3797bba7a179dd774d412/n4PstJirHzLRuR7mdBBOY7ntCG6Suo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://ericskillman.blogspot.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/images/4578-2d849516a06bdf1cedc75cbe45b9686a/current_samsmyth_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 REJECTED https://samsmyth.blogspot.com/2010/06/process-everlasting-moments-dvd-cover.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:08 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=penn-arthur
I 2022/06/09 11:05:08 Fulltext indexing: G54Oum_26NP5 https://www.criterion.com/shop/browse/list?director=penn-arthur
I 2022/06/09 11:05:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[G54Oum_26NP5 (1735154947648913408)]} 0 4
I 2022/06/09 11:05:08 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=penn-arthur [G54Oum_26NP5]
Description: Arthur Penn films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12581 bytes |
LinkStorageTime: 11 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:08 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi
I 2022/06/09 11:05:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[GycWEm_26NP5 (1735154947727556608)]} 0 2
I 2022/06/09 11:05:08 Fulltext indexing: GycWEm_26NP5 https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi
I 2022/06/09 11:05:08 SWITCHBOARD *Indexed 1212 words in URL https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi [GycWEm_26NP5]
Description: Hiroshi Shimizu films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12665 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:08 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/posts/1504-everlasting-process
I 2022/06/09 11:05:08 Fulltext indexing: Go4m6G_26NP5 https://www.criterion.com/current/posts/1504-everlasting-process
I 2022/06/09 11:05:08 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
I 2022/06/09 11:05:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Go4m6G_26NP5 (1735154947744333824)]} 0 1
I 2022/06/09 11:05:08 SWITCHBOARD *Indexed 254 words in URL https://www.criterion.com/current/posts/1504-everlasting-process [Go4m6G_26NP5]
Description: Everlasting Process | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2937 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:08 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:05:08 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:05:08 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@bac5c720[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wt(7.7.3):c176:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772696001}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wu(7.7.3):C8:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772698712}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wv(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772703827}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ww(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772708817}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:05:08 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:05:08 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 11)) = 248
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=w-pabst-g, 226393 bytes
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=w-pabst-g, STACKING TIME = 1, PARSING TIME = 21
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/7d539514e0694356b704de0bd0985ddc/PDCF8VbHXZoKJ3Lhx3IvzOzNlg83oB_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/01f2b09b21d5479965b2b93422ba1072/tUKPob5t1DjOrGRVmJEEfRaalbiOzy_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/8cff4ed5c749a90d25f50667ab1908d2/erzB7rRyF3un8YavvACbHzygC7P0lm_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/61a18b721d7da3c80336baf91a7bd9f7/vjFC4gmkYHWrhV09dgqGRyHtLcJ5Rp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=w-pabst-g
I 2022/06/09 11:05:09 Fulltext indexing: GggVHm_26NP5 https://www.criterion.com/shop/browse/list?director=w-pabst-g
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[GggVHm_26NP5 (1735154948258136064)]} 0 5
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 1213 words in URL https://www.criterion.com/shop/browse/list?director=w-pabst-g [GggVHm_26NP5]
Description: G. W. Pabst films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12667 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 523, robots.delay = 0, ((waitig = 261) - (timeSinceLastAccess = 12)) = 249
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=franju-georges, 224696 bytes
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=franju-georges, STACKING TIME = 0, PARSING TIME = 20
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/fc3c1f281c7c268b95d6ccfcb9c09753/09SjT40uvff2VmvuzTfllqpz5yKg2o_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/257d1602c6a7da83cb3a70db7349bbaf/OXzoGLF8Og7Ffj3X2nJk35Q9nqpSa1_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/current/posts/5808-fall-festival-starters, 75140 bytes
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 69 LINKS FROM https://www.criterion.com/current/posts/5808-fall-festival-starters, STACKING TIME = 6, PARSING TIME = 10
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://film.britishcouncil.org/the-souvenir - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.independent.co.uk/arts-entertainment/films/features/matthias-schoenaerts-interview-film-racer-and-the-jailbird-terrence-malick-a8442311.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.esquire.com/entertainment/movies/a21068561/suspiria-teaser-trailer/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/embed/3uGIEY7tdg8?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.labiennale.org/en/news/restored-films-venezia-classici - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://deadline.com/2016/11/alfonso-cuaron-movie-fight-crew-mexico-city-police-1201847622/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/embed/PSoRx87OO6k?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.labiennale.org/en/news/first-man-damien-chazelle-opening-film-75th-venice-film-festival - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/eRQoFkgvhx4BSKGEehVCLrMDahjA17.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.filmlinc.org/nyff2018/daily/alfonso-cuarons-roma-announced-as-nyff56-centerpiece/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.nytimes.com/2018/07/18/movies/alfonso-cuarns-roma-new-york-film-festival.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://cineuropa.org/en/newsdetail/357023/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://variety.com/2018/film/festivals/first-man-damien-chazelle-ryan-gosling-venice-film-festival-opening-night-1202877318/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://variety.com/2018/film/news/the-sisters-brothers-suspiria-my-brilliant-friend-venice-film-festival-1202878014/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.labiennale.org/en/news/pre-opening-event-75th-festival-tuesday-28-august - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/embed/Gj2oli0MLSU?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://variety.com/2017/film/podcasts/playback-podcast-damien-chazelle-la-la-land-1201963282/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=franju-georges
I 2022/06/09 11:05:09 Fulltext indexing: FdOYmm_26NP5 https://www.criterion.com/shop/browse/list?director=franju-georges
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FdOYmm_26NP5 (1735154948501405696)]} 0 6
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=franju-georges [FdOYmm_26NP5]
Description: Georges Franju films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12510 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/5808-fall-festival-starters
I 2022/06/09 11:05:09 Fulltext indexing: FWPlsG_26NP5 https://www.criterion.com/current/posts/5808-fall-festival-starters
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FWPlsG_26NP5 (1735154948544397312)]} 0 2
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 577 words in URL https://www.criterion.com/current/posts/5808-fall-festival-starters [FWPlsG_26NP5]
Description: Fall Festival Starters | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7404 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 HostQueue forcing crawl-delay of 250 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 520, robots.delay = 0, ((waitig = 260) - (timeSinceLastAccess = 10)) = 250
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/films/28017-the-wicked-lady, 70959 bytes
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 61 LINKS FROM https://www.criterion.com/films/28017-the-wicked-lady, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/614becb858a85ddb393d426b5743891d.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/044cdbf494d19764a30df44d01451bc7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/bd2452146ea73af63f76a550204841f9/4yJd8DqIYDNQ5jaoJpFxnPPkbFjdAG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/images/3913-13e0f359ffe35a4c4e0598e2e9db3246/madonnaof7moons_1432_003_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/772b7e5558887c5daed761fe8fd4153d/Y7bOdy6FLY9xSaO7A52u4izRfBkmmC_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/3b1076b25c261c70b6e88b3a91e91b01.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/74be83402097d3f4c5d9e7331de31471/43YJwgdfANxgafSJNaTDUwGxR8Gait_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.amazon.com/dp/B00JJH2I3W - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e13cb552465c40bea2e1f7459f42e861.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://itunes.apple.com/us/movie/id835386915?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1812-2bdeb818c107e95738f59894990c22b2/oTM1jWGYaWx6KHPFFGsXiyVmbdpCPi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/the-wicked-lady?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/b59efc718fc3309a4eb76255310280b8/MpKeZ33lims6VNmUQPeUZ06u1HCgd5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/films/28017-the-wicked-lady
I 2022/06/09 11:05:09 Fulltext indexing: FLju_e_26NP5 https://www.criterion.com/films/28017-the-wicked-lady
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FLju_e_26NP5 (1735154948663934976)]} 0 1
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 286 words in URL https://www.criterion.com/films/28017-the-wicked-lady [FLju_e_26NP5]
Description: The Wicked Lady (1945) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3024 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/films/28019-madonna-of-the-seven-moons, 71648 bytes
I 2022/06/09 11:05:09 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 504, robots.delay = 0, ((waitig = 252) - (timeSinceLastAccess = 7)) = 245
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/films/28019-madonna-of-the-seven-moons, STACKING TIME = 1, PARSING TIME = 65
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/33a18e5b6c729a18d7c79c6c5510f3a5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5516ffe44b57e6de93b7d80b7c0a7936.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/images/3913-13e0f359ffe35a4c4e0598e2e9db3246/madonnaof7moons_1432_003_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5f3050443408e30c95355064b9d16259.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f448ba6e5e6976d113fbadfdbf654f1a.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/cbbe2b461ec0a30f180e6745ee9577e7/Oyu16EuKreAQCBgp029vYl7Nx6NV86_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1812-2bdeb818c107e95738f59894990c22b2/oTM1jWGYaWx6KHPFFGsXiyVmbdpCPi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/bd2452146ea73af63f76a550204841f9/4yJd8DqIYDNQ5jaoJpFxnPPkbFjdAG_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.amazon.com/dp/B00JE58M84 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/madonna-of-the-seven-moons?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ae888c07c9f50bca9b2df4ca8ab675b8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/224ad3784ebfb34f98b1af628337f3da/gf5q2Dxvw2rDGLoNCNOnF3L53EUKqK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://itunes.apple.com/us/movie/id826939330?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/e033d65a68d10236f332c06ce3725b59/5Pvpc7nsRpgo8lu3IQVlAeheME1g9w_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/films/28019-madonna-of-the-seven-moons
I 2022/06/09 11:05:09 Fulltext indexing: FLa3He_26NP5 https://www.criterion.com/films/28019-madonna-of-the-seven-moons
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FLa3He_26NP5 (1735154948956487680)]} 0 3
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 289 words in URL https://www.criterion.com/films/28019-madonna-of-the-seven-moons [FLa3He_26NP5]
Description: Madonna of the Seven Moons (1945) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3000 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 504, robots.delay = 0, ((waitig = 252) - (timeSinceLastAccess = 11)) = 241
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer, 76386 bytes
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer, STACKING TIME = 1, PARSING TIME = 10
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/f4a75fa9e4e3797bba7a179dd774d412/n4PstJirHzLRuR7mdBBOY7ntCG6Suo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/aaceb4cad8621ca9617f4c00b8ad4748/5xi0GwA3BbtdOq3TOnIIC17roOR2Pu_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/images/6801-6a0c932a832bfd1bc395b0884eecdc17/Screen_Shot_2016-03-09_at_2.18.04_PM_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/ccf8a3f2353002103ef420fd02fe2585/cE4nJ2rcnsqFoXZOGdTHQz1j9zLv3e_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/images/6640-e6a74f730289b5b381e1e83111c845dc/livmax_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/af3a424e036ce6064ba4c3b884c82128/cwe2k8wIo3C0zHyHipEgHMC2IsVcXq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED http://www.filmcomment.com/blog/interview-jan-troell/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books, 68580 bytes
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books, STACKING TIME = 1, PARSING TIME = 15
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer
I 2022/06/09 11:05:10 Fulltext indexing: FCB1VG_26NP5 https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer
I 2022/06/09 11:05:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FCB1VG_26NP5 (1735154949346557952)]} 0 4
I 2022/06/09 11:05:10 SWITCHBOARD *Indexed 450 words in URL https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer [FCB1VG_26NP5]
Description: Jan Troell, Enduring Film Pioneer | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5455 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:10 SWITCHBOARD Excluded 25 words in URL https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books
I 2022/06/09 11:05:10 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:05:10 Fulltext indexing: Enn3sG_26NP5 https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books
I 2022/06/09 11:05:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Enn3sG_26NP5 (1735154949379063808)]} 0 2
I 2022/06/09 11:05:10 SWITCHBOARD *Indexed 274 words in URL https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books [Enn3sG_26NP5]
Description: On It's Impossible to Learn to Plowby Reading Books | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3654 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:10 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema, 177782 bytes
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema, STACKING TIME = 2, PARSING TIME = 13
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/current/posts/801-the-lacemaker, 67077 bytes
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/801-the-lacemaker, STACKING TIME = 1, PARSING TIME = 10
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 473, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:10 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema
I 2022/06/09 11:05:10 Fulltext indexing: Em7eeG_26NP5 https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema
I 2022/06/09 11:05:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Em7eeG_26NP5 (1735154950026035200)]} 0 14
I 2022/06/09 11:05:10 SWITCHBOARD *Indexed 1527 words in URL https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema [Em7eeG_26NP5]
Description: Kill!: Rebel Samurai Cinema | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 24560 bytes |
LinkStorageTime: 16 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:10 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/801-the-lacemaker
I 2022/06/09 11:05:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[EjPnwG_26NP5 (1735154950051201024)]} 0 1
I 2022/06/09 11:05:10 Fulltext indexing: EjPnwG_26NP5 https://www.criterion.com/current/posts/801-the-lacemaker
I 2022/06/09 11:05:10 SWITCHBOARD *Indexed 432 words in URL https://www.criterion.com/current/posts/801-the-lacemaker [EjPnwG_26NP5]
Description: The Lacemaker | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5669 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:10 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:05:10 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/films/31785-once-upon-a-time-in-china-v, 68838 bytes
I 2022/06/09 11:05:10 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/LegWC4s0uQ2V7gyLB5vUk8PKN3soHqTiGHJRtfCs.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1987-beb71f216e96d1ff2d0f8231f5b8b975/44LVkvftLRcr5paF4enJfBFTe5mI2c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/yzWM7KVvJrSi1rjOfBNHtiCAotHAYB6AMYoqRyZq.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/31785-once-upon-a-time-in-china-v, STACKING TIME = 6, PARSING TIME = 14
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/e8d4cd2c5dd1b2541a3c8325a6d1805f/W5gKGPvtkm5evOJ9devGYL935KMAdy_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/b796fc21a57558358eb7f9e54fa5e6d0/2WXT8ULgPXXbikn39pHMz1m7dVbtt7_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/tXUnJYbZkEw00gUBu3JqZMEsT5PUfzfkaYukXytQ.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/wSkXZAgGlsAZuwZjPkBwJNtFTlGtpESWZyGxXSIq.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/YHKHgqSXtNOG2QX3R98RPaZwblXrPHoxSOe6oSL1.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/cd4ae6bdbaea9fd9c9aff1d69f924bc4/5wErYoFwVfkciAfnpRbFIhPqv7tIC5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/tL2y6baWLC9qRe0Xa6hlsVKbZKpfXMQNnF5JjZjn.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/films/31785-once-upon-a-time-in-china-v
I 2022/06/09 11:05:11 Fulltext indexing: EXDJ_e_26NP5 https://www.criterion.com/films/31785-once-upon-a-time-in-china-v
I 2022/06/09 11:05:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[EXDJ_e_26NP5 (1735154950220021760)]} 0 4
I 2022/06/09 11:05:11 SWITCHBOARD *Indexed 301 words in URL https://www.criterion.com/films/31785-once-upon-a-time-in-china-v [EXDJ_e_26NP5]
Description: Once Upon a Time in China V (1994) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3005 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:11 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:05:11 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:05:11 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:11 HTCACHE storing content of url https://www.criterion.com/current/posts/3605-three-reasons-the-bridge, 65954 bytes
I 2022/06/09 11:05:11 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/3605-three-reasons-the-bridge, STACKING TIME = 1, PARSING TIME = 9
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.youtube.com/embed/AqAQsYmu_1Q?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/films/dc0c6d72367b18727c846047f2b39cb6/JOpfLu2e0pJYqYoJ67UvpPXUgKcIP1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/posts/3605-three-reasons-the-bridge
I 2022/06/09 11:05:11 Fulltext indexing: DrKO6G_26NP5 https://www.criterion.com/current/posts/3605-three-reasons-the-bridge
I 2022/06/09 11:05:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DrKO6G_26NP5 (1735154950776815616)]} 0 2
I 2022/06/09 11:05:11 SWITCHBOARD *Indexed 217 words in URL https://www.criterion.com/current/posts/3605-three-reasons-the-bridge [DrKO6G_26NP5]
Description: Three Reasons: The Bridge | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2536 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:11 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=clouse-robert, 225276 bytes
I 2022/06/09 11:05:11 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=clouse-robert, STACKING TIME = 4, PARSING TIME = 82
I 2022/06/09 11:05:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/films/294fc27b4aa7b43a4f34fbce39a90e89/HVUGoVCQ8abK5Srs4VPKUfbR66QTG1_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1957-f90d4c48a2f932ffe7df386499f9477e/73k4EkSiXEfsdi097fieFBGdb39vlg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/films/d3ef5d03f0388465ff6d95625ee4e504/TPEJFhfV5tnvG022UXCwt2LOHhis7v_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=clouse-robert
I 2022/06/09 11:05:11 Fulltext indexing: DsMcPm_26NP5 https://www.criterion.com/shop/browse/list?director=clouse-robert
I 2022/06/09 11:05:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DsMcPm_26NP5 (1735154951059931136)]} 0 4
I 2022/06/09 11:05:11 SWITCHBOARD *Indexed 1203 words in URL https://www.criterion.com/shop/browse/list?director=clouse-robert [DsMcPm_26NP5]
Description: Robert Clouse films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12567 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:11 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=soukis-robert, 224790 bytes
I 2022/06/09 11:05:11 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:05:11 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=soukis-robert, STACKING TIME = 5, PARSING TIME = 36
I 2022/06/09 11:05:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/films/a40089af5a99e65997b7ab84b63c23e6/xwkezz5WNkgtavKsf03aC9CxPq4gTb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1809-a4a8b84c4cbcababe9073629fd726b50/S4JbdHupEZur0VszttgXmG8hjRdrG2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=soukis-robert
I 2022/06/09 11:05:12 Fulltext indexing: DneMom_26NP5 https://www.criterion.com/shop/browse/list?director=soukis-robert
I 2022/06/09 11:05:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DneMom_26NP5 (1735154951328366592)]} 0 2
I 2022/06/09 11:05:12 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=soukis-robert [DneMom_26NP5]
Description: Robert Soukis films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12525 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:12 HTCACHE storing content of url https://www.criterion.com/current/author/859-sean-gilman, 49572 bytes
I 2022/06/09 11:05:12 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/859-sean-gilman, STACKING TIME = 0, PARSING TIME = 7
I 2022/06/09 11:05:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DXC_GG_26NP5 (1735154951369261056)]} 0 1
I 2022/06/09 11:05:12 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/author/859-sean-gilman
I 2022/06/09 11:05:12 Fulltext indexing: DXC_GG_26NP5 https://www.criterion.com/current/author/859-sean-gilman
I 2022/06/09 11:05:12 SWITCHBOARD *Indexed 126 words in URL https://www.criterion.com/current/author/859-sean-gilman [DXC_GG_26NP5]
Description: Sean Gilman | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1479 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:12 HostQueue forcing crawl-delay of 166 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 458, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 84)) = 166
I 2022/06/09 11:05:12 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=dieterle-william, 224237 bytes
I 2022/06/09 11:05:12 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=dieterle-william, STACKING TIME = 1, PARSING TIME = 31
I 2022/06/09 11:05:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/films/711e752ce9b5c42995bb463693a0f371/RN7rkLPEIs51hWPn9yes29Zlzsj0cW_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 463, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:05:12 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=dieterle-william
I 2022/06/09 11:05:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DeGpkm_26NP5 (1735154951782400000)]} 0 2
I 2022/06/09 11:05:12 Fulltext indexing: DeGpkm_26NP5 https://www.criterion.com/shop/browse/list?director=dieterle-william
I 2022/06/09 11:05:12 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=dieterle-william [DeGpkm_26NP5]
Description: William Dieterle films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12494 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:12 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 463, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:12 HTCACHE storing content of url https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child, 70747 bytes
I 2022/06/09 11:05:12 REJECTED https://criterion-production.s3.amazonaws.com/quRlGcDfNCBO6BhcNNb1VaeKiJjpie.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.screendaily.com/reviews/zombi-child-cannes-review/5139570.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://film.avclub.com/more-zombies-and-a-new-downer-from-a-past-cannes-winne-1834839786 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://filmmakermagazine.com/107533-cannes-2019-dispatch-2-bacurau-zombi-child/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 SWITCHBOARD CRAWL: ADDED 60 LINKS FROM https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child, STACKING TIME = 9, PARSING TIME = 16
I 2022/06/09 11:05:12 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=cline-edward, 224209 bytes
I 2022/06/09 11:05:12 REJECTED https://www.telegraph.co.uk/films/0/zombi-child-review-disquieting-tale-voodoo-colonialism-la-francaise/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-5-haitian-zombis-insidious-plants-takashi-miike - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.quinzaine-realisateurs.com/en/film/zombi-child/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.hollywoodreporter.com/review/zombi-child-review-1210505 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 SWITCHBOARD Excluded 23 words in URL https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child
I 2022/06/09 11:05:12 Fulltext indexing: DI69TG_26NP5 https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child
I 2022/06/09 11:05:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DI69TG_26NP5 (1735154952223850496)]} 0 3
I 2022/06/09 11:05:12 SWITCHBOARD *Indexed 504 words in URL https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child [DI69TG_26NP5]
Description: Bertrand Bonello’s Zombi Child | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6025 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:12 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=cline-edward, STACKING TIME = 1, PARSING TIME = 114
I 2022/06/09 11:05:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/films/ed560ce125d981a74da5f0b112c643c4/L3qXe0Ml9IhoWPs1K4gtHCLOCKSxMe_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 461, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:05:13 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=cline-edward
I 2022/06/09 11:05:13 Fulltext indexing: DPM8Wm_26NP5 https://www.criterion.com/shop/browse/list?director=cline-edward
I 2022/06/09 11:05:13 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DPM8Wm_26NP5 (1735154952318222336)]} 0 2
I 2022/06/09 11:05:13 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=cline-edward [DPM8Wm_26NP5]
Description: Edward Cline films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12499 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:13 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=zemeckis-robert, 224178 bytes
I 2022/06/09 11:05:13 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=zemeckis-robert, STACKING TIME = 1, PARSING TIME = 29
I 2022/06/09 11:05:13 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:13 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)