I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/films/babb52f89b5488d00cb76a924d7e06eb/XPxkbzNVy36iDfcGUxaqpxFC6LJ0tI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=christensen-benjamin, STACKING TIME = 1, PARSING TIME = 86
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/films/61e9ee86f43dcf3bfecafe032299b9f3/WU2TTrhW3pd1lKQ9tk4zH4b20JSv40_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=litvak-anatole
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ucevem_26NP5 (1735154897508106240)]} 0 2
I 2022/06/09 11:04:20 Fulltext indexing: ucevem_26NP5 https://www.criterion.com/shop/browse?director=litvak-anatole
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 1186 words in URL https://www.criterion.com/shop/browse?director=litvak-anatole [ucevem_26NP5]
Description: Anatole Litvak films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12412 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=christensen-benjamin
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uXv-Sm_26NP5 (1735154897574166528)]} 0 2
I 2022/06/09 11:04:20 Fulltext indexing: uXv-Sm_26NP5 https://www.criterion.com/shop/browse/list?director=christensen-benjamin
I 2022/06/09 11:04:20 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=christensen-benjamin [uXv-Sm_26NP5]
Description: Benjamin Christensen films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12534 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 HTCACHE storing content of url https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you, 72518 bytes
I 2022/06/09 11:04:20 REJECTED https://www.screendaily.com/reviews/sorry-we-missed-you-cannes-review/5139521.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.theguardian.com/film/2019/may/16/sorry-we-missed-you-review-ken-loach - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://lwlies.com/festivals/sorry-we-missed-you-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://criterion-production.s3.amazonaws.com/HACTDUVL5NkTIZL8hjkJFG5LNjesp3.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-chang-pedro-almodovar-ken-loach-jessica-hausner-20190518-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.independent.co.uk/arts-entertainment/films/reviews/sorry-we-missed-you-cannes-film-festival-review-ken-loach-drama-a8917766.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://film.avclub.com/more-zombies-and-a-new-downer-from-a-past-cannes-winne-1834839786 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.festival-cannes.com/en/festival/films/sorry-we-missed-you - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you, STACKING TIME = 5, PARSING TIME = 9
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://cine-vue.com/2019/05/cannes-2019-sorry-we-missed-you-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.timeout.com/london/film/sorry-we-missed-you - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/sorry-we-missed-you-ken-loach-gig-economy-drama-kris-hitchen-debbie-honeywood-newcastle - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.hollywoodreporter.com/review/sorry-we-missed-you-cannes-2019-1211221 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://cineuropa.org/en/newsdetail/372672/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/embed/jLlVDpWSn0c?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.telegraph.co.uk/films/0/sorry-missed-review-kenloach-insightful-clear-eyed/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you
I 2022/06/09 11:04:20 Fulltext indexing: uS9gIG_26NP5 https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uS9gIG_26NP5 (1735154897645469696)]} 0 1
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 499 words in URL https://www.criterion.com/current/posts/6375-ken-loach-s-sorry-we-missed-you [uS9gIG_26NP5]
Description: Ken Loach’s Sorry We Missed You | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5988 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard, 66668 bytes
I 2022/06/09 11:04:21 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 446, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard, STACKING TIME = 3, PARSING TIME = 8
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/de1ffcb56beae0169f9e7b23ae7b9516.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1780-c489370ce256ad8a36a096bca4ed7bb4/9aNOPIxaT7oPmfJpfNfA6Bnm1Subqd_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ce8d7645f17327c4196bf6d407d80d9c.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/725e2cf2cb8ad819db5ae2aa50fbe3b8/nGEJ6EsfxvGN30AgK99Ylco5wtBGmf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/52b1d3054682fba202f2679beffb971a/QubSQmCC5J7OT6oFVavxjM711ui3DR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard
I 2022/06/09 11:04:21 Fulltext indexing: uMCee3_26NP5 https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard
I 2022/06/09 11:04:21 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uMCee3_26NP5 (1735154897919148032)]} 0 2
I 2022/06/09 11:04:21 SWITCHBOARD *Indexed 267 words in URL https://www.criterion.com/boxsets/494-eclipse-series-4-raymond-bernard [uMCee3_26NP5]
Description: Eclipse Series 4: Raymond Bernard | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4351 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=siegel-don, 225197 bytes
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=siegel-don, STACKING TIME = 1, PARSING TIME = 21
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1859-5a1d456ec7733e9205b555242f30548d/QydMAqke3toeG4h4BOBYwNcOHf6z3W_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/1fff3ac634ee3a07e07e071a03276f29/aYkEYStkCF5KUvvjpd3gN3fn5XYZP6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/fd68e938462eb582cb0de5a34be73d61/AJyDlyL04mfIBYIkBs53hojPoQLt0A_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=siegel-don
I 2022/06/09 11:04:21 Fulltext indexing: uSmRHm_26NP5 https://www.criterion.com/shop/browse/list?director=siegel-don
I 2022/06/09 11:04:21 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uSmRHm_26NP5 (1735154898085871616)]} 0 2
I 2022/06/09 11:04:21 SWITCHBOARD *Indexed 1201 words in URL https://www.criterion.com/shop/browse/list?director=siegel-don [uSmRHm_26NP5]
Description: Don Siegel films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12530 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:21 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 450, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:21 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 450, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys, 121014 bytes
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 60 LINKS FROM https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys, STACKING TIME = 9, PARSING TIME = 12
I 2022/06/09 11:04:21 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/3ed41f79deffcb3052099b02c9660e9b/zQOZgJoUsBEgi8arpM5w8aT22vGo6W_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/f8bf41c3e8d2266f423881ceb3159429/58bZDer5maXJjg6GDgD8Tyrr6ZZAuT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/images/3766-ff6aa079e27077432f53409f08cb425d/kaurismakicowboys_1484_006_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/8f12ceb5a2e46f5f1550942e055ef1af/5yl46GfrudlcteVtODCZveKlbIlys1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo, 224768 bytes
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo, STACKING TIME = 1, PARSING TIME = 54
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/8d41805aabbf9c68049033a9e54fc4ca/5HBkbTpi2BcdDfPwUmTIH76T5jR9jA_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/films/df4a3828538e371bb514327fb5be561f/URBYiTnt4otEoXtvtr2zbHA5ykxgkB_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:21 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@e6edfd3[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wa(7.7.3):C23:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772640046}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wb(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, 
java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772645671}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wc(7.7.3):C17:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650026}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wd(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650982}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_we(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772656193}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772661758}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
|
||
I 2022/06/09 11:04:21 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
|
||
I 2022/06/09 11:04:21 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 452, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:21 HTCACHE storing content of url https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book, 74910 bytes
|
||
I 2022/06/09 11:04:21 SWITCHBOARD CRAWL: ADDED 81 LINKS FROM https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book, STACKING TIME = 5, PARSING TIME = 14
|
||
I 2022/06/09 11:04:21 REJECTED https://www.telegraph.co.uk/films/2018/05/12/jean-luc-godard-facetime-press-conference-cannes-filming-boring/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.theglobeandmail.com/arts/film/article-cannes-diary-jean-luc-godard-says-goodbye-to-language-once-and-for/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://time.com/5275703/cannes-review-jean-luc-godard-the-image-book/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.villagevoice.com/2018/05/24/a-tale-of-many-godards/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.screendaily.com/reviews/the-image-book-cannes-review/5129209.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://criterion-production.s3.amazonaws.com/xpJzxC3KGLNkBjQu3UYoFw6YU2GTJj.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://film.avclub.com/jean-luc-godard-returns-to-cannes-to-make-a-dunce-out-o-1825979305 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED http://www.middleeasteye.net/in-depth/features/cannes-2018-middle-east-takes-home-jury-prize-1960489906 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-5-changless-change-jean-luc-godard-and-jia-zhangke - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED http://variety.com/2018/film/reviews/the-image-book-review-jean-luc-godard-1202807089/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://thefilmstage.com/reviews/cannes-review-jean-luc-godards-the-image-book-displays-an-infuriating-stimulating-love-of-cinema/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-cannes-day-four/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED http://desistfilm.com/cannes-2018-le-livre-dimage-by-jean-luc-godard/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/embed/FAk3c9OM8ZQ?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED http://variety.com/2018/film/global/jean-luc-godard-to-adapt-the-image-book-into-traveling-exhibit-star-in-a-vendredi-robinson-exclusive-1202805535/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354276 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED http://www.indiewire.com/2018/05/the-image-book-review-jean-luc-godard-cannes-1201963343/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/embed/TWFmQbrAYqE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.timeout.com/london/film/the-image-book - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.slantmagazine.com/film/review/the-image-book - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://filmmakermagazine.com/105332-cannes-2018-dispatch-3-the-image-book/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.theguardian.com/film/2018/may/11/the-image-book-review-jean-luc-godard-cannes-2018 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://lundi.am/Vent-d-ouest-JL-Godard - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.hollywoodreporter.com/review/image-book-review-1111185 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.festival-cannes.com/en/festival/films/le-livre-d-image - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.pastemagazine.com/articles/2018/05/le-livre-dimage-the-image-book.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys
|
||
I 2022/06/09 11:04:21 REJECTED https://www.thewrap.com/the-image-book-film-review-once-again-jean-luc-godard-messes-with-viewers-heads/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.rogerebert.com/cannes/cannes-2018-the-image-book-cold-war - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 REJECTED https://mubi.com/notebook/posts/the-chamber-piece-an-interview-with-fabrice-aragno - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:21 Fulltext indexing: tyx3TG_26NP5 https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys
|
||
I 2022/06/09 11:04:21 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tyx3TG_26NP5 (1735154898797854720)]} 0 12
|
||
I 2022/06/09 11:04:21 SWITCHBOARD *Indexed 957 words in URL https://www.criterion.com/current/posts/2024-eclipse-series-29-aki-kaurism-ki-s-leningrad-cowboys [tyx3TG_26NP5]
|
||
Description: Eclipse Series 29: Aki Kaurismäki’s Leningrad Cowboys | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 13086 bytes |
|
||
LinkStorageTime: 17 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo
|
||
I 2022/06/09 11:04:22 Fulltext indexing: uHi4Lm_26NP5 https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo
|
||
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uHi4Lm_26NP5 (1735154898871255040)]} 0 4
|
||
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=bertolucci-bernardo [uHi4Lm_26NP5]
|
||
Description: Bernardo Bertolucci films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12537 bytes |
|
||
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:22 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
|
||
I 2022/06/09 11:04:22 HTCACHE storing content of url https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro, 73320 bytes
|
||
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book
|
||
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tyYgsG_26NP5 (1735154898942558208)]} 0 2
|
||
I 2022/06/09 11:04:22 Fulltext indexing: tyYgsG_26NP5 https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book
|
||
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 517 words in URL https://www.criterion.com/current/posts/5657-jean-luc-godard-s-the-image-book [tyYgsG_26NP5]
|
||
Description: Jean-Luc Godard’s The Image Book | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 6336 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:22 SWITCHBOARD CRAWL: ADDED 78 LINKS FROM https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro, STACKING TIME = 5, PARSING TIME = 14
|
||
I 2022/06/09 11:04:22 REJECTED https://thefilmstage.com/reviews/cannes-review-alice-rohrwachers-talents-fully-bloom-with-the-masterful-lazzaro-felice/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://lwlies.com/festivals/happy-lazzaro-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/embed/30KW3i3bxEo?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.rogerebert.com/cannes/cannes-2018-3-faces-happy-as-lazzaro - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://theplaylist.net/lazzaro-felice-alice-rohrwacher-eview-20180516/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://variety.com/2018/film/reviews/happy-as-lazzaro-review-1202808832/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://desistfilm.com/cannes-2018-happy-as-lazzaro-by-alice-rohrwarcher/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.villagevoice.com/2018/05/17/lazarus-come-forth-on-alice-rohrwachers-cannes-stunner-lazzaro-felice/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.hollywoodreporter.com/review/happy-as-lazzaro-lazzaro-felice-review-1111486 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.thewrap.com/happy-as-lazzaro-film-review-alice-rohrwacher-charts-the-course-of-a-holy-fool/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354354 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.theguardian.com/film/2018/may/14/happy-as-lazzaro-review-cannes-alice-rohrwacher-wonders-tobacco-sharecroppers - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://icsfilm.org/reviews/cannes-2018-review-lazzaro-felice-alice-rohrwacher/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://www.indiewire.com/2018/05/happy-as-lazzaro-review-alice-rohrwacher-cannes-2018-1201964121/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://www.anothergaze.com/cannes-review-alice-rohrwachers-happy-lazzaro-lazzaro-felice-feminist/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://filmmakermagazine.com/105338-cannes-2018-dispatch-4-shoplifters-girl-happy-as-lazzaro/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.screendaily.com/reviews/happy-as-lazzaro-cannes-review/5129305.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.timeout.com/london/film/happy-as-lazzaro - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://criterion-production.s3.amazonaws.com/5jKpSEOvYO6bPwgjJzPQpHH09IjISm.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.jigsawlounge.co.uk/film/reviews/cannes2018/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.festival-cannes.com/en/festival/films/lazzaro-felice - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-7-two-gentle-competitors-and-war-s-dirty-work - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://film.avclub.com/spike-lee-teams-up-with-jordan-peele-for-the-funny-poi-1826042384 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://cannes-ratings.herokuapp.com/Cannes2018 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://www.indiewire.com/2018/05/netflix-cannes-happy-as-lazarro-girl-1201966537/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro
|
||
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[txjKrG_26NP5 (1735154898985549824)]} 0 1
|
||
I 2022/06/09 11:04:22 Fulltext indexing: txjKrG_26NP5 https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro
|
||
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 503 words in URL https://www.criterion.com/current/posts/5665-alice-rohrwacher-s-happy-as-lazzaro [txjKrG_26NP5]
|
||
Description: Alice Rohrwacher’s Happy as Lazzaro | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 5793 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:22 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 445, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:22 HTCACHE storing content of url https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell, 76851 bytes
|
||
I 2022/06/09 11:04:22 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell, STACKING TIME = 1, PARSING TIME = 11
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-white-heaven-in-hell?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/8ce45228e2f3a463d6c77bb4be8fa192/EaTlysvdcJsvJJ3tP1pUrBI2UN2cEl_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/17177cec9104c935b7ee57a6bf36c178.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.amazon.com/dp/B01M5LJOEO - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://itunes.apple.com/us/movie/id1170771456?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/9daffdb90cdc6bee986834ed46660b66.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/a70ff6a4b7f88021a257d0be6981d057.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell
I 2022/06/09 11:04:22 Fulltext indexing: tordge_26NP5 https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tordge_26NP5 (1735154899239305216)]} 0 6
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 344 words in URL https://www.criterion.com/films/28727-lone-wolf-and-cub-white-heaven-in-hell [tordge_26NP5]
Description: Lone Wolf and Cub: White Heaven in Hell (1974) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3800 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:22 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 442, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:22 HTCACHE storing content of url https://www.criterion.com/current/posts/905-the-last-picture-show, 82679 bytes
I 2022/06/09 11:04:22 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/905-the-last-picture-show, STACKING TIME = 1, PARSING TIME = 51
I 2022/06/09 11:04:22 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 437, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:22 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/905-the-last-picture-show
I 2022/06/09 11:04:22 Fulltext indexing: tbPo1G_26NP5 https://www.criterion.com/current/posts/905-the-last-picture-show
I 2022/06/09 11:04:22 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tbPo1G_26NP5 (1735154899703824384)]} 0 2
I 2022/06/09 11:04:22 SWITCHBOARD *Indexed 498 words in URL https://www.criterion.com/current/posts/905-the-last-picture-show [tbPo1G_26NP5]
Description: The Last Picture Show | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6570 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:22 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=forsyth-bill, 224164 bytes
I 2022/06/09 11:04:22 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=forsyth-bill, STACKING TIME = 1, PARSING TIME = 23
I 2022/06/09 11:04:22 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://s3.amazonaws.com/criterion-production/films/404d9791922b336d3015f1500ee014eb/tG4CJ9c3PbmNziiX4KClVc9Pj6MsDU_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:22 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=forsyth-bill
I 2022/06/09 11:04:23 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[thvE3m_26NP5 (1735154899898859520)]} 0 2
I 2022/06/09 11:04:23 Fulltext indexing: thvE3m_26NP5 https://www.criterion.com/shop/browse/list?director=forsyth-bill
I 2022/06/09 11:04:23 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=forsyth-bill [thvE3m_26NP5]
Description: Bill Forsyth films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12475 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:23 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 440, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:23 HTCACHE storing content of url https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde, 66944 bytes
I 2022/06/09 11:04:23 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde, STACKING TIME = 2, PARSING TIME = 6
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/images/4369-a829ab08178ba2ab415dda3775dfdbf2/img_current_1188_094_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde
I 2022/06/09 11:04:23 Fulltext indexing: s-lmzG_26NP5 https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde
I 2022/06/09 11:04:23 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ozerov-yuri, 224265 bytes
I 2022/06/09 11:04:23 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[s-lmzG_26NP5 (1735154900279492608)]} 0 7
I 2022/06/09 11:04:23 SWITCHBOARD *Indexed 771 words in URL https://www.criterion.com/current/posts/764-fanfan-la-tulipe-en-garde [s-lmzG_26NP5]
Description: Fanfan la Tulipe: En Garde! | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 10350 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:23 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 438, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:23 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=ozerov-yuri, STACKING TIME = 1, PARSING TIME = 31
I 2022/06/09 11:04:23 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=ozerov-yuri
I 2022/06/09 11:04:23 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[tQd4Jm_26NP5 (1735154900428390400)]} 0 2
I 2022/06/09 11:04:23 Fulltext indexing: tQd4Jm_26NP5 https://www.criterion.com/shop/browse/list?director=ozerov-yuri
I 2022/06/09 11:04:23 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=ozerov-yuri [tQd4Jm_26NP5]
Description: Juri Ozerov films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12571 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:23 HTCACHE storing content of url https://www.criterion.com/current/author/703-jyoti-mistry, 49912 bytes
I 2022/06/09 11:04:23 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/703-jyoti-mistry, STACKING TIME = 1, PARSING TIME = 4
I 2022/06/09 11:04:23 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/author/703-jyoti-mistry
I 2022/06/09 11:04:23 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[s2Ej1G_26NP5 (1735154900456701952)]} 0 1
I 2022/06/09 11:04:23 Fulltext indexing: s2Ej1G_26NP5 https://www.criterion.com/current/author/703-jyoti-mistry
I 2022/06/09 11:04:23 SWITCHBOARD *Indexed 132 words in URL https://www.criterion.com/current/author/703-jyoti-mistry [s2Ej1G_26NP5]
Description: Jyoti Mistry | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1606 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:23 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 433, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:23 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 433, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:23 HTCACHE storing content of url https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film, 87179 bytes
I 2022/06/09 11:04:23 SWITCHBOARD CRAWL: ADDED 63 LINKS FROM https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film, STACKING TIME = 2, PARSING TIME = 8
I 2022/06/09 11:04:23 REJECTED https://criterion-production.s3.amazonaws.com/z03m9YUmrPCKLCSTxvg3l8J3XIUtDL.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7513-/AlQgKHMrEYtBYqY3vbFGVgCDASqo9R_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://criterion-production.s3.amazonaws.com/Trpy7wGy1YAlukthn4k7bSwTCWGYZN.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7003-/LPfLZBD0q2OFWw19DtZJgFLtn232KC_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://criterion-production.s3.amazonaws.com/JHSMIJKcny3omzCBbyMLDY41iArADH.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/films/d625b0e7f179fa73f1d10d4ff66873a6/KwcufB3P2S9l5e6g7eQitniSmR32hr_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://criterion-production.s3.amazonaws.com/uomtCBcQtaGxubA9zAv30rjKvmUbLO.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:23 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 HTCACHE storing content of url https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel, 72235 bytes
I 2022/06/09 11:04:24 SWITCHBOARD CRAWL: ADDED 76 LINKS FROM https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel, STACKING TIME = 5, PARSING TIME = 10
I 2022/06/09 11:04:24 REJECTED https://film.avclub.com/mads-mikkelsen-endures-a-cold-crucible-but-its-cold-wa-1825957205 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.villagevoice.com/2018/05/14/music-madness-and-memory-at-cannes-part-two-cold-war-sorry-angel-and-the-mysteries-of-love/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-12-a-generational-romance-and-closing-fragments - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://lwlies.com/festivals/sorry-angel-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.slantmagazine.com/house/article/cannes-film-review-yomeddine-leto-sorry-angel - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.festival-cannes.com/en/festival/films/plaire-aimer-et-courir-vite - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://criterion-production.s3.amazonaws.com/iBOHAxDZL0mZ3DmRrcjkupNQRbGaSu.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.rogerebert.com/cannes/cannes-2018-leto-sorry-angel-border - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.telegraph.co.uk/films/2018/05/12/sorry-angel-review-lovely-bittersweet-gay-romance-drippingly/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://thefilmstage.com/reviews/cannes-review-christophe-honores-sorry-angel-is-a-rote-but-practiced-age-gap-romance/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/embed/aEclOo9XHHY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.screendaily.com/reviews/sorry-angel-cannes-review/5129026.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.thewrap.com/sorry-angel-film-review-aids-drama-explores-quiet-places/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://variety.com/2018/film/global/christophe-honore-on-sorry-angel-in-france-were-blessed-as-filmmakers-1202807288/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354138 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.theguardian.com/film/2018/may/10/sorry-angel-apology-not-accepted-for-tedious-age-gap-gay-romance - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://cine-vue.com/2018/05/cannes-2018-sorry-angel-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.hollywoodreporter.com/review/sorry-angel-review-1109656 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.indiewire.com/2018/05/sorry-angel-review-christophe-honore-cannes-2018-1201962227/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://icsfilm.org/reviews/cannes-2018-review-sorry-angel-christophe-honore/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/sorry-angel-christophe-honore-vincent-lacoste-gay-life-90s - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.vanityfair.com/hollywood/2018/05/sorry-angel-christophe-honore-cannes-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://variety.com/2018/film/reviews/sorry-angel-review-plaire-aimer-et-courir-vite-1202805122/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.timeout.com/london/film/sorry-angel - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film
I 2022/06/09 11:04:24 Fulltext indexing: smkItG_26NP5 https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film
I 2022/06/09 11:04:24 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 427, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[smkItG_26NP5 (1735154901082701824)]} 0 13
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 1280 words in URL https://www.criterion.com/current/posts/7513-beasts-of-no-nation-a-different-kind-of-african-war-film [smkItG_26NP5]
Description: Beasts of No Nation: A Different Kind of African War Film | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 17028 bytes |
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel
I 2022/06/09 11:04:24 Fulltext indexing: sUkDeG_26NP5 https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[sUkDeG_26NP5 (1735154901115207680)]} 0 1
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 430 words in URL https://www.criterion.com/current/posts/5656-christophe-honor-s-sorry-angel [sUkDeG_26NP5]
Description: Christophe Honoré’s Sorry Angel | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4766 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:24 HTCACHE storing content of url https://www.criterion.com/current/author/97-james-harvey, 49717 bytes
I 2022/06/09 11:04:24 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/97-james-harvey, STACKING TIME = 1, PARSING TIME = 3
I 2022/06/09 11:04:24 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/current/author/97-james-harvey
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[sO73bG_26NP5 (1735154901325971456)]} 0 1
I 2022/06/09 11:04:24 Fulltext indexing: sO73bG_26NP5 https://www.criterion.com/current/author/97-james-harvey
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 142 words in URL https://www.criterion.com/current/author/97-james-harvey [sO73bG_26NP5]
Description: James Harvey | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2225 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:24 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 424, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:24 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 424, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:24 HTCACHE storing content of url https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork, 72835 bytes
I 2022/06/09 11:04:24 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:04:24 REJECTED http://www.bam.org/taipeistory - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/films/b8aa5bf1a2e514e781a4974edeb7f07a/khYYYxHBAKOhkTFow0fq2YBsI73A0F_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.nytimes.com/2017/03/16/movies/taipei-story-review.html?_r=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/images/8116-d54635daa17b8f8473f68a57c2177797/taipei_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.villagevoice.com/film/past-and-future-tug-at-an-unstable-present-in-a-restored-masterwork-by-edward-yang-9769270 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[r64W1G_26NP5 (1735154901727576064)]} 0 1
I 2022/06/09 11:04:24 Fulltext indexing: r64W1G_26NP5 https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 320 words in URL https://www.criterion.com/current/posts/4466-critics-hail-a-newly-restored-taiwanese-masterwork [r64W1G_26NP5]
Description: Critics Hail a Newly Restored Taiwanese Masterwork | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3754 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:24 HTCACHE storing content of url https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things, 79796 bytes
I 2022/06/09 11:04:24 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things, STACKING TIME = 2, PARSING TIME = 7
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/images/5281-09e07817d94e0955561f0ba6b656cab3/28005id_146_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/films/5f30d2a6f02704c28b2b31a9331e1f7c/9th6Iqsdh4VJpysPfdoOhLkU6YRLB9_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:24 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 418, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:24 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things
I 2022/06/09 11:04:24 Fulltext indexing: rwbkAG_26NP5 https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things
I 2022/06/09 11:04:24 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[rwbkAG_26NP5 (1735154901898493952)]} 0 3
I 2022/06/09 11:04:24 SWITCHBOARD *Indexed 1020 words in URL https://www.criterion.com/current/posts/3270-y-tu-mama-tambien-dirty-happy-things [rwbkAG_26NP5]
Description: Y tu mamá también: Dirty Happy Things | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14336 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:25 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 418, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:25 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 418, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:25 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=coppola, 224180 bytes
I 2022/06/09 11:04:25 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:04:25 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:04:25 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=coppola, STACKING TIME = 5, PARSING TIME = 39
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/films/28b1b85a6f4119f34d6165c5256037f6/CGCMTOMzrmzXe1qXr6Mnpt4jXgsgi0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 422, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:25 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=coppola
I 2022/06/09 11:04:25 Fulltext indexing: rgZMmm_26NP5 https://www.criterion.com/shop/browse/list?director=coppola
I 2022/06/09 11:04:25 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[rgZMmm_26NP5 (1735154902752034816)]} 0 6
I 2022/06/09 11:04:25 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=coppola [rgZMmm_26NP5]
Description: Francis Ford Coppola films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12489 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:25 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=audiard-jacques, 224159 bytes
I 2022/06/09 11:04:25 HTCACHE storing content of url https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike, 70793 bytes
I 2022/06/09 11:04:25 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:25 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=audiard-jacques, STACKING TIME = 0, PARSING TIME = 36
I 2022/06/09 11:04:25 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/films/8d1c34bd6af4fa432ec28aebc0ad55d6/FMt4h5XnG7oyewPgkLZPxdrXDUPq0v_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 426, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/images/5073-6d8164bf2a1164be530153e3404dc049/KWAB_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/films/ed560ce125d981a74da5f0b112c643c4/sG8YTAGNjz7HsWG1RS1uUi54VYccwx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike, STACKING TIME = 5, PARSING TIME = 21
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.youtube.com/embed/jhVq-RmbA34?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:25 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=audiard-jacques
I 2022/06/09 11:04:25 Fulltext indexing: rNGwYm_26NP5 https://www.criterion.com/shop/browse/list?director=audiard-jacques
I 2022/06/09 11:04:25 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[rNGwYm_26NP5 (1735154903011033088)]} 0 8
|
||
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse/list?director=audiard-jacques [rNGwYm_26NP5]
|
||
Description: Jacques Audiard films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12457 bytes |
|
||
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike
|
||
I 2022/06/09 11:04:26 Fulltext indexing: qW-f6G_26NP5 https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike
|
||
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[qW-f6G_26NP5 (1735154903082336256)]} 0 1
|
||
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 303 words in URL https://www.criterion.com/current/posts/2654-the-dardennes-and-the-kid-with-a-bike [qW-f6G_26NP5]
|
||
Description: The Dardennes and The Kid with a Bike | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3468 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:26 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 426, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:26 HTCACHE storing content of url https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks, 67721 bytes
|
||
I 2022/06/09 11:04:26 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks, STACKING TIME = 1, PARSING TIME = 9
|
||
I 2022/06/09 11:04:26 REJECTED https://www.filmstruck.com/us/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://player.vimeo.com/video/243748992 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks
|
||
I 2022/06/09 11:04:26 Fulltext indexing: pZIkoG_26NP5 https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks
|
||
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[pZIkoG_26NP5 (1735154903244865536)]} 0 1
|
||
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 261 words in URL https://www.criterion.com/current/posts/5143-victor-sj-str-m-s-stirring-flashbacks [pZIkoG_26NP5]
|
||
Description: Victor Sjöström’s Stirring Flashbacks | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3314 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:26 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ferreri-marco, 224232 bytes
|
||
I 2022/06/09 11:04:26 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=ferreri-marco, STACKING TIME = 1, PARSING TIME = 19
|
||
I 2022/06/09 11:04:26 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/films/898808a9204021a47c57b59173c72e75/ttGzJdB52PJ2tfhbXB89WxPhDBytPI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=ferreri-marco
|
||
I 2022/06/09 11:04:26 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 426, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:26 Fulltext indexing: paAX9m_26NP5 https://www.criterion.com/shop/browse/list?director=ferreri-marco
|
||
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[paAX9m_26NP5 (1735154903452483584)]} 0 3
|
||
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=ferreri-marco [paAX9m_26NP5]
|
||
Description: Marco Ferreri films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12497 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:26 HTCACHE storing content of url https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota, 68006 bytes
|
||
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/embed/HoqCsMUoN1c?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota, STACKING TIME = 6, PARSING TIME = 12
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED http://www.heightstheater.com/film/the-phantom-carriage/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/images/6322-6a8d8311976bfaf0f3648d49a8b6dfb4/phantom_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota
|
||
I 2022/06/09 11:04:26 Fulltext indexing: pNR6yG_26NP5 https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota
|
||
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[pNR6yG_26NP5 (1735154903546855424)]} 0 1
|
||
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 288 words in URL https://www.criterion.com/current/posts/3803-sj-str-m-haunts-in-minnesota [pNR6yG_26NP5]
|
||
Description: Sjöström Haunts in Minnesota | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3323 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:26 HostQueue forcing crawl-delay of 199 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 425, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 51)) = 199
|
||
I 2022/06/09 11:04:26 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ismail-usmar, 224768 bytes
|
||
I 2022/06/09 11:04:26 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=ismail-usmar, STACKING TIME = 1, PARSING TIME = 19
|
||
I 2022/06/09 11:04:26 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/films/40e3648637e296fc80b9a4d526f35951/uWUFo8TqJ9uMAr8esMMCqoDE4sgpMx_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1961-3b7b159a9ae3d500459e38e69c96a917/9P9MeXzolFQyY5OhK2XcwZ02e0Y0ZC_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:26 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 425, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:26 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=ismail-usmar
|
||
I 2022/06/09 11:04:26 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ovy1Lm_26NP5 (1735154904025006080)]} 0 2
|
||
I 2022/06/09 11:04:26 Fulltext indexing: ovy1Lm_26NP5 https://www.criterion.com/shop/browse/list?director=ismail-usmar
|
||
I 2022/06/09 11:04:26 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=ismail-usmar [ovy1Lm_26NP5]
|
||
Description: Usmar Ismail films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12513 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:26 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
|
||
I 2022/06/09 11:04:27 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:04:27 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@1ee4a635[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wa(7.7.3):C23:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772640046}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wb(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, 
java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772645671}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wc(7.7.3):C17:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650026}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wd(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650982}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_we(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772656193}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772661758}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wg(7.7.3):C15:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772665682}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wh(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, 
java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772665845}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wi(7.7.3):C6:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772666996}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
|
||
I 2022/06/09 11:04:27 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
|
||
I 2022/06/09 11:04:27 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 425, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
|
||
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=cazals-felipe, 224237 bytes
|
||
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=cazals-felipe, STACKING TIME = 1, PARSING TIME = 37
|
||
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/0edac3ad6fa837bdd9abe77bea6012f3/h4SRYVXnUjLhqLaGqxuhcGeEhix8Yq_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 426, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:27 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=cazals-felipe
|
||
I 2022/06/09 11:04:27 Fulltext indexing: n2_llm_26NP5 https://www.criterion.com/shop/browse/list?director=cazals-felipe
|
||
I 2022/06/09 11:04:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[n2_llm_26NP5 (1735154904577605632)]} 0 5
|
||
I 2022/06/09 11:04:27 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=cazals-felipe [n2_llm_26NP5]
|
||
Description: Felipe Cazals films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12500 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=jarman-derek, 224149 bytes
|
||
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=jarman-derek, STACKING TIME = 2, PARSING TIME = 40
|
||
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/f20e8f5bdc458777cda90775598ef89c/aiexRhqINL31ogh7SulZjNkvJQHnRD_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=jarman-derek
I 2022/06/09 11:04:27 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nrKyrm_26NP5 (1735154904777883648)]} 0 2
I 2022/06/09 11:04:27 Fulltext indexing: nrKyrm_26NP5 https://www.criterion.com/shop/browse/list?director=jarman-derek
I 2022/06/09 11:04:27 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=jarman-derek [nrKyrm_26NP5]
Description: Derek Jarman films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12473 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons, 76815 bytes
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/films/27977-douce, 71365 bytes
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons, STACKING TIME = 1, PARSING TIME = 18
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/3c9f09ce0317fdbf2199438da624ef26/wQrhudyvLgRCg2bvWKGyePtmhI6yh6_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://itunes.apple.com/us/movie/id1170340182?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.amazon.com/dp/B01MQFA5CN - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/7c02624be1ea9c11cf46bd17cf27e9aa.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/87903afea7478207e48d1481cc4abe56.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-baby-cart-in-the-land-of-demons?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/e0d98d9b85c97e837770345f98c9f025.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/27977-douce, STACKING TIME = 1, PARSING TIME = 107
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/douce?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1816-e188e2102b63387dbe23fc67edb6beea/DWwbQG5RL4lfG3HYvO9uUZ78amems4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/9541-7926a5f63159cd1b4241ec268ed9d6c9/douce_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/106b07b3cc434c73b4ae16e8ed87669d.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f7d00ab6307c1ec81109bf1425893316.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/bbf16d8bcccc2dc396438789b2f31af7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/460b59ca750bcf35c4b72400db769627.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/films/54383fec44d7e8782575c44af765b4b3/zvkyPLbQ24L9VjH6jP1r4Q0jCYbVMY_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/9518-74f3b68e17d1c132c4fb7dd0555570cc/Current_29404id_015_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons
I 2022/06/09 11:04:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nZ4dNe_26NP5 (1735154904990744576)]} 0 1
I 2022/06/09 11:04:27 Fulltext indexing: nZ4dNe_26NP5 https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons
I 2022/06/09 11:04:27 SWITCHBOARD *Indexed 342 words in URL https://www.criterion.com/films/28726-lone-wolf-and-cub-baby-cart-in-the-land-of-demons [nZ4dNe_26NP5]
Description: Lone Wolf and Cub: Baby Cart in the Land of Demons (1973) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3804 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:27 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/films/27977-douce
I 2022/06/09 11:04:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nSOfce_26NP5 (1735154905009618944)]} 0 1
I 2022/06/09 11:04:27 Fulltext indexing: nSOfce_26NP5 https://www.criterion.com/films/27977-douce
I 2022/06/09 11:04:27 SWITCHBOARD *Indexed 269 words in URL https://www.criterion.com/films/27977-douce [nSOfce_26NP5]
Description: Douce (1943) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3106 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:27 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:27 HTCACHE storing content of url https://www.criterion.com/current/posts/1170-bergman-and-i, 85838 bytes
I 2022/06/09 11:04:27 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/1170-bergman-and-i, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7820-/YKRO4oMLJouJhzJ0UVxMAsXpd2O4sF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7799-/BcSOGzRAmQVrNktlZ6a8juCv8YVP7g_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7821-/DIxDSXUzZ7yk0yAo3JyIs4aQdkSJqq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/images/4446-87cf72e6b02a2e66949e31ff7289fc20/bergmanisland_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/tout_image/7812-/cfFE90t4MLOwAR88lqDIdl2lvUw6SX_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/1170-bergman-and-i
I 2022/06/09 11:04:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nPg55G_26NP5 (1735154905126010880)]} 0 2
I 2022/06/09 11:04:28 Fulltext indexing: nPg55G_26NP5 https://www.criterion.com/current/posts/1170-bergman-and-i
I 2022/06/09 11:04:28 SWITCHBOARD *Indexed 477 words in URL https://www.criterion.com/current/posts/1170-bergman-and-i [nPg55G_26NP5]
Description: Bergman and I | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6238 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:28 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 428, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:28 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 428, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:28 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=voss-kurt, 224171 bytes
I 2022/06/09 11:04:28 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=voss-kurt, STACKING TIME = 1, PARSING TIME = 39
I 2022/06/09 11:04:28 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://s3.amazonaws.com/criterion-production/films/2131124bf11dd19cde56a791c8fc54f9/o5Y9AGkM9iZWMr9AQm46QYzEAvubcV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=voss-kurt
I 2022/06/09 11:04:28 Fulltext indexing: nLywem_26NP5 https://www.criterion.com/shop/browse/list?director=voss-kurt
I 2022/06/09 11:04:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nLywem_26NP5 (1735154905735233536)]} 0 2
I 2022/06/09 11:04:28 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=voss-kurt [nLywem_26NP5]
Description: Kurt Voss films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12487 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:28 HostQueue forcing crawl-delay of 187 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 63)) = 187
I 2022/06/09 11:04:28 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mann-anthony, 224159 bytes
I 2022/06/09 11:04:28 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=mann-anthony, STACKING TIME = 2, PARSING TIME = 31
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://s3.amazonaws.com/criterion-production/films/27347e78f7beca764a3920161b531e11/9TTjP15bKoZruNTlx8uvvB6oPAMC2O_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:28 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 432, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:28 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=mann-anthony
I 2022/06/09 11:04:28 Fulltext indexing: m7bKdm_26NP5 https://www.criterion.com/shop/browse/list?director=mann-anthony
I 2022/06/09 11:04:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[m7bKdm_26NP5 (1735154906116915200)]} 0 2
I 2022/06/09 11:04:28 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=mann-anthony [m7bKdm_26NP5]
Description: Anthony Mann films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12465 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=holland-agnieszka, 224213 bytes
I 2022/06/09 11:04:29 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=holland-agnieszka, STACKING TIME = 2, PARSING TIME = 27
I 2022/06/09 11:04:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/films/1975c824e3f44554d1755f66ac8e9901/op5e76062WLo9yZJeiOlU7gNFEjL5m_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:29 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 434, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=berger-ludwig, 224196 bytes
|
||
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife, 95467 bytes
|
||
I 2022/06/09 11:04:29 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=holland-agnieszka
|
||
I 2022/06/09 11:04:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=berger-ludwig, STACKING TIME = 12, PARSING TIME = 39
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/films/41132bea28bb3bbea12d52e78c20b378/kl6ebsg1AK3m1ejL3zvPJa17lTmxcW_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 Fulltext indexing: m2CQem_26NP5 https://www.criterion.com/shop/browse/list?director=holland-agnieszka
|
||
I 2022/06/09 11:04:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[m2CQem_26NP5 (1735154906512228352)]} 0 6
|
||
I 2022/06/09 11:04:29 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=holland-agnieszka [m2CQem_26NP5]
|
||
Description: Agnieszka Holland films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12487 bytes |
|
||
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:29 SWITCHBOARD CRAWL: ADDED 71 LINKS FROM https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife, STACKING TIME = 2, PARSING TIME = 32
|
||
I 2022/06/09 11:04:29 REJECTED https://theplaylist.net/carey-mulligan-wildlife-sundance-review-20180121/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://thefilmstage.com/reviews/sundance-review-wildlife-is-a-remarkably-assured-directorial-debut-for-paul-dano/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/embed/DpGk2oebiDY - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.avclub.com/laura-dern-digs-deep-in-the-most-powerful-and-disturbin-1822326316 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/embed/8yFxapmKLdM - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED http://www.sundance.org/projects/wildlife - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.thewrap.com/wildlife-review-paul-danos-directorial-debut-austere-portrait-family-crisis/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.theguardian.com/film/2018/jan/22/wildlife-review-carey-mulligan-paul-dano-directorial-debut-sundance - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED http://www.vulture.com/2018/01/wildlife-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.screendaily.com/reviews/wildlife-sundance-review/5125766.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-sundance-day-four/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.hollywoodreporter.com/review/wildlife-review-1076443 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.rogerebert.com/sundance/sundance-2018-wildlife - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED http://variety.com/2018/film/reviews/wildlife-review-carey-mulligan-1202671259/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED http://filmmakermagazine.com/104656-film-is-about-making-magic-with-these-kind-of-challenges-dp-diego-garcia-on-wildlife/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://s3.amazonaws.com/criterion-production/images/9525-3f1277f09f93bbce17dddd1309aedb17/wildlife01242018_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-sundance-wildlife-paul-dano-20180120-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED https://www.cityweekly.net/BuzzBlog/archives/2018/01/24/sundance-film-festival-2018-day-6-capsules - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 REJECTED http://www.indiewire.com/2018/01/wildlife-review-paul-dano-carey-mulligan-jake-gyllenhaal-sundance-2018-1201919723/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:29 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 435, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:29 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=berger-ludwig
|
||
I 2022/06/09 11:04:29 Fulltext indexing: muBRVm_26NP5 https://www.criterion.com/shop/browse/list?director=berger-ludwig
|
||
I 2022/06/09 11:04:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[muBRVm_26NP5 (1735154906775420928)]} 0 11
|
||
I 2022/06/09 11:04:29 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse/list?director=berger-ludwig [muBRVm_26NP5]
|
||
Description: Ludwig Berger films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12478 bytes |
|
||
LinkStorageTime: 18 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo, 57032 bytes
|
||
I 2022/06/09 11:04:29 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife
|
||
I 2022/06/09 11:04:29 Fulltext indexing: mWwnsG_26NP5 https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife
|
||
I 2022/06/09 11:04:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[mWwnsG_26NP5 (1735154906879229952)]} 0 3
|
||
I 2022/06/09 11:04:29 SWITCHBOARD *Indexed 605 words in URL https://www.criterion.com/current/posts/5321-sundance-2018-paul-dano-s-wildlife [mWwnsG_26NP5]
|
||
Description: Sundance 2018: Paul Dano’s Wildlife | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 7444 bytes |
|
||
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=pichel-irving, 224720 bytes
|
||
I 2022/06/09 11:04:29 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 433, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 8)) = 243
|
||
I 2022/06/09 11:04:29 HTCACHE storing content of url https://www.criterion.com/current/author/628-terry-southern, 48869 bytes
|
||
I 2022/06/09 11:04:29 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 430, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
|
||
I 2022/06/09 11:04:30 HeapReader generating index for /root/yacy/DATA/HTCACHE/file.array/YpLgnuY9YMRB.20220609110430002.blob, 0 MB. Please wait.
|
||
I 2022/06/09 11:04:30 HeapReader finished index generation for /root/yacy/DATA/HTCACHE/file.array/YpLgnuY9YMRB.20220609110430002.blob, 0 entries, 0 gaps.
|
||
I 2022/06/09 11:04:30 Heap initializing heap /root/yacy/DATA/HTCACHE/file.array/YpLgnuY9YMRB.20220609110430002.blob
|
||
I 2022/06/09 11:04:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 SWITCHBOARD CRAWL: ADDED 39 LINKS FROM https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo, STACKING TIME = 2, PARSING TIME = 5
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo
|
||
I 2022/06/09 11:04:30 Fulltext indexing: mRRRZG_26NP5 https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo
|
||
I 2022/06/09 11:04:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[mRRRZG_26NP5 (1735154907310194688)]} 0 5
|
||
I 2022/06/09 11:04:30 SWITCHBOARD *Indexed 99 words in URL https://www.criterion.com/current/posts/1972-a-gondry-tribute-to-jean-vigo [mRRRZG_26NP5]
|
||
Description: A Gondry Tribute to Jean Vigo | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 1355 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:30 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1865-2b74f037f454df1d78013f06dc4aaea4/0TUeLtsha8fzPrVMeQ8rNOnpUVmvME_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://s3.amazonaws.com/criterion-production/films/1f4199efee0716b73e643f44cffd628f/FsFD7z9JPS8zGJwwhpboY5pnNOOmIc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=pichel-irving, STACKING TIME = 3, PARSING TIME = 44
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/628-terry-southern, STACKING TIME = 1, PARSING TIME = 11
|
||
I 2022/06/09 11:04:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=pichel-irving
|
||
I 2022/06/09 11:04:30 Fulltext indexing: mVARsm_26NP5 https://www.criterion.com/shop/browse/list?director=pichel-irving
|
||
I 2022/06/09 11:04:30 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 430, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[mVARsm_26NP5 (1735154907506278400)]} 0 12
|
||
I 2022/06/09 11:04:30 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=pichel-irving [mVARsm_26NP5]
|
||
Description: Irving Pichel films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12504 bytes |
|
||
LinkStorageTime: 23 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:30 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/current/author/628-terry-southern
|
||
I 2022/06/09 11:04:30 Fulltext indexing: lpTHqG_26NP5 https://www.criterion.com/current/author/628-terry-southern
|
||
I 2022/06/09 11:04:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lpTHqG_26NP5 (1735154907527249920)]} 0 0
|
||
I 2022/06/09 11:04:30 SWITCHBOARD *Indexed 112 words in URL https://www.criterion.com/current/author/628-terry-southern [lpTHqG_26NP5]
|
||
Description: Terry Southern | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 1343 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:30 YACY rulebasedUpdateInfo: not an automatic update selected
|
||
I 2022/06/09 11:04:30 RESOURCE OBSERVER resources ok
|
||
I 2022/06/09 11:04:30 SWITCHBOARD postprocessing deactivated: field process_sxt is not enabled
|
||
I 2022/06/09 11:04:30 SWITCHBOARD postprocessing deactivated: no enough ram (420048312), needed 536870912, to force change field postprocessing.minimum_ram
|
||
I 2022/06/09 11:04:30 SWITCHBOARD postprocessing deactivated: constraints violated
|
||
I 2022/06/09 11:04:30 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=armstrong-gillian, 224199 bytes
|
||
I 2022/06/09 11:04:30 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 439, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 4)) = 247
|
||
I 2022/06/09 11:04:30 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kenton-erle-c, 224183 bytes
|
||
I 2022/06/09 11:04:30 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=armstrong-gillian, STACKING TIME = 1, PARSING TIME = 33
|
||
I 2022/06/09 11:04:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://s3.amazonaws.com/criterion-production/films/1076892f9021ddb7c859fd3c8e320e2a/Ci81X9eJjj5UeZxOsh0yqJlCk9CGOm_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kenton-erle-c, STACKING TIME = 3, PARSING TIME = 43
|
||
I 2022/06/09 11:04:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/films/9de0eda5b161e218aec5e45c9f71bc46/CIZxKmCjJsPRkHDp0FKzMMDIX4n72l_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=armstrong-gillian
|
||
I 2022/06/09 11:04:31 Fulltext indexing: lR4q4m_26NP5 https://www.criterion.com/shop/browse/list?director=armstrong-gillian
|
||
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lR4q4m_26NP5 (1735154908351430656)]} 0 4
|
||
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=armstrong-gillian [lR4q4m_26NP5]
|
||
Description: Gillian Armstrong films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12474 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=kenton-erle-c
|
||
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lZg9rm_26NP5 (1735154908420636672)]} 0 2
|
||
I 2022/06/09 11:04:31 Fulltext indexing: lZg9rm_26NP5 https://www.criterion.com/shop/browse/list?director=kenton-erle-c
|
||
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=kenton-erle-c [lZg9rm_26NP5]
|
||
Description: Erle C. Kenton films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12467 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:31 HTCACHE storing content of url https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau, 71186 bytes
|
||
I 2022/06/09 11:04:31 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau, STACKING TIME = 2, PARSING TIME = 60
|
||
I 2022/06/09 11:04:31 REJECTED https://www.theguardian.com/film/2019/may/15/bacurau-review-brazil-outback-western-cannes - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://lwlies.com/festivals/bacarau-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 437, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 7)) = 244
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.festival-cannes.com/en/festival/films/bacurau - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/embed/Hr49Ayyb3zs?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.telegraph.co.uk/films/0/bacurau-review-bloodsoaked-brazilian-sci-fi-western-shades-john/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://criterion-production.s3.amazonaws.com/LuFa9XKpt7y280rU9YIY3VdsTEwMsM.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-3-when-push-comes-to-shove - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-chang-notebook-bacurau-les-miserables-deerskin-20190516-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.hollywoodreporter.com/review/bacurau-review-1211067 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://thefilmstage.com/reviews/cannes-review-bacurau-is-a-john-carpenter-inspired-politically-fueled-revenge-fantasy/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 25 words in URL https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau
|
||
I 2022/06/09 11:04:31 Fulltext indexing: lGV5yG_26NP5 https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau
|
||
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lGV5yG_26NP5 (1735154908558000128)]} 0 2
|
||
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 473 words in URL https://www.criterion.com/current/posts/6369-mendon-a-and-dornelles-s-bacurau [lGV5yG_26NP5]
|
||
Description: Mendonça and Dornelles’s Bacurau | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 5403 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:31 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 437, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:31 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kerrigan-lodge, 224160 bytes
|
||
I 2022/06/09 11:04:31 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kerrigan-lodge, STACKING TIME = 2, PARSING TIME = 33
|
||
I 2022/06/09 11:04:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/films/072341219cb469ae34678a376c4cd241/Zj2dWjjyBYisz6T1eembGfuvePjxGp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 437, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
|
||
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=kerrigan-lodge
|
||
I 2022/06/09 11:04:31 Fulltext indexing: k873Wm_26NP5 https://www.criterion.com/shop/browse/list?director=kerrigan-lodge
|
||
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[k873Wm_26NP5 (1735154909134716928)]} 0 3
|
||
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse/list?director=kerrigan-lodge [k873Wm_26NP5]
|
||
Description: Lodge Kerrigan films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12460 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:31 HTCACHE storing content of url https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole, 72068 bytes
|
||
I 2022/06/09 11:04:31 SWITCHBOARD CRAWL: ADDED 64 LINKS FROM https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole, STACKING TIME = 2, PARSING TIME = 7
|
||
I 2022/06/09 11:04:31 REJECTED https://www.festival-cannes.com/en/films/beanpole - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://criterion-production.s3.amazonaws.com/n8sCcCzSAzLL1KiugbHE7WM5fJ7SHf.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://variety.com/2019/film/festivals/filmmaker-kantemir-balagov-talks-about-his-cannes-un-certain-regard-drama-beanpole-1203216225/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.indiewire.com/2019/05/beanpole-review-cannes-2019-1202141983/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.rogerebert.com/cannes/cannes-2019-for-sama-beanpole - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/embed/-2K0_PfthrY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://film.avclub.com/postwar-drama-and-an-unnerving-spin-on-a-sci-fi-classic-1834865079 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.hollywoodreporter.com/review/beanpole-review-1211204 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-3-when-push-comes-to-shove - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.ioncinema.com/reviews/kantemir-balagov-beanpole-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://variety.com/2019/film/markets-festivals/beanpole-review-1203215728/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 REJECTED https://www.screendaily.com/reviews/beanpole-cannes-review/5139505.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:31 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole
|
||
I 2022/06/09 11:04:31 Fulltext indexing: k4x7HG_26NP5 https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole
|
||
I 2022/06/09 11:04:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[k4x7HG_26NP5 (1735154909267886080)]} 0 1
|
||
I 2022/06/09 11:04:31 SWITCHBOARD *Indexed 522 words in URL https://www.criterion.com/current/posts/6373-kantemir-balagov-s-beanpole [k4x7HG_26NP5]
|
||
Description: Kantemir Balagov’s Beanpole | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 6179 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:31 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 437, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:32 HTCACHE storing content of url https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse, 71911 bytes
|
||
I 2022/06/09 11:04:32 SWITCHBOARD CRAWL: ADDED 65 LINKS FROM https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse, STACKING TIME = 1, PARSING TIME = 53
|
||
I 2022/06/09 11:04:32 REJECTED https://www.rogerebert.com/cannes/cannes-2019-the-lighthouse-lux-aeterna - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.vanityfair.com/hollywood/2019/05/robert-pattinson-the-lighthouse-movie-review-cannes - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/posts/7823-tribeca-2022 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/posts/7819-early-summer-reading - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://lwlies.com/festivals/the-lighthouse-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/series/did-you-see-this - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://twitter.com/A24/status/1130602426946543616 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://variety.com/2019/film/reviews/the-lighthouse-review-robert-pattinson-willem-dafoe-1203220127/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-6-teenage-martyrs-and-lighthouse-keepers - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.quinzaine-realisateurs.com/en/film/the-lighthouse/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/category/1-on-film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/series/cannes-2019 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.thedailybeast.com/robert-pattinson-loses-his-damn-mind-in-cannes-film-festivals-the-lighthouse - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://thefilmstage.com/reviews/the-light-house-cannes-review-robert-pattinson-willem-dafoe/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://theplaylist.net/lighthouse-cannes-review-20190519/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/posts/7822-irma-vep-revamp - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-production.s3.amazonaws.com/6LxWp33V8LLIofe5AHi4VLIOD0FDMX.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/category/20-the-daily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/author/654-david-hudson - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://criterion-v2.herokuapp.com/current/posts/7818-american-neorealism-now - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.telegraph.co.uk/films/0/lighthouse-review-film-will-make-head-soul-ring/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse
|
||
I 2022/06/09 11:04:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kYbziG_26NP5 (1735154909454532608)]} 0 2
|
||
I 2022/06/09 11:04:32 Fulltext indexing: kYbziG_26NP5 https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse
|
||
I 2022/06/09 11:04:32 SWITCHBOARD *Indexed 559 words in URL https://www.criterion.com/current/posts/6383-robert-eggers-s-the-lighthouse [kYbziG_26NP5]
|
||
Description: Robert Eggers’s The Lighthouse | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 6375 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:32 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 435, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 14)) = 237
|
||
I 2022/06/09 11:04:32 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nakahira-ko, 224182 bytes
|
||
I 2022/06/09 11:04:32 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=nakahira-ko, STACKING TIME = 1, PARSING TIME = 20
|
||
I 2022/06/09 11:04:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://s3.amazonaws.com/criterion-production/films/1d775c19a4731003ec45c846f2134507/OJ4jbjYEb96JR4ixXoUpfsFPa3Yvmb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
|
||
I 2022/06/09 11:04:32 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 435, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:32 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=nakahira-ko
|
||
I 2022/06/09 11:04:32 Fulltext indexing: kX7U9m_26NP5 https://www.criterion.com/shop/browse/list?director=nakahira-ko
|
||
I 2022/06/09 11:04:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kX7U9m_26NP5 (1735154909934780416)]} 0 4
|
||
I 2022/06/09 11:04:32 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=nakahira-ko [kX7U9m_26NP5]
|
||
Description: Kô Nakahira films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12475 bytes |
|
||
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:32 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:04:32 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@5b6ac81a[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
|
||
I 2022/06/09 11:04:32 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
|
||
I 2022/06/09 11:04:32 HTCACHE storing content of url https://www.criterion.com/current/author/856-robert-daniels, 50068 bytes
|
||
I 2022/06/09 11:04:32 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/856-robert-daniels, STACKING TIME = 0, PARSING TIME = 4
|
||
I 2022/06/09 11:04:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kVg7QG_26NP5 (1735154910045929472)]} 0 1
|
||
I 2022/06/09 11:04:32 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/current/author/856-robert-daniels
|
||
I 2022/06/09 11:04:32 Fulltext indexing: kVg7QG_26NP5 https://www.criterion.com/current/author/856-robert-daniels
|
||
I 2022/06/09 11:04:32 SWITCHBOARD *Indexed 135 words in URL https://www.criterion.com/current/author/856-robert-daniels [kVg7QG_26NP5]
|
||
Description: Robert Daniels | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 1981 bytes |
|
||
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:32 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 433, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:32 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo, 224175 bytes
|
||
I 2022/06/09 11:04:32 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 436, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo, STACKING TIME = 1, PARSING TIME = 30
|
||
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/755cc14a06822abce0859f66f77ba87d/4ojfoQo0qFPOFyenaCA774vHWYBMWt_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo
|
||
I 2022/06/09 11:04:33 Fulltext indexing: kWH4_m_26NP5 https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo
|
||
I 2022/06/09 11:04:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kWH4_m_26NP5 (1735154910451728384)]} 0 2
|
||
I 2022/06/09 11:04:33 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=nakagawa-nobuo [kWH4_m_26NP5]
|
||
Description: Nobuo Nakagawa films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12491 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:33 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 436, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:33 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bruckman-clyde, 224211 bytes
|
||
I 2022/06/09 11:04:33 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 439, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
|
||
I 2022/06/09 11:04:33 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=gremillon-jean, 225904 bytes
|
||
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=bruckman-clyde, STACKING TIME = 1, PARSING TIME = 144
|
||
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/3a4a52811b630a9836c1b10cb2c55a38/1DZVBE8PnMfkggyvh5s9f7K2TSAiF0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=gremillon-jean, STACKING TIME = 1, PARSING TIME = 68
|
||
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1810-937c264a8f71a4e7b7e56fd9bb1f6573/RH8phccQiFbDuedTfBmuDM9ZgEXY4E_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/2246578d818804de9a010ec2f6761940/LS7nGzD3CGCHvbfnW7CFToCn83HJRf_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/1819692b0157948bf277f85e34504c85/HU1NSd7jljSoSS3ka7bypfce5Wx7X4_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/9f4c81fc85a20a9ca4af74f218536096/4MH6hVt4vxfz8BFDRrOc5UO5HC841g_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bruckman-clyde
|
||
I 2022/06/09 11:04:33 Fulltext indexing: kUor1m_26NP5 https://www.criterion.com/shop/browse/list?director=bruckman-clyde
|
||
I 2022/06/09 11:04:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kUor1m_26NP5 (1735154911115476992)]} 0 3
|
||
I 2022/06/09 11:04:33 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=bruckman-clyde [kUor1m_26NP5]
|
||
Description: Clyde Bruckman films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12484 bytes |
|
||
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:33 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 440, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 9)) = 242
|
||
I 2022/06/09 11:04:33 HTCACHE storing content of url https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law, 71327 bytes
|
||
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law, STACKING TIME = 5, PARSING TIME = 14
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/embed/9NR--aUs7gY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/b069abe928385f97336491824e25923b/MjNYU96ZyBqDUuGvnpoqXLFz6MVKPS_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/f01fd5cd8ceeda0a6b33f78e25f81d98/dYeHQUY0SAGvBWP5y5tDXGWAYDhgrh_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/images/5009-0fef25298fac51d7cbd5e4fc73c07983/Mailer_Episode_1_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=gremillon-jean
|
||
I 2022/06/09 11:04:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kUQ9Xm_26NP5 (1735154911315755008)]} 0 2
|
||
I 2022/06/09 11:04:33 Fulltext indexing: kUQ9Xm_26NP5 https://www.criterion.com/shop/browse/list?director=gremillon-jean
|
||
I 2022/06/09 11:04:33 SWITCHBOARD *Indexed 1211 words in URL https://www.criterion.com/shop/browse/list?director=gremillon-jean [kUQ9Xm_26NP5]
|
||
Description: Jean Grémillon films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12622 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:33 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law
|
||
I 2022/06/09 11:04:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[jn7tkG_26NP5 (1735154911333580800)]} 0 1
|
||
I 2022/06/09 11:04:33 Fulltext indexing: jn7tkG_26NP5 https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law
|
||
I 2022/06/09 11:04:33 SWITCHBOARD *Indexed 275 words in URL https://www.criterion.com/current/posts/2516-pennebaker-on-mailer-wild-90-and-beyond-the-law [jn7tkG_26NP5]
|
||
Description: Pennebaker on Mailer: Wild 90 and Beyond the Law | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3231 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:33 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=menzies-william-cameron, 224247 bytes
|
||
I 2022/06/09 11:04:33 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=menzies-william-cameron, STACKING TIME = 1, PARSING TIME = 20
|
||
I 2022/06/09 11:04:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://s3.amazonaws.com/criterion-production/films/8acc91247738186c941d26d985dd25d1/Tq53ScFvzrnFOqKLNLZfExM0t52C7k_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=menzies-william-cameron
|
||
I 2022/06/09 11:04:34 Fulltext indexing: kPW78m_26NP5 https://www.criterion.com/shop/browse/list?director=menzies-william-cameron
|
||
I 2022/06/09 11:04:34 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 442, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
|
||
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kPW78m_26NP5 (1735154911474089984)]} 0 5
|
||
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=menzies-william-cameron [kPW78m_26NP5]
|
||
Description: William Cameron Menzies films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12495 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:34 HTCACHE storing content of url https://www.criterion.com/current/author/179-pauline-kael, 54697 bytes
|
||
I 2022/06/09 11:04:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:34 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/current/author/179-pauline-kael, STACKING TIME = 2, PARSING TIME = 4
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[i91qcG_26NP5 (1735154911712116736)]} 0 1
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/current/author/179-pauline-kael
I 2022/06/09 11:04:34 Fulltext indexing: i91qcG_26NP5 https://www.criterion.com/current/author/179-pauline-kael
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 185 words in URL https://www.criterion.com/current/author/179-pauline-kael [i91qcG_26NP5]
Description: Pauline Kael | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2270 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:34 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 439, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:34 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=turell-saul-j, 225976 bytes
I 2022/06/09 11:04:34 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=turell-saul-j, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/films/d77d4a38d63c784669e245e92055766d/Z8aTrBp2gBHeTKrc8NwYihqR92yy9L_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/films/69a755ba1d29f769584b674d4114ac40/gGzssRokuyhN2VwbxmWfgZJVIq2EM7_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 442, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 11 words in URL https://www.criterion.com/shop/browse/list?director=turell-saul-j
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[jV-1sm_26NP5 (1735154912029835264)]} 0 2
I 2022/06/09 11:04:34 Fulltext indexing: jV-1sm_26NP5 https://www.criterion.com/shop/browse/list?director=turell-saul-j
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 1210 words in URL https://www.criterion.com/shop/browse/list?director=turell-saul-j [jV-1sm_26NP5]
Description: Saul J. Turell films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12647 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:34 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=rees-dee, 224126 bytes
I 2022/06/09 11:04:34 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=rees-dee, STACKING TIME = 0, PARSING TIME = 84
I 2022/06/09 11:04:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://s3.amazonaws.com/criterion-production/films/a68a36ba70947ac3704098a3860aa0b8/wHzhBwq8bLT9UrBKTI9PpiwWDV3YAM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 442, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=rees-dee
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[i9ZO_m_26NP5 (1735154912336019456)]} 0 2
I 2022/06/09 11:04:34 Fulltext indexing: i9ZO_m_26NP5 https://www.criterion.com/shop/browse/list?director=rees-dee
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=rees-dee [i9ZO_m_26NP5]
Description: Dee Rees films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12452 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:34 HTCACHE storing content of url https://www.criterion.com/current/author/404-hope-parrish, 48381 bytes
I 2022/06/09 11:04:34 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/404-hope-parrish, STACKING TIME = 0, PARSING TIME = 4
I 2022/06/09 11:04:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:34 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/current/author/404-hope-parrish
I 2022/06/09 11:04:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[iecx0G_26NP5 (1735154912428294144)]} 0 1
I 2022/06/09 11:04:34 Fulltext indexing: iecx0G_26NP5 https://www.criterion.com/current/author/404-hope-parrish
I 2022/06/09 11:04:34 SWITCHBOARD *Indexed 120 words in URL https://www.criterion.com/current/author/404-hope-parrish [iecx0G_26NP5]
Description: Hope Parrish | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1421 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:35 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 439, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:35 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=fukunaga-cary-joji, 224197 bytes
I 2022/06/09 11:04:35 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=fukunaga-cary-joji, STACKING TIME = 0, PARSING TIME = 22
I 2022/06/09 11:04:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/d625b0e7f179fa73f1d10d4ff66873a6/KwcufB3P2S9l5e6g7eQitniSmR32hr_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 HTCACHE storing content of url https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list, 55104 bytes
I 2022/06/09 11:04:35 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=fukunaga-cary-joji
I 2022/06/09 11:04:35 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list, STACKING TIME = 1, PARSING TIME = 67
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/1522135761424c32d477da9851f016ff/33GcylWNQvIKPneqIDDpcJnFPoUtSg_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/a8c39a413f3134b2940ede42c30c02d3/qL6ZEBtqgDUf0j76xJ6GFEpaGQYXl3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/ea8616829a288b5b6d680c9f6b66ba59/03UzOLZzQogXDtQOTkIp8BbLpZWGYM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/d2c4d40b1ce44f03b1c60a2ae9829ded/EGvQdEyNez1O8QmzlhTj1a1gyGcKHg_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[iySfDm_26NP5 (1735154912798441472)]} 0 2
I 2022/06/09 11:04:35 Fulltext indexing: iySfDm_26NP5 https://www.criterion.com/shop/browse?director=fukunaga-cary-joji
I 2022/06/09 11:04:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/22a10f46e2d950d3ab907e62e119bd61/QMb2egiiChRALGyT7ZrOLX40p36N5I_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/6813575ce7945498b15effc2cef1777a/Kcg04nsDzd1F6UYUmtIJ4pn5qRcDCz_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/bed1dc8df02842d6a75325665e718ebd/da8xTBLVhcfx0KQXSyOOMImKRe6s2r_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 403, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:35 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse?director=fukunaga-cary-joji [iySfDm_26NP5]
Description: Cary Joji Fukunaga films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12425 bytes |
LinkStorageTime: 17 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:35 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ic4nAm_26NP5 (1735154912814170112)]} 0 1
I 2022/06/09 11:04:35 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list
I 2022/06/09 11:04:35 Fulltext indexing: ic4nAm_26NP5 https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list
I 2022/06/09 11:04:35 SWITCHBOARD *Indexed 157 words in URL https://www.criterion.com/shop/collection/141-volker-schl-ndorff/list [ic4nAm_26NP5]
Description: Volker Schlöndorff | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1707 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:35 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 403, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:35 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 403, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:35 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=makavejev-dusan, 227186 bytes
I 2022/06/09 11:04:35 SWITCHBOARD CRAWL: ADDED 48 LINKS FROM https://www.criterion.com/shop/browse/list?director=makavejev-dusan, STACKING TIME = 1, PARSING TIME = 44
I 2022/06/09 11:04:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/5eb3b02cc20d9090e506fdd942b92cab/WsP71MgPQ5GmNEtT7QghC3MI3epzGp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/c8e739d705dfcfc85250cd8e7964cac5/Z2weqqbJY8cvVdxGqDCakEj4IjlDMF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/561ccce505c4622a24bd32dfae8565e5/G3vBQac0x5LaFJBmoSIKMWRIRLhO2a_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/31ec952d32145bfe242a4c1b187021fa/bNa4o4AivhUBPdacbpJN6IKxha7994_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1794-050c8e7aa2d3ba1ddbd296d108239cf1/shvY9H0K9PWixHcIAuuAUuKL7GcP2V_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:35 REJECTED https://s3.amazonaws.com/criterion-production/films/688737ce839fa08fa7fda02cbabcbb27/IHT6Dhx73vSSKCSX8RLh2ThYArDcGb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 SWITCHBOARD Excluded 11 words in URL https://www.criterion.com/shop/browse/list?director=makavejev-dusan
I 2022/06/09 11:04:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[iBitwm_26NP5 (1735154913560756224)]} 0 3
I 2022/06/09 11:04:36 Fulltext indexing: iBitwm_26NP5 https://www.criterion.com/shop/browse/list?director=makavejev-dusan
I 2022/06/09 11:04:36 SWITCHBOARD *Indexed 1228 words in URL https://www.criterion.com/shop/browse/list?director=makavejev-dusan [iBitwm_26NP5]
Description: Dušan Makavejev films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12802 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:36 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=edgren-gustaf, 224803 bytes
I 2022/06/09 11:04:36 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 462, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 5)) = 245
I 2022/06/09 11:04:36 HTCACHE storing content of url https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life, 119555 bytes
I 2022/06/09 11:04:36 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=edgren-gustaf, STACKING TIME = 1, PARSING TIME = 125
I 2022/06/09 11:04:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/films/9ae2fdb4db278ca13d1d6f9f72c7f9d1/7AOCgcEUHNbk2QLpyr837sD4mNACMd_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 SWITCHBOARD CRAWL: ADDED 75 LINKS FROM https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life, STACKING TIME = 6, PARSING TIME = 20
|
||
I 2022/06/09 11:04:36 REJECTED http://www.vulture.com/2018/01/review-private-life-is-a-dazzling-comedy-about-families.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://variety.com/2018/film/reviews/private-life-review-sundance-paul-giamatti-kathryn-hahn-1202668747/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.ioncinema.com/reviews/private-life-tamara-jenkins-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://beta.latimes.com/entertainment/movies/la-et-mn-sundance-tamara-jenkins-private-life-20180119-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.youtube.com/embed/_CBQzRTPHRo - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.timeout.com/us/film/private-life - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.pastemagazine.com/articles/2018/01/private-life.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://uproxx.com/filmdrunk/private-life-movie-review-sundance/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.cityweekly.net/BuzzBlog/archives/2018/01/19/sundance-film-festival-2018-day-1-capsules - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-sundance-diary-justin-chang-20180120-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://www.indiewire.com/2018/01/sundance-2018-private-life-tamara-jenkins-netflix-1201918180/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://www.indiewire.com/2018/01/private-life-review-tamara-jenkins-paul-giamatti-kathryn-hahn-sundance-2018-1201919179/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.rogerebert.com/sundance/sundance-2018-private-life - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.thedailybeast.com/private-life-the-perfect-sundance-opening-night-film-11-years-in-the-making - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://theplaylist.net/private-life-sundance-review-20180119/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-sundance-day-two/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/images/9484-bdac577c62fb0f754068f8b5a2e823d4/privatelife01192018_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.avclub.com/paul-giamatti-and-kathryn-hahn-try-to-get-pregnant-in-t-1822250381 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://www.sundance.org/projects/private-life - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED http://flavorwire.com/612744/the-best-and-worst-movies-of-the-2018-sundance-film-festival/8 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.rollingstone.com/movies/news/sundance-2018-blindspotting-private-life-hits-fest-sweet-spots-w515630 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.hollywoodreporter.com/review/private-life-review-1075835 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://thefilmstage.com/reviews/sundance-review-private-life-finds-hardship-honesty-and-humor-in-infertility/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=edgren-gustaf
|
||
I 2022/06/09 11:04:36 Fulltext indexing: hd5_fm_26NP5 https://www.criterion.com/shop/browse/list?director=edgren-gustaf
|
||
I 2022/06/09 11:04:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[hd5_fm_26NP5 (1735154913828143104)]} 0 5
|
||
I 2022/06/09 11:04:36 HTCACHE storing content of url https://www.criterion.com/current/author/405-marie-nyrer-d, 48881 bytes
|
||
I 2022/06/09 11:04:36 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 416, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:36 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/405-marie-nyrer-d, STACKING TIME = 10, PARSING TIME = 6
|
||
I 2022/06/09 11:04:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:36 SWITCHBOARD *Indexed 1199 words in URL https://www.criterion.com/shop/browse/list?director=edgren-gustaf [hd5_fm_26NP5]
|
||
Description: Gustaf Edgren films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12522 bytes |
|
||
LinkStorageTime: 245 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:36 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life
|
||
I 2022/06/09 11:04:36 Fulltext indexing: hdPeQG_26NP5 https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life
|
||
I 2022/06/09 11:04:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[hdPeQG_26NP5 (1735154914092384256)]} 0 4
|
||
I 2022/06/09 11:04:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[hX3j6G_26NP5 (1735154914099724288)]} 0 0
|
||
I 2022/06/09 11:04:36 SWITCHBOARD *Indexed 875 words in URL https://www.criterion.com/current/posts/5289-sundance-2018-tamara-jenkins-s-private-life [hdPeQG_26NP5]
|
||
Description: Sundance 2018: Tamara Jenkins’s Private Life | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12053 bytes |
|
||
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:36 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/current/author/405-marie-nyrer-d
|
||
I 2022/06/09 11:04:36 Fulltext indexing: hX3j6G_26NP5 https://www.criterion.com/current/author/405-marie-nyrer-d
|
||
I 2022/06/09 11:04:36 SWITCHBOARD *Indexed 118 words in URL https://www.criterion.com/current/author/405-marie-nyrer-d [hX3j6G_26NP5]
|
||
Description: Marie Nyreröd | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 1416 bytes |
|
||
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:36 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 416, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
|
||
I 2022/06/09 11:04:36 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 416, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:36 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=morris-errol, 226428 bytes
|
||
I 2022/06/09 11:04:37 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bay-michael, 224667 bytes
|
||
I 2022/06/09 11:04:37 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=morris-errol, STACKING TIME = 1, PARSING TIME = 26
|
||
I 2022/06/09 11:04:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1776-78eb25c559a866198d58680c9874465e/N2awPkaE0xOiPEj26tjI9yv2eLLXZM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/cc08b14f22e5cb4d4abc35e2bd1e76eb/duZpzWtzx94ixvPnyavkOi9YOZqWR6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/3217a6e471ba2f9239cbe6cb398aa02f/dgNduEohOof1NohG0fGVRpPOyb322m_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/939ec46490d500823654f65419a8db04/f57BMlw9kdr7YF0EiuQtIM1i67rFHU_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/5199593d0fcdff78d678ad5ac1745fa9/vZ1yJIDRTUcAEmQs90SEEQ7tQvHCqu_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 446, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:37 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bay-michael, STACKING TIME = 1, PARSING TIME = 103
|
||
I 2022/06/09 11:04:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/07f4eeb745b2223e67ad82c6dc2e3ed3/o6c4ES95CWTBBINB8ahqL4niihMuLT_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/7858f806be01113ec3f26840d2ec0cab/zcQiVXuZFCisdXVhLeyD96Hxr6Vqp8_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=morris-errol
|
||
I 2022/06/09 11:04:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[hWtCam_26NP5 (1735154914750889984)]} 0 2
|
||
I 2022/06/09 11:04:37 Fulltext indexing: hWtCam_26NP5 https://www.criterion.com/shop/browse/list?director=morris-errol
|
||
I 2022/06/09 11:04:37 SWITCHBOARD *Indexed 1214 words in URL https://www.criterion.com/shop/browse/list?director=morris-errol [hWtCam_26NP5]
|
||
Description: Errol Morris films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12699 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:37 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bay-michael
|
||
I 2022/06/09 11:04:37 Fulltext indexing: g5RLVm_26NP5 https://www.criterion.com/shop/browse/list?director=bay-michael
|
||
I 2022/06/09 11:04:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[g5RLVm_26NP5 (1735154914815901696)]} 0 2
|
||
I 2022/06/09 11:04:37 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=bay-michael [g5RLVm_26NP5]
|
||
Description: Michael Bay films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12503 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:37 HTCACHE storing content of url https://www.criterion.com/films/1430, 76513 bytes
|
||
I 2022/06/09 11:04:37 SWITCHBOARD CRAWL: ADDED 63 LINKS FROM https://www.criterion.com/films/1430, STACKING TIME = 1, PARSING TIME = 8
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/971-7c4c45c3fdeb5631e89f5bd69b34fd70/Dzc8TKpXR0nl2nAvmoZBjShpANSY3t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/966-b448a73623a3104f16704d630fdc4d4a/tAsvl7EI5McyRgmZlFekuGnwgQ7p0o_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/posts/1158-c313ab31d62f8c87337a018fca5b9d64/IMAMURA_Rayns_still_original.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/the-insect-woman?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/7740c3bbca66b23dc22b0d7950193f24.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9594df5f105bf7ba4cfc780077d38c50.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/98fca346151a9909ce777d3c7bcf4e14/NiN005MtaZLuhUJZOq7z7jOk0wRd02_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/955f6f5f61b98e8a440adbaff6544904/Pg6ZGiA0S3eXXGhlqVdh9PGdQtPvUw_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/dd75421ca3130652d533fe8683dbda57/gu0f7I6dfxl8kO9uUrhIaVX2xcduMR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/2d39dc6357f78246124d9b889cfee438.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/feabbe7bae1e5ba2f447871de1c77dc0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/1cc4a3cbedc3d14be75648a0c88c3ebe.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/a0ee213bcd52a61768546b6f50b49e93/oUj4Rhj3eltZ2KJezVL8MqVJVtmf8S_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1855-d1af5d0a5e3d807a317f3cc4e9c52f38/vdPxhXhBYsJOygUcZ3XqUkDiS3dZOl_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ec6c0c425195715ec06d674b8f7973e5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/films/1430
|
||
I 2022/06/09 11:04:37 Fulltext indexing: gtsbye_26NP5 https://www.criterion.com/films/1430
|
||
I 2022/06/09 11:04:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[gtsbye_26NP5 (1735154914917613568)]} 0 2
|
||
I 2022/06/09 11:04:37 SWITCHBOARD *Indexed 348 words in URL https://www.criterion.com/films/1430 [gtsbye_26NP5]
|
||
Description: The Insect Woman (1963) | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 4037 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:37 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:37 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
|
||
I 2022/06/09 11:04:37 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
|
||
I 2022/06/09 11:04:37 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:04:37 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@107d2a8c[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wl(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772677667}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
|
||
I 2022/06/09 11:04:37 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
|
||
I 2022/06/09 11:04:37 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=whelan-tim, 224199 bytes
|
||
I 2022/06/09 11:04:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=whelan-tim, STACKING TIME = 3, PARSING TIME = 35
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 REJECTED https://s3.amazonaws.com/criterion-production/films/41132bea28bb3bbea12d52e78c20b378/kl6ebsg1AK3m1ejL3zvPJa17lTmxcW_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:37 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=whelan-tim
|
||
I 2022/06/09 11:04:37 Fulltext indexing: gokr_m_26NP5 https://www.criterion.com/shop/browse/list?director=whelan-tim
|
||
I 2022/06/09 11:04:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[gokr_m_26NP5 (1735154915423027200)]} 0 4
|
||
I 2022/06/09 11:04:37 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=whelan-tim [gokr_m_26NP5]
|
||
Description: Tim Whelan films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12495 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:37 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 462, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=tanaka-tokuzo, 225536 bytes
|
||
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 51 LINKS FROM https://www.criterion.com/shop/browse?director=tanaka-tokuzo, STACKING TIME = 7, PARSING TIME = 90
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/554a88f3e461b3af84a8d7c74395c982/ITGgOs4mQf6gDsezMZFForMyh7w2oR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=jr, 224620 bytes
|
||
I 2022/06/09 11:04:38 HostQueue forcing crawl-delay of 250 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 0)) = 250
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/53029c4265e2623dc5d2e1437fdb0a15/natIW8grGAtqgN1rxflDsqZ0woLxK2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/584243c99fe050be8794d19464fd5cc6/xyjrEVJ3oamY6KDEMHoxNgk8t2lRpc_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=jr, STACKING TIME = 1, PARSING TIME = 47
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/afeea27c388b3547969c9f509df11cb5/3wuQsvYjsHTc4015Rl3wQBQsb7T9kt_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1958-60bad70091eaab48ce960c49b1f07d94/zQHE891LazZ0Sw7jE3PnOSXsdP3yew_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=tanaka-tokuzo
|
||
I 2022/06/09 11:04:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[gdLEnm_26NP5 (1735154915879157760)]} 0 2
|
||
I 2022/06/09 11:04:38 Fulltext indexing: gdLEnm_26NP5 https://www.criterion.com/shop/browse?director=tanaka-tokuzo
|
||
I 2022/06/09 11:04:38 SWITCHBOARD *Indexed 1203 words in URL https://www.criterion.com/shop/browse?director=tanaka-tokuzo [gdLEnm_26NP5]
|
||
Description: Tokuzo Tanaka films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12543 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy, 72322 bytes
|
||
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 51 LINKS FROM https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy, STACKING TIME = 1, PARSING TIME = 15
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1934-9121909235b34c704ecc53b432150a94/dZNZ1nCXJbEhZeNY4YNXuFLwJPn0yD_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/1901628472b08db9f6af1a6fcb778ef5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/7cb52be2fd1572e3a5b5276e84d487fa/TECVVTeHMMxaHWxosd91eIX8qS2hMK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/7b27c9719b8fe36e334fa0ba43910a0e/GwB3ma4IYcYsr5CZ4ZizQW0i0mxyxc_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/065858072a69f217ae517f2cc87a2c68/0T7IhsxzrE4rW48efa7kEHGrSe77j0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e8457d67a289fb50aad38901ea42f732.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/412a7c9e10c6bb26f1d1190e87cc6014.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1852-f20e8f5bdc458777cda90775598ef89c/G2jPTXWEYclcKFXC7LYWoNEj5D7V7l_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie, 71411 bytes
|
||
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie, STACKING TIME = 4, PARSING TIME = 13
|
||
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 465, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:38 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=jr
|
||
I 2022/06/09 11:04:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[gFLNjm_26NP5 (1735154916080484352)]} 0 2
|
||
I 2022/06/09 11:04:38 Fulltext indexing: gFLNjm_26NP5 https://www.criterion.com/shop/browse/list?director=jr
|
||
I 2022/06/09 11:04:38 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=jr [gFLNjm_26NP5]
|
||
Description: JR films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12462 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:38 SWITCHBOARD Excluded 22 words in URL https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy
|
||
I 2022/06/09 11:04:38 Fulltext indexing: f9jsz3_26NP5 https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy
|
||
I 2022/06/09 11:04:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[f9jsz3_26NP5 (1735154916127670272)]} 0 4
|
||
I 2022/06/09 11:04:38 SWITCHBOARD *Indexed 448 words in URL https://www.criterion.com/boxsets/689-roberto-rossellinis-war-trilogy [f9jsz3_26NP5]
|
||
Description: Roberto Rossellini’s War Trilogy | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 7665 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:38 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie
|
||
I 2022/06/09 11:04:38 Fulltext indexing: fr0NEG_26NP5 https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie
|
||
I 2022/06/09 11:04:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[fr0NEG_26NP5 (1735154916183244800)]} 0 2
|
||
I 2022/06/09 11:04:38 SWITCHBOARD *Indexed 668 words in URL https://www.criterion.com/current/posts/109-the-discreet-charm-of-the-bourgeosie [fr0NEG_26NP5]
|
||
Description: The Discreet Charm of the Bourgeosie | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 9166 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:38 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 465, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
|
||
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=carra-lucille, 224170 bytes
|
||
I 2022/06/09 11:04:38 HTCACHE storing content of url https://www.criterion.com/current/posts/7516-previewing-venice-2021, 78718 bytes
|
||
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=carra-lucille, STACKING TIME = 3, PARSING TIME = 71
|
||
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 453, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/films/9a029e3b8977bb7f3cb8290bfbd4f9a4/vXIQATcZON4HV9vCqpFDM8ZTMrR42c_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 SWITCHBOARD CRAWL: ADDED 69 LINKS FROM https://www.criterion.com/current/posts/7516-previewing-venice-2021, STACKING TIME = 1, PARSING TIME = 32
|
||
I 2022/06/09 11:04:38 REJECTED https://www.indiewire.com/2021/08/the-hand-of-god-paolo-sorrentino-interview-1234659825/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/news/bong-joon-ho-venice-film-festival-covid-19-netflix-1235053563/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.vanityfair.com/hollywood/2021/08/awards-insider-first-look-jane-campion-power-of-the-dog - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.vanityfair.com/hollywood/2021/08/awards-insider-maggie-gyllenhaal-lost-daughter-first-look - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://news.yahoo.com/dune-long-considered-unadaptable-screenwriters-190014204.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.labiennale.org/en/news/jamie-lee-curtis-golden-lion-lifetime-achievement - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.labiennale.org/en/cinema/2021/lineup/venezia-78-competition/madres-paralelas - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/spotlight/venice-barbera-oscar-1235051867/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.vulture.com/2021/08/pablo-larran-interview-on-spencer-and-biopics.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.labiennale.org/en/news/roberto-benigni-golden-lion-lifetime-achievement - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.theguardian.com/lifeandstyle/2018/oct/06/maggie-gyllenhaal-elena-ferrante-film-book - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://criterion-production.s3.amazonaws.com/h1c8DOClXrEBKiQVLBn1deSMKCAjRQ.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://deadline.com/2021/08/timothee-chalamet-interview-dune-venice-film-festival-1234824699/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.hollywoodreporter.com/movies/movie-news/edgar-wright-last-night-in-soho-baby-driver-2-1235005922/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://www.indiewire.com/2021/08/the-card-counter-paul-schrader-interview-1234660264/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/spotlight/how-venice-topper-barbera-earned-varietys-intl-achievement-in-film-award-1235051861/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/spotlight/venice-barbera-biennale-1235051970/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:38 REJECTED https://variety.com/2021/film/spotlight/jamie-lee-curtis-talks-halloween-venice-golden-lion-1235050354/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=carra-lucille
|
||
I 2022/06/09 11:04:39 Fulltext indexing: fQHXDm_26NP5 https://www.criterion.com/shop/browse/list?director=carra-lucille
|
||
I 2022/06/09 11:04:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[fQHXDm_26NP5 (1735154916723261440)]} 0 6
|
||
I 2022/06/09 11:04:39 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=carra-lucille [fQHXDm_26NP5]
|
||
Description: Lucille Carra films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12468 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:39 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/7516-previewing-venice-2021
|
||
I 2022/06/09 11:04:39 Fulltext indexing: fGGLtG_26NP5 https://www.criterion.com/current/posts/7516-previewing-venice-2021
|
||
I 2022/06/09 11:04:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[fGGLtG_26NP5 (1735154916788273152)]} 0 2
|
||
I 2022/06/09 11:04:39 SWITCHBOARD *Indexed 821 words in URL https://www.criterion.com/current/posts/7516-previewing-venice-2021 [fGGLtG_26NP5]
|
||
Description: Previewing Venice 2021 | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 11559 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:39 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 453, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:39 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 453, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:39 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mann-michael, 224122 bytes
|
||
I 2022/06/09 11:04:39 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=mann-michael, STACKING TIME = 2, PARSING TIME = 32
|
||
I 2022/06/09 11:04:39 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/films/5fdd7999dcd7824705792d1d95ee538f/r9qKHNq3ldPSJH2wwBckeNlDucBRMr_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 462, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
|
||
I 2022/06/09 11:04:39 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=mann-michael
|
||
I 2022/06/09 11:04:39 Fulltext indexing: e0aRXm_26NP5 https://www.criterion.com/shop/browse/list?director=mann-michael
|
||
I 2022/06/09 11:04:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[e0aRXm_26NP5 (1735154917446778880)]} 0 2
|
||
I 2022/06/09 11:04:39 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=mann-michael [e0aRXm_26NP5]
|
||
Description: Michael Mann films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12447 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:39 HTCACHE storing content of url https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights, 86551 bytes
|
||
I 2022/06/09 11:04:39 SWITCHBOARD CRAWL: ADDED 97 LINKS FROM https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights, STACKING TIME = 6, PARSING TIME = 10
|
||
I 2022/06/09 11:04:39 REJECTED https://variety.com/2019/film/festivals/i-lost-my-body-director-jeremy-clapin-critics-week-breakout-1203221101/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/les-heros-ne-meurent-jamais - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/land-ashes-review-1212491 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-production.s3.amazonaws.com/9pQeSejRjwTNYTNNInqj87g0cdDUzN.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/ahetzkSwUdw?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://variety.com/2019/film/reviews/litigante-review-1203216875/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-4-struggling-for-justice-overcoming-grief - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://cineuropa.org/en/newsdetail/373051/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/wL8G7NVhk50?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/posts/7819-early-summer-reading - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/X-PIoLQ8OsU?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/spKShkRlFgc?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/nuestras-madres - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/you-deserve-a-lover-cannes-review/5139688.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/vivarium - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/the-unknown-saint - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/ikki-illa-meint - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/heroes-don-t-die-les-heros-ne-meurent-jamais-review-1210499 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://cineuropa.org/en/newsdetail/372503/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/our-mothers-cannes-review/5139788.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/posts/7822-irma-vep-revamp - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://twitter.com/midmarauder/status/1125844260706705408 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/tu-merites-un-amour - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://variety.com/2019/film/reviews/vivarium-review-jesse-eisenberg-imogen-poots-1203219403/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/dwelling-in-the-fuchun-mountains-cannes-review/5139844.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/AXyZmuR_mBA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/she-runs - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/you-deserve-a-lover-1212183 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/posts/7823-tribeca-2022 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/WodOCZtv1EY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/abou-leila - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/vivarium-cannes-review/5139622.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/series/did-you-see-this - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.screendaily.com/reviews/i-lost-my-body-cannes-review/5139271.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/litigante - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/category/1-on-film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/series/cannes-2019 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/dwelling-in-the-fushun-mountains - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/unknown-saint-alaa-eddine-aljem-moroccan-buried-loot-comedy - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/hvitur-hvitur-dagur - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://womenandhollywood.com/cannes-2019-women-directors-meet-sofia-quiros-ceniza-negra/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/jai-perdu-mon-corps - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/O7oNgj0H788?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/ceniza-negra - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://player.vimeo.com/video/336205899 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/category/20-the-daily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/i-lost-my-body-jai-perdu-mon-corps-review-1210449 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/author/654-david-hudson - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://criterion-v2.herokuapp.com/current/posts/7818-american-neorealism-now - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED http://www.semainedelacritique.com/en/news/winners-of-the-58supthsup-semaine-de-la-critique - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://variety.com/2019/film/reviews/heroes-dont-die-review-1203221561/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.youtube.com/embed/b5doRU9tQ78?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 REJECTED https://www.hollywoodreporter.com/review/vivarium-review-1211969 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:39 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights
|
||
I 2022/06/09 11:04:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[eshkTG_26NP5 (1735154917631328256)]} 0 2
|
||
I 2022/06/09 11:04:39 Fulltext indexing: eshkTG_26NP5 https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights
|
||
I 2022/06/09 11:04:39 SWITCHBOARD *Indexed 883 words in URL https://www.criterion.com/current/posts/6389-critics-week-awards-and-highlights [eshkTG_26NP5]
|
||
Description: Critics’ Week Awards and Highlights | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 11269 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:39 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 459, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:40 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ivory-james, 224711 bytes
|
||
I 2022/06/09 11:04:40 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=ivory-james, STACKING TIME = 0, PARSING TIME = 24
|
||
I 2022/06/09 11:04:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/2a684a78d6b7c2e6aabb3341e08f0cf4/49RLuNCJcOFrAKVB1w00pKqmEQmXOs_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/5a82e5535265fb02f88a49f6b2fe730c/JbuzGZpZS8bG3GgdRDT5pEXIFKeAoa_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 481, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:40 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=ivory-james
|
||
I 2022/06/09 11:04:40 Fulltext indexing: e0JBOm_26NP5 https://www.criterion.com/shop/browse/list?director=ivory-james
|
||
I 2022/06/09 11:04:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[e0JBOm_26NP5 (1735154918013009920)]} 0 3
|
||
I 2022/06/09 11:04:40 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=ivory-james [e0JBOm_26NP5]
|
||
Description: James Ivory films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12521 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:40 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=manchevski-milcho, 224199 bytes
|
||
I 2022/06/09 11:04:40 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=manchevski-milcho, STACKING TIME = 1, PARSING TIME = 23
|
||
I 2022/06/09 11:04:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/f50d7a7d8ec2588c4bd6e4db00a8120f/jgAIXDTNR403btDmNLF1s6QJ7VILif_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 491, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
|
||
I 2022/06/09 11:04:40 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=manchevski-milcho
|
||
I 2022/06/09 11:04:40 Fulltext indexing: d2YYkm_26NP5 https://www.criterion.com/shop/browse/list?director=manchevski-milcho
|
||
I 2022/06/09 11:04:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[d2YYkm_26NP5 (1735154918247890944)]} 0 2
|
||
I 2022/06/09 11:04:40 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=manchevski-milcho [d2YYkm_26NP5]
|
||
Description: Milcho Manchevski films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12485 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:40 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin, 224219 bytes
|
||
I 2022/06/09 11:04:40 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin, STACKING TIME = 1, PARSING TIME = 22
|
||
I 2022/06/09 11:04:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/6790c92a200a349fa2918b59929a5b7c/SxJFUmRYI29BBqY4Im7uNTWIYo2pJi_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin
|
||
I 2022/06/09 11:04:40 Fulltext indexing: d18l5m_26NP5 https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin
|
||
I 2022/06/09 11:04:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[d18l5m_26NP5 (1735154918436634624)]} 0 2
|
||
I 2022/06/09 11:04:40 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=s-yeaworth-jr-irvin [d18l5m_26NP5]
|
||
Description: Irvin S. Yeaworth Jr. films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12498 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:40 HTCACHE storing content of url https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest, 71114 bytes
|
||
I 2022/06/09 11:04:40 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest, STACKING TIME = 2, PARSING TIME = 6
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/3ed41f79deffcb3052099b02c9660e9b/zQOZgJoUsBEgi8arpM5w8aT22vGo6W_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/f8bf41c3e8d2266f423881ceb3159429/58bZDer5maXJjg6GDgD8Tyrr6ZZAuT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/films/8f12ceb5a2e46f5f1550942e055ef1af/5yl46GfrudlcteVtODCZveKlbIlys1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
|
||
I 2022/06/09 11:04:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:40 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
|
||
I 2022/06/09 11:04:40 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
|
||
I 2022/06/09 11:04:40 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest
|
||
I 2022/06/09 11:04:40 Fulltext indexing: dxNXWG_26NP5 https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest
|
||
I 2022/06/09 11:04:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[dxNXWG_26NP5 (1735154918501646336)]} 0 8
|
||
I 2022/06/09 11:04:40 SWITCHBOARD *Indexed 296 words in URL https://www.criterion.com/current/posts/1944-janus-films-acquires-kaurism-ki-s-latest [dxNXWG_26NP5]
|
||
Description: Janus Films Acquires Kaurismäki’s Latest | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3248 bytes |
|
||
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:40 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:04:40 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:41 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=fincher-david, 224767 bytes
|
||
I 2022/06/09 11:04:41 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=fincher-david, STACKING TIME = 0, PARSING TIME = 23
|
||
I 2022/06/09 11:04:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/films/693744d52bb74cb5166725421bb473e6/d121BfwKuez4Xs7tpGnThzfqDXpCgK_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/films/999405f5b0b718043c15d1183d04bede/6ZXWpPhvznaU1VM6grQfjigdBN06Pi_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lee-bruce, 224739 bytes
|
||
I 2022/06/09 11:04:41 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=lee-bruce, STACKING TIME = 1, PARSING TIME = 39
|
||
I 2022/06/09 11:04:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1957-f90d4c48a2f932ffe7df386499f9477e/73k4EkSiXEfsdi097fieFBGdb39vlg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/films/30b0ea18473faf0c3bf0486b76b0b761/sMo9K1Z55wY1dYmHiRgWpISyYJbV5S_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=fincher-david
|
||
I 2022/06/09 11:04:41 Fulltext indexing: dRZgkm_26NP5 https://www.criterion.com/shop/browse/list?director=fincher-david
|
||
I 2022/06/09 11:04:41 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
|
||
I 2022/06/09 11:04:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[dRZgkm_26NP5 (1735154919033274368)]} 0 15
|
||
I 2022/06/09 11:04:41 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=fincher-david [dRZgkm_26NP5]
|
||
Description: David Fincher films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12544 bytes |
|
||
LinkStorageTime: 18 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:41 HTCACHE storing content of url https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle, 95335 bytes
|
||
I 2022/06/09 11:04:41 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle, STACKING TIME = 6, PARSING TIME = 14
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=lee-bruce
|
||
I 2022/06/09 11:04:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[dCJyGm_26NP5 (1735154919186366464)]} 0 2
|
||
I 2022/06/09 11:04:41 Fulltext indexing: dCJyGm_26NP5 https://www.criterion.com/shop/browse/list?director=lee-bruce
|
||
I 2022/06/09 11:04:41 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=lee-bruce [dCJyGm_26NP5]
|
||
Description: Bruce Lee films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12508 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:41 HostQueue forcing crawl-delay of 230 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 20)) = 230
|
||
I 2022/06/09 11:04:41 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle
|
||
I 2022/06/09 11:04:41 Fulltext indexing: cupuTG_26NP5 https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle
|
||
I 2022/06/09 11:04:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[cupuTG_26NP5 (1735154919357284352)]} 0 9
|
||
I 2022/06/09 11:04:41 SWITCHBOARD *Indexed 1387 words in URL https://www.criterion.com/current/posts/561-eclipse-series-2-the-documentaries-of-louis-malle [cupuTG_26NP5]
|
||
Description: Eclipse Series 2:The Documentaries of Louis Malle | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 23010 bytes |
|
||
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:41 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:41 HTCACHE storing content of url https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii, 72384 bytes
|
||
I 2022/06/09 11:04:41 SWITCHBOARD CRAWL: ADDED 73 LINKS FROM https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii, STACKING TIME = 6, PARSING TIME = 11
|
||
I 2022/06/09 11:04:41 REJECTED http://variety.com/2018/film/asia/asako-i-ii-review-netetemo-sametemo-1202809972/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.hollywoodreporter.com/review/asako-i-ii-netemo-sametemo-film-review-cannes-2018-1111789 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.festival-cannes.com/en/festival/films/netemo-sametemo - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://criticsroundup.com/film/happy-hour/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/embed/6baCO63Y6ZM?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://thefilmstage.com/reviews/cannes-review-ryusuke-hamaguchis-happy-hour-follow-up-asako-i-ii-is-a-romance-lacking-in-passion/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.theguardian.com/film/2018/may/15/asako-i-ii-review-japanese-romcom-flips-gaze-ryusuke-hamaguchi - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED http://lwlies.com/festivals/asako-ii-first-look-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED http://www.indiewire.com/2018/05/asako-i-ii-review-ryusuke-hamaguchi-1201964358/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-8-transportive-doublings-and-divisive-titles - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://mubi.com/notebook/posts/hidden-in-reality-ryusuke-hamaguchi-and-asako-i-ii - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://criterion-production.s3.amazonaws.com/5PxkwlKNwsssK3VKZVW3lu0nTjQvOO.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-cannes-day-eight/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.rogerebert.com/cannes/cannes-2018-the-house-that-jack-built-at-war - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.screendaily.com/reviews/asako-i-and-ii-cannes-review/5129364.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/embed/7dZIy-0o9ZQ?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://icsfilm.org/reviews/cannes-2018-review-asako-i-ii-ryusuke-hamaguchi/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.filmcomment.com/blog/interview-ryusuke-hamaguchi/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://film.avclub.com/spike-lee-teams-up-with-jordan-peele-for-the-funny-poi-1826042384 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/asako-i-ii-mournful-hamaguchi-ryusuke-mournful-drama - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 REJECTED https://www.thewrap.com/asako-i-ii-film-review-leisurely-japanese-drama-explores-nature-of-love/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:41 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii
|
||
I 2022/06/09 11:04:41 Fulltext indexing: bghe0G_26NP5 https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii
|
||
I 2022/06/09 11:04:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[bghe0G_26NP5 (1735154919754694656)]} 0 2
|
||
I 2022/06/09 11:04:41 SWITCHBOARD *Indexed 427 words in URL https://www.criterion.com/current/posts/5670-ryusuke-hamaguchi-s-asako-i-ii [bghe0G_26NP5]
|
||
Description: Ryusuke Hamaguchi’s Asako I & II | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 4890 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=eisenstein-sergei, 226536 bytes
|
||
I 2022/06/09 11:04:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 492, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/aed45dbeaf63414624b16890eb458dea/fSaXGJq2BBkowhHXw2FJft0UoNIdnI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1838-ee328a31205b114ef125fd81b54b5cd0/VZGhEsbGQY3luUNqMc64IKmXGoRe9U_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=eisenstein-sergei, STACKING TIME = 3, PARSING TIME = 41
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/19c902a73e243b9293de4c717430f639/H1rWEdJtowN7Xh9vSnOvPU9I6y0AgT_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/b653a1545863d441cf9b8a8bc50946b8/SXcj1Zf8bWoyaoUEuzlFAc4gPNGwYm_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv, 69293 bytes
|
||
I 2022/06/09 11:04:42 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv, STACKING TIME = 1, PARSING TIME = 17
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/KuWudzfbRgcPhJ0rR2bxTuLmg9FckZXz83m8qPOg.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1987-beb71f216e96d1ff2d0f8231f5b8b975/44LVkvftLRcr5paF4enJfBFTe5mI2c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/547f1ff5b611dda3dcfb1f50cb05e5ad/dzsA6F9rZ81DMybNgmhZiptrofQ0el_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/IzTEOB8bn3M7E71d8AYa2bsODWbSqLS0zK860sB3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/lwUiAurIs3naTuH7TldsyolVQs4eCXVbdsavoGc8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/e8d4cd2c5dd1b2541a3c8325a6d1805f/W5gKGPvtkm5evOJ9devGYL935KMAdy_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/3cC32fOwUa9qSR0GQo7IG9efAeSLIuyCdOEvd2z3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/hrONj5485vAB6fFCtShQlMCxwLVF5P3pchVEPSxy.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/IG65jmiJyizSSz9EUnzO6d6CSWXUVnkZHW2Ddnk7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/cd4ae6bdbaea9fd9c9aff1d69f924bc4/5wErYoFwVfkciAfnpRbFIhPqv7tIC5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/l47N2CPqlXDTfmssaxGdQ40vxlyb18Dqez62eSYb.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/AuV1S0FxEyXHpP7VrGPLiPwjE8L0uODZsucREqz8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=eisenstein-sergei
|
||
I 2022/06/09 11:04:42 Fulltext indexing: bzm91m_26NP5 https://www.criterion.com/shop/browse/list?director=eisenstein-sergei
|
||
I 2022/06/09 11:04:42 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[bzm91m_26NP5 (1735154920032567296)]} 0 3
|
||
I 2022/06/09 11:04:42 SWITCHBOARD *Indexed 1217 words in URL https://www.criterion.com/shop/browse/list?director=eisenstein-sergei [bzm91m_26NP5]
|
||
Description: Sergei Eisenstein films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12728 bytes |
|
||
LinkStorageTime: 14 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:42 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv
|
||
I 2022/06/09 11:04:42 Fulltext indexing: bVK-Le_26NP5 https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv
|
||
I 2022/06/09 11:04:42 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[bVK-Le_26NP5 (1735154920049344512)]} 0 2
|
||
I 2022/06/09 11:04:42 SWITCHBOARD *Indexed 302 words in URL https://www.criterion.com/films/31784-once-upon-a-time-in-china-iv [bVK-Le_26NP5]
|
||
Description: Once Upon a Time in China IV (1993) | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 2922 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=yates-peter, 224188 bytes
|
||
I 2022/06/09 11:04:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:42 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
|
||
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nicholson-jack, 224757 bytes
|
||
I 2022/06/09 11:04:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://s3.amazonaws.com/criterion-production/films/4b7b707e5d1ca031e64894a0f7664f56/Ih7c7WkDI5YcytMafpe83E30nVygJD_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=yates-peter, STACKING TIME = 3, PARSING TIME = 130
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:42 HTCACHE storing content of url https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody, 127274 bytes
|
||
I 2022/06/09 11:04:43 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:04:43 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@1dd3358e[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wl(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772677667}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wm(7.7.3):C12:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680783}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wn(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680861}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wo(7.7.3):C6:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772682978}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
|
||
I 2022/06/09 11:04:43 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
|
||
I 2022/06/09 11:04:43 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=nicholson-jack, STACKING TIME = 1, PARSING TIME = 129
|
||
I 2022/06/09 11:04:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 498, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody, STACKING TIME = 9, PARSING TIME = 24
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1864-79581df36332f7a1f027b81311c9e0f9/j8fKXLhpRU7m0dGwebBdwZLgrXaO26_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/films/5dab1d4b720b8dba2951246a6e579875/SfyfBq0uCWQjg71QHpsDa6kWq59vXR_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=yates-peter
|
||
I 2022/06/09 11:04:43 Fulltext indexing: ag4sKm_26NP5 https://www.criterion.com/shop/browse/list?director=yates-peter
|
||
I 2022/06/09 11:04:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ag4sKm_26NP5 (1735154921021374464)]} 0 4
|
||
I 2022/06/09 11:04:43 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=yates-peter [ag4sKm_26NP5]
|
||
Description: Peter Yates films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12473 bytes |
|
||
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:43 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=nicholson-jack
|
||
I 2022/06/09 11:04:43 Fulltext indexing: aVkQcm_26NP5 https://www.criterion.com/shop/browse/list?director=nicholson-jack
|
||
I 2022/06/09 11:04:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[aVkQcm_26NP5 (1735154921095823360)]} 0 2
|
||
I 2022/06/09 11:04:43 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=nicholson-jack [aVkQcm_26NP5]
|
||
Description: Jack Nicholson films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12527 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:43 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=nyreroed-marie, 224129 bytes
|
||
I 2022/06/09 11:04:43 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 498, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
|
||
I 2022/06/09 11:04:43 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody
|
||
I 2022/06/09 11:04:43 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=nyreroed-marie, STACKING TIME = 1, PARSING TIME = 106
|
||
I 2022/06/09 11:04:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://s3.amazonaws.com/criterion-production/films/4caa477448c9fe2ee28f80df08f4d89b/NmCRkRglJzsgL3AKwNshj7ENlgQZIN_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:43 Fulltext indexing: aBJHDG_26NP5 https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody
|
||
I 2022/06/09 11:04:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[aBJHDG_26NP5 (1735154921353773056)]} 0 9
|
||
I 2022/06/09 11:04:43 SWITCHBOARD *Indexed 946 words in URL https://www.criterion.com/current/posts/485-la-jet-e-unchained-melody [aBJHDG_26NP5]
|
||
Description: La Jetée: Unchained Melody | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 14618 bytes |
|
||
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:43 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=nyreroed-marie
|
||
I 2022/06/09 11:04:43 Fulltext indexing: Z8DN1m_26NP5 https://www.criterion.com/shop/browse?director=nyreroed-marie
|
||
I 2022/06/09 11:04:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Z8DN1m_26NP5 (1735154921437659136)]} 0 4
|
||
I 2022/06/09 11:04:43 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse?director=nyreroed-marie [Z8DN1m_26NP5]
|
||
Description: Marie Nyreröd films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12398 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:43 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 498, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:43 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 498, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:43 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=misumi-kenji, 230595 bytes
|
||
I 2022/06/09 11:04:43 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=freeland-thornton, 224738 bytes
|
||
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=freeland-thornton, STACKING TIME = 2, PARSING TIME = 34
|
||
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/d0606bfda6ec74c0019bd85bbe973ae0/6DSu10XLoj9GjtPJZGAxaXPs113oQD_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 HostQueue forcing crawl-delay of 250 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 6)) = 250
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/shop/browse/list?director=misumi-kenji, STACKING TIME = 2, PARSING TIME = 151
|
||
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/bdccef4372055a6d7eba1d9e48d671e0/KbfBzW7rIf0GLzZ6mfOiZKBiAPrKFM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/06aae5d548f169e22473a1560b8af40b/C9uzPZ2an3M9AoDXjTI7aMH8KBOYDU_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/4bc990101d88c836ce459146bb0409c8/0RtHB019kbCekdVj2WNYnd3nxS9laR_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/8fa83fea9e1e866a11e31a00dbe58c97/L5ozj9f0PX4PQ864pbUt7PYTLfOySF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/3c9f09ce0317fdbf2199438da624ef26/wQrhudyvLgRCg2bvWKGyePtmhI6yh6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/235f87e883adc39c3c5c392aab084c4a/e4p57PTsISfbCCMcxSYyifYr5EEYhr_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/20189bedc7f5a0720311b6fb5413302f/tjxCpBXiJPhyzAIuy6ma0sBtZKpi9w_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/ca9546c2ceb1b830a1e5f190805fb6fd/21FyOwvaD7hVXctaQrX9RVy5muIn6D_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/24eeb2dee1e42ab06aaed3c486f00939/CnXoyCADKyul3CulQ9MSBa4MeB9mZX_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 HTCACHE storing content of url https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale, 78066 bytes
|
||
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale, STACKING TIME = 1, PARSING TIME = 15
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=freeland-thornton
|
||
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Zj20Cm_26NP5 (1735154922159079424)]} 0 2
|
||
I 2022/06/09 11:04:44 Fulltext indexing: Zj20Cm_26NP5 https://www.criterion.com/shop/browse/list?director=freeland-thornton
|
||
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=freeland-thornton [Zj20Cm_26NP5]
|
||
Description: Thornton Freeland films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12506 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=misumi-kenji
|
||
I 2022/06/09 11:04:44 Fulltext indexing: Z4Lgxm_26NP5 https://www.criterion.com/shop/browse/list?director=misumi-kenji
|
||
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Z4Lgxm_26NP5 (1735154922245062656)]} 0 7
|
||
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 1246 words in URL https://www.criterion.com/shop/browse/list?director=misumi-kenji [Z4Lgxm_26NP5]
|
||
Description: Kenji Misumi films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 13097 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:44 HTCACHE storing content of url https://www.criterion.com/films/28018-the-man-in-grey, 71256 bytes
|
||
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale
|
||
I 2022/06/09 11:04:44 HostQueue forcing crawl-delay of 251 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 502, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 0)) = 251
|
||
I 2022/06/09 11:04:44 Fulltext indexing: Zcv07G_26NP5 https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale
|
||
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 61 LINKS FROM https://www.criterion.com/films/28018-the-man-in-grey, STACKING TIME = 73, PARSING TIME = 13
|
||
I 2022/06/09 11:04:44 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=klos-elmar, 224205 bytes
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ee3210abb4d599864dbe91e0ece052be.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/b13056ccb124d15b8d7516bac7576c8e/VduGrRfHEEl6sOgrkY5looQFWyjKvX_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Zcv07G_26NP5 (1735154922382426112)]} 0 19
|
||
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 594 words in URL https://www.criterion.com/current/posts/432-a-tribute-a-canterbury-tale [Zcv07G_26NP5]
|
||
Description: A Tribute: A Canterbury Tale | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 7802 bytes |
|
||
LinkStorageTime: 26 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/images/3913-13e0f359ffe35a4c4e0598e2e9db3246/madonnaof7moons_1432_003_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/d4d406ae48cd0fbfc78216e2efb5bf43/NEeX7phTkN3xjrd1deXb0RmokIXDG3_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/74be83402097d3f4c5d9e7331de31471/43YJwgdfANxgafSJNaTDUwGxR8Gait_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/4e7a58bca1bc1539f053cc08e0e1ca82.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9dd9777e1aa8821e6a74e257ccfd7348.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://itunes.apple.com/us/movie/id811526103?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1812-2bdeb818c107e95738f59894990c22b2/oTM1jWGYaWx6KHPFFGsXiyVmbdpCPi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/the-man-in-grey?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.amazon.com/dp/B00JP33FH8 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/bf29c35e8cf957012e14fd777d65aa52.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/27426a8ea03aee693162a016f3af1fb9/p7llBxrsJUQX2Ov6G91uPDpYYs0fWN_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/films/28018-the-man-in-grey
|
||
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZPBnPe_26NP5 (1735154922482040832)]} 0 1
|
||
I 2022/06/09 11:04:44 Fulltext indexing: ZPBnPe_26NP5 https://www.criterion.com/films/28018-the-man-in-grey
|
||
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 289 words in URL https://www.criterion.com/films/28018-the-man-in-grey [ZPBnPe_26NP5]
|
||
Description: The Man in Grey (1943) | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3045 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/17a09b267d7df228c099117fcc503b0b/HEN7Igx0rZ7xS24SClFcPTBRs9HxSL_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=klos-elmar, STACKING TIME = 3, PARSING TIME = 43
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=klos-elmar
|
||
I 2022/06/09 11:04:44 Fulltext indexing: ZjADWm_26NP5 https://www.criterion.com/shop/browse/list?director=klos-elmar
|
||
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZjADWm_26NP5 (1735154922581655552)]} 0 2
|
||
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=klos-elmar [ZjADWm_26NP5]
|
||
Description: Elmar Klos films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12502 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:44 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 10)) = 246
|
||
I 2022/06/09 11:04:44 HTCACHE storing content of url https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx, 77603 bytes
|
||
I 2022/06/09 11:04:44 SWITCHBOARD CRAWL: ADDED 68 LINKS FROM https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx, STACKING TIME = 2, PARSING TIME = 17
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/5639adc405d0157ff9fa02b1dedb6653.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/c438eda5cb94c4b6b8daa783749e4f2b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/24eeb2dee1e42ab06aaed3c486f00939/CnXoyCADKyul3CulQ9MSBa4MeB9mZX_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-baby-cart-at-the-river-styx?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.amazon.com/dp/B01M6EAKVM - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://itunes.apple.com/us/movie/id1169371082?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/c177fa8a7f4dfbc5dd12bf23b2332471.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/9c35734b41649852c32c085956e55337.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:44 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx
I 2022/06/09 11:04:44 Fulltext indexing: ZJnYce_26NP5 https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx
I 2022/06/09 11:04:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZJnYce_26NP5 (1735154922894131200)]} 0 1
I 2022/06/09 11:04:44 SWITCHBOARD *Indexed 343 words in URL https://www.criterion.com/films/28723-lone-wolf-and-cub-baby-cart-at-the-river-styx [ZJnYce_26NP5]
Description: Lone Wolf and Cub: Baby Cart at the River Styx (1972) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3869 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:44 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 10)) = 245
I 2022/06/09 11:04:45 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 11)) = 244
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello, 226510 bytes
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/224ad3784ebfb34f98b1af628337f3da/gf5q2Dxvw2rDGLoNCNOnF3L53EUKqK_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/c3671b7d05dd992c80de898da6f724a8/iKAAnLTwUhBFo0X62zBb8ijm258Sey_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio, 224779 bytes
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello, STACKING TIME = 5, PARSING TIME = 29
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/27426a8ea03aee693162a016f3af1fb9/p7llBxrsJUQX2Ov6G91uPDpYYs0fWN_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1803-6a8b76d7af61cbee31aced4a8191a85a/zKIrZpwHhlb5ETRsoB0UujdI9OwYFz_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/b59efc718fc3309a4eb76255310280b8/MpKeZ33lims6VNmUQPeUZ06u1HCgd5_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 13)) = 243
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio, STACKING TIME = 1, PARSING TIME = 59
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1976-ece4132f4abef8c4e7beb0a0edffc9a8/y26UyQwNxt4FguJSgQIZWpCNlLsjHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/89594ab78e17a9778dc78f275076d760/kADie75znXN9EHJ7qBhLrNwch9t918_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello
I 2022/06/09 11:04:45 Fulltext indexing: ZJj0am_26NP5 https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello
I 2022/06/09 11:04:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZJj0am_26NP5 (1735154923591434240)]} 0 3
I 2022/06/09 11:04:45 SWITCHBOARD *Indexed 1210 words in URL https://www.criterion.com/shop/browse/list?director=matarazzo-raffaello [ZJj0am_26NP5]
Description: Raffaello Matarazzo films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12707 bytes |
LinkStorageTime: 90 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/current/posts/663-trafic-watching-the-wheels, 80500 bytes
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/663-trafic-watching-the-wheels, STACKING TIME = 4, PARSING TIME = 18
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/images/4352-1cf1f70c7926a0f997ce964c45e36f81/img_current_545_007_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 10)) = 246
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters, 64277 bytes
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters, STACKING TIME = 8, PARSING TIME = 11
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://player.vimeo.com/video/321824302 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/66ba5fe958ad045a4f5b5ebe54570c97/A3LCcKjo5itwNBka6dKwl6td7sSpB0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio
I 2022/06/09 11:04:45 Fulltext indexing: ZFuk9m_26NP5 https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio
I 2022/06/09 11:04:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ZFuk9m_26NP5 (1735154923820023808)]} 0 2
I 2022/06/09 11:04:45 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse/list?director=gomez-muriel-emilio [ZFuk9m_26NP5]
Description: Emilio Gómez Muriel films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12526 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:45 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/663-trafic-watching-the-wheels
I 2022/06/09 11:04:45 Fulltext indexing: Y_9D5G_26NP5 https://www.criterion.com/current/posts/663-trafic-watching-the-wheels
I 2022/06/09 11:04:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Y_9D5G_26NP5 (1735154923917541376)]} 0 7
I 2022/06/09 11:04:45 SWITCHBOARD *Indexed 1063 words in URL https://www.criterion.com/current/posts/663-trafic-watching-the-wheels [Y_9D5G_26NP5]
Description: Trafic: Watching the Wheels | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14877 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:45 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters
I 2022/06/09 11:04:45 Fulltext indexing: YrB28G_26NP5 https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters
I 2022/06/09 11:04:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[YrB28G_26NP5 (1735154923940610048)]} 0 1
I 2022/06/09 11:04:45 SWITCHBOARD *Indexed 317 words in URL https://www.criterion.com/current/posts/6274-robert-zemeckis-looks-back-on-his-debut-film-jitters [YrB28G_26NP5]
Description: Robert Zemeckis Looks Back on His Debut-Film Jitters | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3721 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:45 HTCACHE storing content of url https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction, 77676 bytes
I 2022/06/09 11:04:45 SWITCHBOARD CRAWL: ADDED 77 LINKS FROM https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction, STACKING TIME = 3, PARSING TIME = 7
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/38-modell-av-dramatiska-teatern-skala-1-50?locale=en-US - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/6fede1f031c07b843ffa8965d47043f3/9QWkE37UXlpfhZrTIsaZHdWmooGJ1a_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/262-glasskivor-tio-stycken-till-laterna-magica - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_filmprojector.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/74A-papperskorg?locale=en-US - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/311-filmpris-golden-globe-award-for-hostsonaten-1978 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/254-fotografi-john-bryson-fotografi - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_JAWS.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/bukowskis/status/4454678676 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/270-schackpjaser-31-stycken - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/261-laterna-magica-lapierre-paris-ca-1870%20 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman-auction-wastebasket.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_header.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/4caa477448c9fe2ee28f80df08f4d89b/NmCRkRglJzsgL3AKwNshj7ENlgQZIN_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/33-sprattelgubbe?locale=en-US - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_chess_set.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/films/95d0fe890da5c6008298dc39ca2195b4/oTvnw5EnwHQLpwttOLxX4yAOWY7o0j_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_5.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/263-filmprojektor-1920-tal-e-marland-ab-stockholm - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://www.bukowskis.se/auctions/H022/261-laterna-magica-lapierre-paris-ca-1870?locale=en-US - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED http://news.bbc.co.uk/2/hi/europe/8280740.stm - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_theatre_two-up.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:45 REJECTED https://s3.amazonaws.com/criterion_images/current/Current_bergman_auction_jumping_jack.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 501, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:46 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction
|
||
I 2022/06/09 11:04:46 Fulltext indexing: YrBCbG_26NP5 https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction
|
||
I 2022/06/09 11:04:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[YrBCbG_26NP5 (1735154924016107520)]} 0 2
|
||
I 2022/06/09 11:04:46 SWITCHBOARD *Indexed 336 words in URL https://www.criterion.com/current/posts/1244-ingmar-bergman-s-belongings-sold-at-auction [YrBCbG_26NP5]
|
||
Description: Ingmar Bergman’s Belongings Sold at Auction | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3975 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:46 HTCACHE storing content of url https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white, 73772 bytes
|
||
I 2022/06/09 11:04:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 496, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:46 SWITCHBOARD CRAWL: ADDED 80 LINKS FROM https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white, STACKING TIME = 5, PARSING TIME = 8
|
||
I 2022/06/09 11:04:46 REJECTED https://www.screendaily.com/reviews/ash-is-purest-white-cannes-review/5129220.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.thewrap.com/ash-purest-white-film-review-characters-growing-pains-china/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://film.avclub.com/jean-luc-godard-returns-to-cannes-to-make-a-dunce-out-o-1825979305 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-cannes-diary-ash-is-purest-white-cold-war-20180512-htmlstory.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-5-changless-change-jean-luc-godard-and-jia-zhangke - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-cannes-day-four/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://variety.com/2018/film/asia/jia-zhangke-making-his-most-expensive-indie-film-ash-is-purest-white-1202805661/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://thefilmstage.com/reviews/cannes-review-with-ash-is-purest-white-jia-zhangke-stages-another-exceptional-platform-for-zhao-tao/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/ash-purest-white-jia-zhangke-zhao-tao-magisterial-mob-critique - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.theguardian.com/film/2018/may/11/ash-is-purest-white-review-chinese-gangsters-girlfriend-saga-burns-bright - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://deadline.com/2018/05/cannes-buzz-films-girls-of-the-sun-and-ash-is-purest-white-set-for-us-distribution-by-cohen-media-group-1202394690/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://criterion-production.s3.amazonaws.com/WK9FBxqSEA7Dlce2yWlaEbLstGf5jx.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.hollywoodreporter.com/review/ash-is-purest-white-cannes-2018-1111288 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://lwlies.com/festivals/ash-is-purest-white-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://www.indiewire.com/2018/05/ash-is-purest-white-review-jia-zhangke-cannes-2018-1201963491/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://cine-vue.com/2018/05/cannes-2018-ash-is-purest-white-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://www.vulture.com/2018/05/ash-is-purest-white-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://icsfilm.org/reviews/cannes-2018-review-ash-purest-white-jia-zhangke/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.filmcomment.com/blog/film-week-ash-purest-white/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.ioncinema.com/reviews/jia-zhangke-ash-is-purest-white-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.youtube.com/embed/Xr7B-GhQaTM?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://desistfilm.com/cannes-2018-ash-is-purest-white-by-jia-zhang-ke/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://variety.com/2018/film/reviews/ash-is-purest-white-review-1202802929/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.festival-cannes.com/en/festival/films/jiang-hu-er-nv - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.vanityfair.com/hollywood/2018/05/the-angel-ash-is-purest-white-cannes-movie-reviews - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.rogerebert.com/cannes/cannes-2018-ash-is-the-purest-white-girls-of-the-sun-girl - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://theplaylist.net/ash-purest-white-cannes-review-20180515/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354268 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 SWITCHBOARD Excluded 23 words in URL https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white
|
||
I 2022/06/09 11:04:46 Fulltext indexing: YqqKcG_26NP5 https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white
|
||
I 2022/06/09 11:04:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[YqqKcG_26NP5 (1735154924372623360)]} 0 2
|
||
I 2022/06/09 11:04:46 SWITCHBOARD *Indexed 482 words in URL https://www.criterion.com/current/posts/5658-jia-zhangke-s-ash-is-purest-white [YqqKcG_26NP5]
|
||
Description: Jia Zhangke’s Ash Is Purest White | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 5647 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 496, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:46 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1, 224282 bytes
|
||
I 2022/06/09 11:04:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:46 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1, STACKING TIME = 7, PARSING TIME = 29
|
||
I 2022/06/09 11:04:46 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 REJECTED https://s3.amazonaws.com/criterion-production/films/bed1dc8df02842d6a75325665e718ebd/da8xTBLVhcfx0KQXSyOOMImKRe6s2r_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:46 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1
|
||
I 2022/06/09 11:04:46 Fulltext indexing: YZXxhm_26NP5 https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1
|
||
I 2022/06/09 11:04:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[YZXxhm_26NP5 (1735154924884328448)]} 0 3
|
||
I 2022/06/09 11:04:46 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=von-trotta-margarethe-1 [YZXxhm_26NP5]
|
||
Description: Margarethe von Trotta films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12501 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:47 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:47 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu, 224188 bytes
|
||
I 2022/06/09 11:04:47 HTCACHE storing content of url https://www.criterion.com/current/posts/1231-mayerling-star-crossed, 84641 bytes
|
||
I 2022/06/09 11:04:47 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu, STACKING TIME = 1, PARSING TIME = 25
|
||
I 2022/06/09 11:04:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/726755430bd298a5aa424f68a792bcea/aQ0KQhoip19olkpwhmNbrAfMY6qhAB_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/1231-mayerling-star-crossed, STACKING TIME = 2, PARSING TIME = 81
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/babb52f89b5488d00cb76a924d7e06eb/XPxkbzNVy36iDfcGUxaqpxFC6LJ0tI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/images/4467-3a447d1c16dc2d86db906fc2a056e122/current_553_014_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu
|
||
I 2022/06/09 11:04:47 Fulltext indexing: X87vrm_26NP5 https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu
|
||
I 2022/06/09 11:04:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[X87vrm_26NP5 (1735154925276495872)]} 0 6
|
||
I 2022/06/09 11:04:47 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=kassovitz-mathieu [X87vrm_26NP5]
|
||
Description: Mathieu Kassovitz films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12488 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:47 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 491, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:47 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/1231-mayerling-star-crossed
|
||
I 2022/06/09 11:04:47 Fulltext indexing: X10F5G_26NP5 https://www.criterion.com/current/posts/1231-mayerling-star-crossed
|
||
I 2022/06/09 11:04:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[X10F5G_26NP5 (1735154925323681792)]} 0 2
|
||
I 2022/06/09 11:04:47 SWITCHBOARD *Indexed 527 words in URL https://www.criterion.com/current/posts/1231-mayerling-star-crossed [X10F5G_26NP5]
|
||
Description: Mayerling: Star-Crossed | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 6195 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:47 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=tennyson-pen, 224751 bytes
|
||
I 2022/06/09 11:04:47 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=tennyson-pen, STACKING TIME = 2, PARSING TIME = 23
|
||
I 2022/06/09 11:04:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/5b83c252b600e87cb7d663f2b1d1ac8d/F8v9OxoNple8ycZ2VntXSIFAXCg8pJ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 490, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
|
||
I 2022/06/09 11:04:47 HTCACHE storing content of url https://www.criterion.com/films/354, 79512 bytes
|
||
I 2022/06/09 11:04:47 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=tennyson-pen
|
||
I 2022/06/09 11:04:47 SWITCHBOARD CRAWL: ADDED 72 LINKS FROM https://www.criterion.com/films/354, STACKING TIME = 3, PARSING TIME = 19
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/b653a1545863d441cf9b8a8bc50946b8/SXcj1Zf8bWoyaoUEuzlFAc4gPNGwYm_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/ivan-the-terrible-part-ii?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1838-ee328a31205b114ef125fd81b54b5cd0/VZGhEsbGQY3luUNqMc64IKmXGoRe9U_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/88b2556fa888690aab792e7454a6fe26.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 Fulltext indexing: XyrSom_26NP5 https://www.criterion.com/shop/browse/list?director=tennyson-pen
|
||
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/82523d90468b398c9e487fdf969d36d6.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/53a8103ba77901a31cb565c8bf2c7338.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[XyrSom_26NP5 (1735154925698023424)]} 0 4
|
||
I 2022/06/09 11:04:47 SWITCHBOARD *Indexed 1199 words in URL https://www.criterion.com/shop/browse/list?director=tennyson-pen [XyrSom_26NP5]
|
||
Description: Pen Tennyson films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12523 bytes |
|
||
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/6918d33a17ccf9cdc1667c362121d593.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/0870d8add1b5719448d1f445679e503a.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/1068-194f3f49c166adeecb9d66968442e517/jGwTz0i0KByW1oG0AH1KWNjVFgbkXE_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/posts/1131-e236cb8c87ec5b809c2301982224d1ec/IVAN_rosenbaum_still_1_original.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/55b413192f2d6d855cffb078b25156f4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/b1039bd526df1e78680c71b08daa9071.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e27c982cba14c95724fdb5c647e63ded.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/76cd8b50aedb0c256bf117124487f494.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/19c902a73e243b9293de4c717430f639/H1rWEdJtowN7Xh9vSnOvPU9I6y0AgT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/aed45dbeaf63414624b16890eb458dea/fSaXGJq2BBkowhHXw2FJft0UoNIdnI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/b49b46aa1dacc0e1530a24cf23c764b4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/films/32d0f143682b34732d0fdd3ed8e0e7bb/fBklqTm33Jg32mLL79K5I4cZHSt9cE_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/c651b027da04c8a0a1553975e3098af2.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5626-/ai8UvErjxFGByr17RCrqAnwWw7xvRj_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/76b270cd5220e804477fd41e9c907d3b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:47 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/films/354
I 2022/06/09 11:04:47 Fulltext indexing: XC-kYe_26NP5 https://www.criterion.com/films/354
I 2022/06/09 11:04:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[XC-kYe_26NP5 (1735154925746257920)]} 0 3
I 2022/06/09 11:04:47 SWITCHBOARD *Indexed 371 words in URL https://www.criterion.com/films/354 [XC-kYe_26NP5]
Description: Ivan the Terrible, Part II (1958) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4329 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:47 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:48 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:48 HTCACHE storing content of url https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine, 64340 bytes
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine, STACKING TIME = 8, PARSING TIME = 20
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://player.vimeo.com/video/271504473 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/films/78a86bc12fbed832f0b341609a22fa52/lun1ptGstEhxOhmc24pORDXEVkN2Ve_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine
I 2022/06/09 11:04:48 Fulltext indexing: WHtrsG_26NP5 https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[WHtrsG_26NP5 (1735154926169882624)]} 0 2
I 2022/06/09 11:04:48 SWITCHBOARD *Indexed 348 words in URL https://www.criterion.com/current/posts/5757-the-hope-that-fueled-bowling-for-columbine [WHtrsG_26NP5]
Description: The Hope That Fueled Bowling for Columbine | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3913 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:48 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:48 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=brook-peter, 224668 bytes
I 2022/06/09 11:04:48 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 9)) = 241
I 2022/06/09 11:04:48 HTCACHE storing content of url https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law, 85182 bytes
I 2022/06/09 11:04:48 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=brook-peter, STACKING TIME = 3, PARSING TIME = 94
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1865-2b74f037f454df1d78013f06dc4aaea4/0TUeLtsha8fzPrVMeQ8rNOnpUVmvME_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/films/3c8e34ba4a541897737232a90611f947/uMJyxowOApuQ9O1hh2wLe26pqL14z1_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law, STACKING TIME = 5, PARSING TIME = 20
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://criterion-production.s3.amazonaws.com/wkjM3lyFlBq2Xwm4CbquxntstEQKIL.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://criterion-production.s3.amazonaws.com/VpkypwbMpTd43ooIA0Vvm3oz1Tum8Y.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://criterion-production.s3.amazonaws.com/A5OPNRaWvdavhX6UYpXnvGUGMTzAeL.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/films/d3fa0bd2e5949b9e3c861222fb594d95/ASnRN4Kj6AdEv8RJTDrHKhIve2ZFQY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6095-/4Fs4E8gQdXRg8bDqotCILcvvsJDWRu_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@417da17a[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wl(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772677667}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wm(7.7.3):C12:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680783}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wn(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680861}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wo(7.7.3):C6:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772682978}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wp(7.7.3):C22:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772688229}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:48 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=brook-peter
I 2022/06/09 11:04:48 Fulltext indexing: W0frrm_26NP5 https://www.criterion.com/shop/browse/list?director=brook-peter
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[W0frrm_26NP5 (1735154926541078528)]} 0 7
I 2022/06/09 11:04:48 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse/list?director=brook-peter [W0frrm_26NP5]
Description: Peter Brook films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12492 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:48 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law
I 2022/06/09 11:04:48 Fulltext indexing: WDcdcG_26NP5 https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[WDcdcG_26NP5 (1735154926636498944)]} 0 4
I 2022/06/09 11:04:48 SWITCHBOARD *Indexed 1045 words in URL https://www.criterion.com/current/posts/6095-a-dry-white-season-justice-against-the-law [WDcdcG_26NP5]
Description: A Dry White Season: Justice Against the Law | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 17143 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:48 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 483, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:48 HTCACHE storing content of url https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season, 68433 bytes
I 2022/06/09 11:04:48 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:04:48 REJECTED https://player.vimeo.com/video/304438092 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/28473-werner-herzog-eats-his-shoe
I 2022/06/09 11:04:48 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
I 2022/06/09 11:04:48 REJECTED https://www.criterion.com/films/28473-werner-herzog-eats-his-shoe - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/28473-werner-herzog-eats-his-shoe aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:04:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/films/d3fa0bd2e5949b9e3c861222fb594d95/ASnRN4Kj6AdEv8RJTDrHKhIve2ZFQY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season
I 2022/06/09 11:04:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[V9xhxe_26NP5 (1735154926815805440)]} 0 11
I 2022/06/09 11:04:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[WBRjMG_26NP5 (1735154926836776960)]} 0 1
|
||
I 2022/06/09 11:04:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:48 Fulltext indexing: WBRjMG_26NP5 https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season
|
||
I 2022/06/09 11:04:48 SWITCHBOARD *Indexed 312 words in URL https://www.criterion.com/current/posts/6106-euzhan-palcy-remembers-brando-s-nerves-on-the-set-of-a-dry-white-season [WBRjMG_26NP5]
|
||
Description: Euzhan Palcy Remembers Brando’s Nerves on the Set of A Dry White Season | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3724 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:48 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 480, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:49 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 480, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:49 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 480, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:49 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lo, 225147 bytes
|
||
I 2022/06/09 11:04:49 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://s3.amazonaws.com/criterion-production/films/028d5306fb6147b73970b738eb19a93a/HQcvC6MhrRZyx3VN4DHa77hZjWiOjk_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=lo, STACKING TIME = 3, PARSING TIME = 33
|
||
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://s3.amazonaws.com/criterion-production/films/e8d4cd2c5dd1b2541a3c8325a6d1805f/W5gKGPvtkm5evOJ9devGYL935KMAdy_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1957-f90d4c48a2f932ffe7df386499f9477e/73k4EkSiXEfsdi097fieFBGdb39vlg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 HostQueue forcing crawl-delay of 233 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 483, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 17)) = 233
|
||
I 2022/06/09 11:04:49 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=lo
|
||
I 2022/06/09 11:04:49 Fulltext indexing: V8su2m_26NP5 https://www.criterion.com/shop/browse/list?director=lo
|
||
I 2022/06/09 11:04:49 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[V8su2m_26NP5 (1735154927844458496)]} 0 3
|
||
I 2022/06/09 11:04:49 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=lo [V8su2m_26NP5]
|
||
Description: Lo Wei films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12502 bytes |
|
||
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:49 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=zinnemann-fred, 224766 bytes
|
||
I 2022/06/09 11:04:49 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 486, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
|
||
I 2022/06/09 11:04:49 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=zinnemann-fred, STACKING TIME = 4, PARSING TIME = 69
|
||
I 2022/06/09 11:04:49 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1976-ece4132f4abef8c4e7beb0a0edffc9a8/y26UyQwNxt4FguJSgQIZWpCNlLsjHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/89594ab78e17a9778dc78f275076d760/kADie75znXN9EHJ7qBhLrNwch9t918_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=chukhrai-grigori, 224800 bytes
|
||
I 2022/06/09 11:04:50 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 490, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
|
||
I 2022/06/09 11:04:50 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=chukhrai-grigori, STACKING TIME = 1, PARSING TIME = 45
|
||
I 2022/06/09 11:04:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/f89499b3cbca9503a0cf83ecba01142f/LlDUqJCmbsiL8xBf409dwhXIFKOKBt_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=zinnemann-fred
|
||
I 2022/06/09 11:04:50 Fulltext indexing: Vc_HQm_26NP5 https://www.criterion.com/shop/browse/list?director=zinnemann-fred
|
||
I 2022/06/09 11:04:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Vc_HQm_26NP5 (1735154928327852032)]} 0 2
|
||
I 2022/06/09 11:04:50 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=zinnemann-fred [Vc_HQm_26NP5]
|
||
Description: Fred Zinnemann films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12520 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:50 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=chukhrai-grigori
|
||
I 2022/06/09 11:04:50 Fulltext indexing: Uu2nwm_26NP5 https://www.criterion.com/shop/browse/list?director=chukhrai-grigori
|
||
I 2022/06/09 11:04:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Uu2nwm_26NP5 (1735154928394960896)]} 0 2
|
||
I 2022/06/09 11:04:50 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=chukhrai-grigori [Uu2nwm_26NP5]
|
||
Description: Grigori Chukhrai films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12532 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:50 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 490, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:50 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ophuls-max, 225801 bytes
|
||
I 2022/06/09 11:04:50 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=ophuls-max, STACKING TIME = 2, PARSING TIME = 81
|
||
I 2022/06/09 11:04:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/e5c7435a9dbc4d547966c42139b17e05/CIDQTwh6cAejzHXtCFcPu4on6pTik2_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/2d9f65a009ae0df30f7268f5cad30602/hCVpEfIN7DST5IptZEPGxHXn1hTR9M_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/a536a3fb3af463edd2e64152b9661f5c/vwUO7GHWr8ltPBhvvDbO51GIRnqIDL_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/1b0975e10aabfa4e04861a3c490964d7/Lu59y3u3gBwDYBDmvQripKKH1K9bTA_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 496, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:50 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=ophuls-max
|
||
I 2022/06/09 11:04:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[UuT3nm_26NP5 (1735154928798662656)]} 0 2
|
||
I 2022/06/09 11:04:50 Fulltext indexing: UuT3nm_26NP5 https://www.criterion.com/shop/browse/list?director=ophuls-max
|
||
I 2022/06/09 11:04:50 SWITCHBOARD *Indexed 1206 words in URL https://www.criterion.com/shop/browse/list?director=ophuls-max [UuT3nm_26NP5]
|
||
Description: Max Ophuls films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12625 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:50 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=weill-claudia, 224184 bytes
|
||
I 2022/06/09 11:04:50 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=weill-claudia, STACKING TIME = 1, PARSING TIME = 20
|
||
I 2022/06/09 11:04:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/bf7dd923004d069192e6cf732dd51e1f/x7ZTLqEHWNMZWFdmuX89ePaJ9aZNfN_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 HTCACHE storing content of url https://www.criterion.com/current/posts/2596-following-nolan-begins, 77618 bytes
|
||
I 2022/06/09 11:04:50 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 503, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 4)) = 247
|
||
I 2022/06/09 11:04:50 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2596-following-nolan-begins, STACKING TIME = 2, PARSING TIME = 221
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/images/5052-29798d52bd807cc071b0cb5bf35c99be/Following_Essay_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 REJECTED https://s3.amazonaws.com/criterion-production/films/872420cba7088d5f5f64157663c6c2c5/PWMlxSDrb4crZTImCxrDbFsxYZMb4k_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:50 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=weill-claudia
|
||
I 2022/06/09 11:04:50 Fulltext indexing: Unf76m_26NP5 https://www.criterion.com/shop/browse/list?director=weill-claudia
|
||
I 2022/06/09 11:04:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Unf76m_26NP5 (1735154929203412992)]} 0 2
|
||
I 2022/06/09 11:04:50 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=weill-claudia [Unf76m_26NP5]
|
||
Description: Claudia Weill films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12491 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/2596-following-nolan-begins
|
||
I 2022/06/09 11:04:51 Fulltext indexing: Uc4rLG_26NP5 https://www.criterion.com/current/posts/2596-following-nolan-begins
|
||
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Uc4rLG_26NP5 (1735154929276813312)]} 0 3
|
||
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 942 words in URL https://www.criterion.com/current/posts/2596-following-nolan-begins [Uc4rLG_26NP5]
|
||
Description: Following: Nolan Begins | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12627 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=pearce-leslie, 224219 bytes
|
||
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history, 160126 bytes
|
||
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=pearce-leslie, STACKING TIME = 1, PARSING TIME = 33
|
||
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/3a4a52811b630a9836c1b10cb2c55a38/1DZVBE8PnMfkggyvh5s9f7K2TSAiF0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 507, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 10)) = 243
|
||
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history, STACKING TIME = 6, PARSING TIME = 105
|
||
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films, 74997 bytes
|
||
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 50 LINKS FROM https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films, STACKING TIME = 2, PARSING TIME = 13
|
||
I 2022/06/09 11:04:51 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/3e0bdd8b1538100c61a1c69840258f0d.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1880-d84b68ccfdfdb7ef19d89946ab43b5cb/XxW9C5BcXk4DjYmbLD9UBavXz6xwa0_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/55540be3417fc8d3aca18151f48009d7/1wrYMIHOtZzcuJuYju1L7NYD14iLdR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/2e140cbe7177d67fd312f8acdea2a4d4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/4d12d0b6523c10060860b2695f32672f/KmC0p8LFp4dAfgyLalLkQRoZgakcY3_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/b5de3107dbc1cac694c7581d787b3cd8/0EwZ3HapM3kv8NAIbGAzViVdQBbJHS_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=pearce-leslie
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5d62080f96d3e0ca24d41d44c1375a37.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[UPFfrm_26NP5 (1735154929557831680)]} 0 2
|
||
I 2022/06/09 11:04:51 Fulltext indexing: UPFfrm_26NP5 https://www.criterion.com/shop/browse/list?director=pearce-leslie
|
||
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=pearce-leslie [UPFfrm_26NP5]
|
||
Description: Leslie Pearce films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12492 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:51 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 506, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 11)) = 242
|
||
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood, 75563 bytes
|
||
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 69 LINKS FROM https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood, STACKING TIME = 6, PARSING TIME = 9
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.nytimes.com/2019/05/21/movies/quentin-tarantino-once-upon-a-time-in-hollywood.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/once-upon-time-hollywood-quentin-tarantino-1960s-golden-age-meta-movie-charles-manson-sharon-tate-murders - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://criterion-production.s3.amazonaws.com/vDHqMO5u7IuFHEwOt7sBKVKcZtGu5g.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-8-tarantino-s-hollywood-elegy-and-bdsm-mourning - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.festival-cannes.com/en/festival/films/once-upon-a-time-in-hollywood - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED http://www.impawards.com/2019/once_upon_a_time_in_hollywood_ver5.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.telegraph.co.uk/films/0/upon-time-hollywood-review-tarantinos-ode-pre-manson-la-pure/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.rogerebert.com/cannes/cannes-2019-once-upon-a-time-in-hollywood - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED http://www.impawards.com/2019/once_upon_a_time_in_hollywood_ver6.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://time.com/5593402/once-upon-a-time-in-hollywood-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.esquire.com/entertainment/movies/a27458589/once-upon-a-time-in-hollywood-leonardo-dicaprio-brad-pitt-quentin-tarantino-interview/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://theplaylist.net/once-upon-time-in-hollywood-cannes-review-20190521/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://twitter.com/OnceInHollywood/status/1130414077484777472 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED http://www.impawards.com/2019/once_upon_a_time_in_hollywood_ver4.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.vulture.com/2019/05/once-upon-a-time-in-hollywood-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-quentin-tarantino-once-upon-a-time-in-hollywood-20190521-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/embed/ELeMaP8EPAA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history
|
||
I 2022/06/09 11:04:51 Fulltext indexing: UO6KaG_26NP5 https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history
|
||
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[UO6KaG_26NP5 (1735154929796907008)]} 0 14
|
||
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 1343 words in URL https://www.criterion.com/current/posts/1049-sweet-death-veronika-voss-production-history [UO6KaG_26NP5]
|
||
Description: Sweet Death:Veronika Voss Production History | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 21952 bytes |
|
||
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 23 words in URL https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films
|
||
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[UAhyS3_26NP5 (1735154929838850048)]} 0 2
|
||
I 2022/06/09 11:04:51 Fulltext indexing: UAhyS3_26NP5 https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films
|
||
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 414 words in URL https://www.criterion.com/boxsets/1110-andre-gregory-wallace-shawn-3-films [UAhyS3_26NP5]
|
||
Description: André Gregory & Wallace Shawn: 3 Films | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 8508 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood
|
||
I 2022/06/09 11:04:51 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 502, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
|
||
I 2022/06/09 11:04:51 Fulltext indexing: T4pMJG_26NP5 https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood
|
||
I 2022/06/09 11:04:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[T4pMJG_26NP5 (1735154929912250368)]} 0 3
|
||
I 2022/06/09 11:04:51 SWITCHBOARD *Indexed 662 words in URL https://www.criterion.com/current/posts/6386-quentin-tarantino-s-once-upon-a-time-in-hollywood [T4pMJG_26NP5]
|
||
Description: Quentin Tarantino’s Once Upon a Time . . . in Hollywood | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 8527 bytes |
|
||
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=allen-lewis, 224135 bytes
|
||
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=allen-lewis, STACKING TIME = 2, PARSING TIME = 31
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/9a47380e81bad08a322032a14158be83/8ftdZ1FybsRNtjEkE0lr8zYDcdx8lQ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 501, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 HTCACHE storing content of url https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura, 70112 bytes
|
||
I 2022/06/09 11:04:51 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=allen-lewis
|
||
I 2022/06/09 11:04:51 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura, STACKING TIME = 1, PARSING TIME = 16
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/images/8015-990bc7db67bf5bf3c2bd807e835c9113/carmen_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 Fulltext indexing: TsLKim_26NP5 https://www.criterion.com/shop/browse/list?director=allen-lewis
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/films/b93ed3824aabf3784387991675dde82c/J2DKHtUkEGO5iYbKWnV7TIatSWH6x4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:51 REJECTED https://www.youtube.com/embed/ao4pKJxhZQQ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[TsLKim_26NP5 (1735154930280300544)]} 0 12
|
||
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=allen-lewis [TsLKim_26NP5]
|
||
Description: Lewis Allen films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12453 bytes |
|
||
LinkStorageTime: 14 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:52 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura
|
||
I 2022/06/09 11:04:52 Fulltext indexing: Tk3xkG_26NP5 https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura
|
||
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Tk3xkG_26NP5 (1735154930308612096)]} 0 1
|
||
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 294 words in URL https://www.criterion.com/current/posts/4434-laughing-and-crying-with-carmen-maura [Tk3xkG_26NP5]
|
||
Description: Laughing and Crying With Carmen Maura | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3411 bytes |
|
||
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:52 HTCACHE storing content of url https://www.criterion.com/boxsets/204-eisenstein-the-sound-years, 69909 bytes
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/boxsets/204-eisenstein-the-sound-years, STACKING TIME = 4, PARSING TIME = 13
|
||
I 2022/06/09 11:04:52 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/be0e802e835ac927eaf3be41589e23e0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1838-ee328a31205b114ef125fd81b54b5cd0/VZGhEsbGQY3luUNqMc64IKmXGoRe9U_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/films/19c902a73e243b9293de4c717430f639/H1rWEdJtowN7Xh9vSnOvPU9I6y0AgT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/films/aed45dbeaf63414624b16890eb458dea/fSaXGJq2BBkowhHXw2FJft0UoNIdnI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/films/b653a1545863d441cf9b8a8bc50946b8/SXcj1Zf8bWoyaoUEuzlFAc4gPNGwYm_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/a4754e30730b14f21ea09489a09de732.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/76b270cd5220e804477fd41e9c907d3b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/boxsets/204-eisenstein-the-sound-years
|
||
I 2022/06/09 11:04:52 Fulltext indexing: TSqfP3_26NP5 https://www.criterion.com/boxsets/204-eisenstein-the-sound-years
|
||
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[TSqfP3_26NP5 (1735154930553978880)]} 0 2
|
||
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 321 words in URL https://www.criterion.com/boxsets/204-eisenstein-the-sound-years [TSqfP3_26NP5]
|
||
Description: Eisenstein: The Sound Years | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 5861 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:52 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 495, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:52 HTCACHE storing content of url https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti, 109846 bytes
|
||
I 2022/06/09 11:04:52 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti, STACKING TIME = 1, PARSING TIME = 8
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/images/4415-66fa9593ad41744878169db79b52b613/294_017_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti
|
||
I 2022/06/09 11:04:52 HTCACHE storing content of url https://www.criterion.com/current/posts/2079-arigato, 65630 bytes
|
||
I 2022/06/09 11:04:52 Fulltext indexing: S5oIHG_26NP5 https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti
|
||
I 2022/06/09 11:04:52 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/2079-arigato, STACKING TIME = 5, PARSING TIME = 6
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7231-/HQynBbsBpEpKg7rUCoGXwscnLL6b2Q_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7494-/aR8GPv9YcXKRW9YgcQatuPCI686YSi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/tout_image/7325-/8uSYdbbznLaIIN2ALLVLIx23JCp33f_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7615-/9oqwSPz7K69O93w5M2OzoGcJVHwTYh_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[S5oIHG_26NP5 (1735154930904203264)]} 0 9
|
||
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 737 words in URL https://www.criterion.com/current/posts/1063-l-eclisse-antonioni-and-vitti [S5oIHG_26NP5]
|
||
Description: L’eclisse: Antonioni and Vitti | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 11017 bytes |
|
||
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://s3.amazonaws.com/criterion-production/films/e62f1e61b5f52c2aaeabaeceaf58b629/BenTqN2hpuF2PKWN8v0M0BZkEMiLAM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.youtube.com/embed/dUOR7M0HpAE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:52 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/2079-arigato
|
||
I 2022/06/09 11:04:52 Fulltext indexing: S3xUSG_26NP5 https://www.criterion.com/current/posts/2079-arigato
|
||
I 2022/06/09 11:04:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[S3xUSG_26NP5 (1735154930932514816)]} 0 1
|
||
I 2022/06/09 11:04:52 SWITCHBOARD *Indexed 190 words in URL https://www.criterion.com/current/posts/2079-arigato [S3xUSG_26NP5]
|
||
Description: Arigato! | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 2241 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:52 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:52 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bennet-spencer-g, 224746 bytes
|
||
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/current/posts/611-the-milky-way-easy-striders, 115423 bytes
|
||
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bennet-spencer-g, STACKING TIME = 1, PARSING TIME = 28
|
||
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/films/ed3e04be1a68ad7e7438a4a16d69a556/a268Pr0Wtb1F3INlbaTesMbAGJcxGg_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1874-1783a5b7ab030641aab494f13c026174/ahGfceaQ9idYzYJHUy4cVjH4aRnxZo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/611-the-milky-way-easy-striders, STACKING TIME = 1, PARSING TIME = 27
|
||
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 483, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 6)) = 244
|
||
I 2022/06/09 11:04:53 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bennet-spencer-g
|
||
I 2022/06/09 11:04:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Skcohm_26NP5 (1735154931578437632)]} 0 2
|
||
I 2022/06/09 11:04:53 Fulltext indexing: Skcohm_26NP5 https://www.criterion.com/shop/browse/list?director=bennet-spencer-g
|
||
I 2022/06/09 11:04:53 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse/list?director=bennet-spencer-g [Skcohm_26NP5]
|
||
Description: Spencer G. Bennet films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12516 bytes |
|
||
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:53 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/611-the-milky-way-easy-striders
|
||
I 2022/06/09 11:04:53 Fulltext indexing: SXa_zG_26NP5 https://www.criterion.com/current/posts/611-the-milky-way-easy-striders
|
||
I 2022/06/09 11:04:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SXa_zG_26NP5 (1735154931672809472)]} 0 3
|
||
I 2022/06/09 11:04:53 SWITCHBOARD *Indexed 914 words in URL https://www.criterion.com/current/posts/611-the-milky-way-easy-striders [SXa_zG_26NP5]
|
||
Description: The Milky Way: Easy Striders | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 13063 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:53 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 483, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled, 79303 bytes
|
||
I 2022/06/09 11:04:53 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
|
||
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/images/5140-33579d22044380859d64f2d1e3034fb9/current_376_007_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled, STACKING TIME = 6, PARSING TIME = 17
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/films/3433fcaf58b444a93922cf7375557b22/KKqwnBZIwSyvSOCP8e96G1O9fbPe10_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled
|
||
I 2022/06/09 11:04:53 Fulltext indexing: SVm8mG_26NP5 https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled
|
||
I 2022/06/09 11:04:53 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:04:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SVm8mG_26NP5 (1735154932004159488)]} 0 5
|
||
I 2022/06/09 11:04:53 SWITCHBOARD *Indexed 946 words in URL https://www.criterion.com/current/posts/2827-the-life-of-oharu-not-reconciled [SVm8mG_26NP5]
|
||
Description: The Life of Oharu: Not Reconciled | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 14276 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:53 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@54dd07d7[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wk(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772672584}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wl(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM 
Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772677667}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wm(7.7.3):C12:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680783}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wn(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772680861}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wo(7.7.3):C6:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772682978}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wp(7.7.3):C22:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772688229}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wq(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772693592}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
|
||
I 2022/06/09 11:04:53 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
|
||
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard, 85227 bytes
|
||
I 2022/06/09 11:04:53 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 477, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 86 LINKS FROM https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard, STACKING TIME = 3, PARSING TIME = 14
|
||
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/festivals/cannes-fire-will-come-oliver-laxe-classicism-avant-guard-egos-1203223235/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.hollywoodreporter.com/review/climb-review-1211195 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/joan-arc-bruno-dumont-am-dram-trial - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/B38sjPKTm3o?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/chambre-212 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/festivals/karim-ainouz-cannes-un-certain-regard-the-invisible-life-1203223390/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://criterion-production.s3.amazonaws.com/IVsBX8S2Ze1yYGqZ0U1qgVqOhgerH6.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/reviews/a-brothers-love-review-1203215650/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://filmmakermagazine.com/107572-cannes-2019-dispatch-5-fire-will-come-tommaso/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.hollywoodreporter.com/review/a-magical-night-review-1212121 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://filmmakermagazine.com/107565-cannes-2019-dispatch-4-lux-aeterna-jeanne-young-ahmed/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.ioncinema.com/reviews/christophe-honore-chambre-212-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/festivals/brazils-invisible-life-of-euridice-gusmao-wins-cannes-un-certain-regard-award-1203225505/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/izUIhIj10HA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.rogerebert.com/cannes/cannes-2019-family-romance-llc-the-climb-the-invisible-life-of-euridice-gusmao - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.hollywoodreporter.com/review/fire-will-come-cannes-2019-1213080 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/GCGkZ92cpcg?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/jeanne - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/a-vida-invisivel-de-euridice-gusmao - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/liberte-1 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/8fBBGRT9ga0?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/IHW9ByMtfpI?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://filmmakermagazine.com/107547-cannes-2019-dispatch-3-little-joe-liberte/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/embed/hCgbSE9tzEE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.marthabatalha.com/en/home/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/la-femme-de-mon-frere - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://variety.com/2019/film/reviews/cannes-film-review-the-invisible-life-of-euridice-gusmao-1203225913/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.hollywoodreporter.com/news/cannes-hidden-gem-invisible-life-captures-female-life-rio-de-janeiro-1211128 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.anothergaze.com/monia-chokris-la-femme-de-mon-frere-brothers-love-ventures-beyond-hellscape-self-cannes/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-11-vulgar-confessions - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.screendaily.com/reviews/joan-of-arc-cannes-review/5139228.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/the-climb - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.festival-cannes.com/en/festival/films/o-que-arde - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard
I 2022/06/09 11:04:53 Fulltext indexing: SSTLUG_26NP5 https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard
I 2022/06/09 11:04:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SSTLUG_26NP5 (1735154932195000320)]} 0 10
I 2022/06/09 11:04:53 SWITCHBOARD *Indexed 997 words in URL https://www.criterion.com/current/posts/6397-melodrama-debauchery-comedy-un-certain-regard [SSTLUG_26NP5]
Description: Melodrama, Debauchery, Comedy: Un Certain Regard | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 13211 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:53 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 477, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:53 HTCACHE storing content of url https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu, 71831 bytes
I 2022/06/09 11:04:53 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/images/8294-0aaff10f6f76b338d30cf4327625c44a/916id_010_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://honolulumuseum.org/events/films/16237-kanal - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/films/ca7e507c22c7570c08f43f1504309516/9NsSbie0yQIE4RI7eKfX3o4TlX5zvP_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:53 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu
I 2022/06/09 11:04:54 Fulltext indexing: SRP3vG_26NP5 https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu
I 2022/06/09 11:04:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SRP3vG_26NP5 (1735154932441415680)]} 0 2
I 2022/06/09 11:04:54 SWITCHBOARD *Indexed 298 words in URL https://www.criterion.com/current/posts/4522-andrzej-wajda-in-honolulu [SRP3vG_26NP5]
Description: Andrzej Wajda in Honolulu | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3408 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:54 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 474, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:54 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 474, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bjoerkman-stig, 224222 bytes
I 2022/06/09 11:04:54 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=bjoerkman-stig, STACKING TIME = 1, PARSING TIME = 30
I 2022/06/09 11:04:54 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://s3.amazonaws.com/criterion-production/films/e732b8e6dcc290423dc3b347d90adb86/YS7c8q5YqFduubJtYRVVoVuWf774oA_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=bjoerkman-stig
I 2022/06/09 11:04:54 Fulltext indexing: SIDRJm_26NP5 https://www.criterion.com/shop/browse/list?director=bjoerkman-stig
I 2022/06/09 11:04:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[SIDRJm_26NP5 (1735154932978286592)]} 0 3
I 2022/06/09 11:04:54 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=bjoerkman-stig [SIDRJm_26NP5]
Description: Stig Björkman films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12476 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b, 224747 bytes
I 2022/06/09 11:04:54 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:54 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1865-2b74f037f454df1d78013f06dc4aaea4/0TUeLtsha8fzPrVMeQ8rNOnpUVmvME_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://s3.amazonaws.com/criterion-production/films/1f4199efee0716b73e643f44cffd628f/FsFD7z9JPS8zGJwwhpboY5pnNOOmIc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:54 HostQueue forcing crawl-delay of 170 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 475, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 80)) = 170
I 2022/06/09 11:04:54 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b
I 2022/06/09 11:04:54 Fulltext indexing: Rzwf9m_26NP5 https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b
I 2022/06/09 11:04:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Rzwf9m_26NP5 (1735154933218410496)]} 0 2
I 2022/06/09 11:04:54 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=schoedsack-ernest-b [Rzwf9m_26NP5]
Description: Ernest B. Schoedsack films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12511 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:54 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 475, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=wilde-cornel, 224168 bytes
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=wilde-cornel, STACKING TIME = 6, PARSING TIME = 20
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/040b9ada9b2e7f00bf67207c219e907d/SqsWAMsxDc8dEu6YmD98XbtlIJ2Qak_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=wilde-cornel
I 2022/06/09 11:04:55 Fulltext indexing: RvqSLm_26NP5 https://www.criterion.com/shop/browse/list?director=wilde-cornel
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RvqSLm_26NP5 (1735154933515157504)]} 0 2
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=wilde-cornel [RvqSLm_26NP5]
Description: Cornel Wilde films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12479 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:55 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 475, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:55 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=shinarbaev-ermek, 224692 bytes
I 2022/06/09 11:04:55 HTCACHE storing content of url https://www.criterion.com/films/3558, 72654 bytes
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=shinarbaev-ermek, STACKING TIME = 1, PARSING TIME = 85
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1767-0744687c884e8c056982d471b122dce3/cgKxO604g3phzPqpONSN5STBrfnV6y_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/db96c5947256e58c76b29162412c782b/fFzRqS2k4qQ7uYfoJgI45cQIQIScQ3_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/3558, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/a87bb06a2e40fbf073674fb0a669feaf/TAA69Jf5iXdiViDJjty3V3PIFzKjDp_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/homicide?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/726755430bd298a5aa424f68a792bcea/aQ0KQhoip19olkpwhmNbrAfMY6qhAB_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/609ba3283ee6e7a688a8f332948af460/UQs28Jv5Fz6nuVrbcLXq70jhwKzxXV_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/1227-/8bS0fqEqZNNhLc6lbEcIPk5Z9yyfcO_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/4e478bb135ededb77bf009fbb602208c.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/f4cd870c9d0136001f98d8ec2ac268d3/qPf9kR60ROOrUxfCNGNl7HiYLiR48u_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=shinarbaev-ermek
|
||
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Ro-ekm_26NP5 (1735154933834973184)]} 0 2
|
||
I 2022/06/09 11:04:55 Fulltext indexing: Ro-ekm_26NP5 https://www.criterion.com/shop/browse?director=shinarbaev-ermek
|
||
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse?director=shinarbaev-ermek [Ro-ekm_26NP5]
|
||
Description: Ermek Shinarbaev films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12470 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/films/3558
|
||
I 2022/06/09 11:04:55 Fulltext indexing: RfUkwe_26NP5 https://www.criterion.com/films/3558
|
||
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RfUkwe_26NP5 (1735154933855944704)]} 0 1
|
||
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 347 words in URL https://www.criterion.com/films/3558 [RfUkwe_26NP5]
|
||
Description: Homicide (1991) | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 4412 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:55 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 474, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:04:55 HTCACHE storing content of url https://www.criterion.com/current/posts/509-watching-sal, 70401 bytes
|
||
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/posts/509-watching-sal, STACKING TIME = 2, PARSING TIME = 5
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/images/4303-6645624b4a119533b1295cd1fa38e445/img_current_45_220_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/509-watching-sal
|
||
I 2022/06/09 11:04:55 Fulltext indexing: RFF_-G_26NP5 https://www.criterion.com/current/posts/509-watching-sal
|
||
I 2022/06/09 11:04:55 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=kaufman-boris, 224644 bytes
|
||
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RFF_-G_26NP5 (1735154934097117184)]} 0 6
|
||
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 694 words in URL https://www.criterion.com/current/posts/509-watching-sal [RFF_-G_26NP5]
|
||
Description: Watching Salò | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 9754 bytes |
|
||
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:55 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=kaufman-boris, STACKING TIME = 1, PARSING TIME = 111
|
||
I 2022/06/09 11:04:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 470, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
|
||
I 2022/06/09 11:04:55 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
|
||
I 2022/06/09 11:04:55 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1843-6e179857a2b22f6fff1aee72814e6e1f/ucDdSNidzRCUS9xrmSXHu8tdlizbxG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 REJECTED https://s3.amazonaws.com/criterion-production/films/5ac9d89c2f87805b7beb1cf45f2fb262/HXRWSQehYCyfyKqH7wzDP2eO7dp7QE_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:55 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=kaufman-boris
|
||
I 2022/06/09 11:04:55 Fulltext indexing: RM-Tzm_26NP5 https://www.criterion.com/shop/browse?director=kaufman-boris
|
||
I 2022/06/09 11:04:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RM-Tzm_26NP5 (1735154934363455488)]} 0 8
|
||
I 2022/06/09 11:04:55 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse?director=kaufman-boris [RM-Tzm_26NP5]
|
||
Description: Boris Kaufman films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12482 bytes |
|
||
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:55 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:04:56 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 470, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
|
||
I 2022/06/09 11:04:56 HTCACHE storing content of url https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary, 84962 bytes
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://criterion-production.s3.amazonaws.com/WUiA1p5V17eJIY6KvFiQxCXc80yBiX.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary, STACKING TIME = 2, PARSING TIME = 8
|
||
I 2022/06/09 11:04:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5751-/3e2CxdDLUQxmY46KEB1jbn00ytTKT9_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://criterion-production.s3.amazonaws.com/bbzzvn00I7zQcDvDxCS1eHVjRdrPUQ.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://criterion-production.s3.amazonaws.com/pYeegXJvMDTvetH6naGPkBhXJNgpRB.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/films/78a86bc12fbed832f0b341609a22fa52/lun1ptGstEhxOhmc24pORDXEVkN2Ve_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 6)) = 245
|
||
I 2022/06/09 11:04:56 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary
|
||
I 2022/06/09 11:04:56 Fulltext indexing: QbrssG_26NP5 https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary
|
||
I 2022/06/09 11:04:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[QbrssG_26NP5 (1735154934816440320)]} 0 19
|
||
I 2022/06/09 11:04:56 SWITCHBOARD *Indexed 1172 words in URL https://www.criterion.com/current/posts/5751-bowling-for-columbine-by-any-means-necessary [QbrssG_26NP5]
|
||
Description: Bowling for Columbine: By Any Means Necessary | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 16981 bytes |
|
||
LinkStorageTime: 28 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:56 HTCACHE storing content of url https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage, 130340 bytes
|
||
I 2022/06/09 11:04:56 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage, STACKING TIME = 1, PARSING TIME = 19
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/images/3756-5f2f10174229f1dc5a0cbf043e8dfa68/phantomcarriage_415_005_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
|
||
I 2022/06/09 11:04:56 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage
|
||
I 2022/06/09 11:04:56 Fulltext indexing: P6kRKG_26NP5 https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage
|
||
I 2022/06/09 11:04:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[P6kRKG_26NP5 (1735154935231676416)]} 0 4
|
||
I 2022/06/09 11:04:56 SWITCHBOARD *Indexed 1024 words in URL https://www.criterion.com/current/posts/2000-phantom-forms-the-phantom-carriage [P6kRKG_26NP5]
|
||
Description: Phantom Forms: The Phantom Carriage | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 14987 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:56 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mishima-yukio, 224184 bytes
|
||
I 2022/06/09 11:04:56 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:56 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=mishima-yukio, STACKING TIME = 1, PARSING TIME = 62
|
||
I 2022/06/09 11:04:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://s3.amazonaws.com/criterion-production/films/1afc31bda9087f75091fae936b5c1ca0/1HMg9MpF1yL5AAmyha7kzSbACv2zcV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:56 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=mishima-yukio
|
||
I 2022/06/09 11:04:56 Fulltext indexing: PxU5tm_26NP5 https://www.criterion.com/shop/browse/list?director=mishima-yukio
|
||
I 2022/06/09 11:04:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PxU5tm_26NP5 (1735154935480188928)]} 0 2
|
||
I 2022/06/09 11:04:56 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=mishima-yukio [PxU5tm_26NP5]
|
||
Description: Yukio Mishima films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12491 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:57 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=inoue-akira, 224692 bytes
|
||
I 2022/06/09 11:04:57 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:57 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=inoue-akira, STACKING TIME = 1, PARSING TIME = 19
|
||
I 2022/06/09 11:04:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/films/c843e10ded9d547c8f1996012140b58b/KW0pFMBe2u60vjTtDjf2MiOMTfytam_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/608-ingmar-bergman-makes-a-movie
|
||
I 2022/06/09 11:04:57 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterion.com/films/608-ingmar-bergman-makes-a-movie - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/608-ingmar-bergman-makes-a-movie aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
|
||
I 2022/06/09 11:04:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Pky7re_26NP5 (1735154935680466944)]} 0 3
|
||
I 2022/06/09 11:04:57 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=inoue-akira
|
||
I 2022/06/09 11:04:57 Fulltext indexing: Pk9dJm_26NP5 https://www.criterion.com/shop/browse/list?director=inoue-akira
|
||
I 2022/06/09 11:04:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Pk9dJm_26NP5 (1735154935693049856)]} 0 2
|
||
I 2022/06/09 11:04:57 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=inoue-akira [Pk9dJm_26NP5]
|
||
Description: Akira Inoue films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12480 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:57 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:57 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=sjoestroem-victor, 224191 bytes
|
||
I 2022/06/09 11:04:57 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=sjoestroem-victor, STACKING TIME = 1, PARSING TIME = 77
|
||
I 2022/06/09 11:04:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=sjoestroem-victor
|
||
I 2022/06/09 11:04:57 Fulltext indexing: Pk3vlm_26NP5 https://www.criterion.com/shop/browse?director=sjoestroem-victor
|
||
I 2022/06/09 11:04:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Pk3vlm_26NP5 (1735154936091508736)]} 0 2
|
||
I 2022/06/09 11:04:57 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse?director=sjoestroem-victor [Pk3vlm_26NP5]
|
||
Description: Victor Sjöström films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12412 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:57 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 470, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:57 HTCACHE storing content of url https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff, 71108 bytes
|
||
I 2022/06/09 11:04:57 SWITCHBOARD CRAWL: ADDED 64 LINKS FROM https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff, STACKING TIME = 1, PARSING TIME = 6
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.tiff.net/the-review/tiff-2018-canadian-films/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.tiff.net/tiff/what-is-democracy/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.filmlinc.org/nyff2018/daily/julian-schnabel-at-eternitys-gate-closing-night-nyff56/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.tiff.net/tiff/anthropocene/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://criterion-production.s3.amazonaws.com/KHTH1OZ0QZyKcCiHHx9h8zlVsmTa1u.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.tiff.net/tiff/the-fall-of-the-american-empire/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.instagram.com/p/BezA0SAh69_/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.youtube.com/embed/k2zvPTGiJj4?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://gallery.mailchimp.com/bed63d3ce10ec9adba60ea410/files/690ef579-f1e5-4c6a-8520-d7a65adedb8e/TIFF_ANNOUNCES_THE_WORLD_PREMIERE_OF_XAVIER_DOLAN_S_THE_DEATH_AND_LIFE_OF_JOHN_F._DONOVAN.pdf - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 REJECTED https://www.instagram.com/p/Bey2fUjg_AC/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:57 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff
|
||
I 2022/06/09 11:04:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PVVaRG_26NP5 (1735154936227823616)]} 0 1
|
||
I 2022/06/09 11:04:57 Fulltext indexing: PVVaRG_26NP5 https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff
|
||
I 2022/06/09 11:04:57 SWITCHBOARD *Indexed 437 words in URL https://www.criterion.com/current/posts/5841-schnabel-at-nyff-dolan-at-tiff [PVVaRG_26NP5]
|
||
Description: Schnabel at NYFF, Dolan at TIFF | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 5326 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:57 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:58 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images, 111351 bytes
|
||
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=risi-dino, 224166 bytes
|
||
I 2022/06/09 11:04:58 SWITCHBOARD CRAWL: ADDED 68 LINKS FROM https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images, STACKING TIME = 6, PARSING TIME = 29
|
||
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/p69daGpYEMwv1oeYjTTbMyvhbeE23f.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/anqRb3iEvsM48b8ZZy95xyrpvn0SW0.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/3JOp4FzRo0Yj6GGNK9WFk9JvTM7ve9.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://vhx.imgix.net/criterionchannelchartersu/assets/fe816e81-c4da-463d-852b-3b50d46072e8-0f1dcdf9.jpg?auto=format%2Ccompress&fit=crop&h=360&q=70&w=640 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/vZ1FmsXmVr3Tl2QcaVDWbkxmlirHxz.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7433-/TaYxcGWDJ3jXRzpSOx9ehob2S9S4mm_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/iVrw87O9tVNu8oFk2NVCDxiVbcTEMF.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7153-/nV4ZAell01TtJQf3ioJDSHLKKtpjtk_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1979-b50ddf05e790d45f0d3452882a1c5104/Yz5hb3zePSR6H1BFKoUnt0dQHSxzSj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://criterion-production.s3.amazonaws.com/lAjYL9Xq2bJ9vfVovFjyaSRseOXVaA.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/inspired-by-marlon-riggs?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=current - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
|
||
I 2022/06/09 11:04:58 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=risi-dino, STACKING TIME = 1, PARSING TIME = 67
|
||
I 2022/06/09 11:04:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/41d9df20fc245c4ecc12ef934533adc2/HrfTWPnllatTkm1Npqwr0o4Jslre1h_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images
|
||
I 2022/06/09 11:04:58 Fulltext indexing: PF73kG_26NP5 https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images
|
||
I 2022/06/09 11:04:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PF73kG_26NP5 (1735154937146376192)]} 0 12
|
||
I 2022/06/09 11:04:58 SWITCHBOARD *Indexed 1878 words in URL https://www.criterion.com/current/posts/7433-the-signifyin-works-of-marlon-riggs-positive-images [PF73kG_26NP5]
|
||
Description: The Signifyin’ Works of Marlon Riggs: Positive Images | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 37014 bytes |
|
||
LinkStorageTime: 14 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance, 76950 bytes
|
||
I 2022/06/09 11:04:58 HostQueue forcing crawl-delay of 248 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 467, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 6)) = 245
|
||
I 2022/06/09 11:04:58 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
|
||
I 2022/06/09 11:04:58 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance, STACKING TIME = 2, PARSING TIME = 79
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/3b88fba9072b416d90a382e62b44a6b4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/18114be2187272d203efeda6bc250cf3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://itunes.apple.com/us/movie/id1168909411?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/ca9546c2ceb1b830a1e5f190805fb6fd/21FyOwvaD7hVXctaQrX9RVy5muIn6D_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-sword-of-vengeance?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://www.amazon.com/dp/B01M7ZWC9V - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/f936394d5e829494b0c15bbfa73d850e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:04:58 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=risi-dino
|
||
I 2022/06/09 11:04:58 Fulltext indexing: PLGzxm_26NP5 https://www.criterion.com/shop/browse/list?director=risi-dino
|
||
I 2022/06/09 11:04:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PLGzxm_26NP5 (1735154937361334272)]} 0 11
|
||
I 2022/06/09 11:04:58 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:04:58 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@8639c47e[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wt(7.7.3):c176:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772696001}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wu(7.7.3):C8:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772698712}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:58 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:58 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=risi-dino [PLGzxm_26NP5]
Description: Dino Risi films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12470 bytes |
LinkStorageTime: 20 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:58 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance
I 2022/06/09 11:04:58 Fulltext indexing: O3mESe_26NP5 https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance
I 2022/06/09 11:04:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[O3mESe_26NP5 (1735154937399083008)]} 0 2
I 2022/06/09 11:04:58 SWITCHBOARD *Indexed 331 words in URL https://www.criterion.com/films/28722-lone-wolf-and-cub-sword-of-vengeance [O3mESe_26NP5]
Description: Lone Wolf and Cub: Sword of Vengeance (1972) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3782 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:58 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 467, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=drew-robert, 224233 bytes
I 2022/06/09 11:04:58 HTCACHE storing content of url https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader, 68316 bytes
I 2022/06/09 11:04:59 LOADER CRAWLER Redirection detected ('HTTP/1.1 301 Moved Permanently') for URL https://www.criterion.com/explore/214-martin-scorsese-s-top-10
I 2022/06/09 11:04:59 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/current/top-10-lists/214-martin-scorsese-s-top-10
I 2022/06/09 11:04:59 REJECTED https://www.criterion.com/explore/214-martin-scorsese-s-top-10 - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/explore/214-martin-scorsese-s-top-10 aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:37:10Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:04:59 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=drew-robert, STACKING TIME = 2, PARSING TIME = 115
I 2022/06/09 11:04:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Oqv8Uk_26NP5 (1735154937629769728)]} 0 25
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/films/cd4e04ca68b6bfb60fdcbb7ea6b8386f/JtYzw2kwoN2ahEB7YqQiYXBBzRbTvp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader, STACKING TIME = 7, PARSING TIME = 34
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/meet-the-filmmakers-paul-schrader - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://player.vimeo.com/video/385346582 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=drew-robert
I 2022/06/09 11:04:59 Fulltext indexing: O-aGQm_26NP5 https://www.criterion.com/shop/browse/list?director=drew-robert
I 2022/06/09 11:04:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[O-aGQm_26NP5 (1735154937834242048)]} 0 2
I 2022/06/09 11:04:59 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=drew-robert [O-aGQm_26NP5]
Description: Robert Drew films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12479 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:59 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader
I 2022/06/09 11:04:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[O2N0WG_26NP5 (1735154937859407872)]} 0 1
I 2022/06/09 11:04:59 Fulltext indexing: O2N0WG_26NP5 https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader
I 2022/06/09 11:04:59 SWITCHBOARD *Indexed 301 words in URL https://www.criterion.com/current/posts/6791-alex-ross-perry-pays-a-visit-to-great-american-iconoclast-paul-schrader [O2N0WG_26NP5]
Description: Alex Ross Perry Pays a Visit to Great American Iconoclast Paul Schrader | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3758 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:59 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:59 HTCACHE storing content of url https://www.criterion.com/current/posts/1437-kap-into-darkness, 84865 bytes
I 2022/06/09 11:04:59 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/1437-kap-into-darkness, STACKING TIME = 2, PARSING TIME = 7
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/films/4513899b08a6c9c55a2705f66fe70452/kvSLd5hpDEUQl2H2vLmSiEIkfamyF5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://s3.amazonaws.com/criterion-production/images/4553-dc25a80794d892dab17bfbd7ee3646a4/current_655_014_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:59 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/1437-kap-into-darkness
I 2022/06/09 11:04:59 Fulltext indexing: Oplu7G_26NP5 https://www.criterion.com/current/posts/1437-kap-into-darkness
I 2022/06/09 11:04:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Oplu7G_26NP5 (1735154938219069440)]} 0 3
I 2022/06/09 11:04:59 SWITCHBOARD *Indexed 527 words in URL https://www.criterion.com/current/posts/1437-kap-into-darkness [Oplu7G_26NP5]
Description: Kapò: Into Darkness | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6205 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:59 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:59 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 468, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:59 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=henzell-perry, 224166 bytes
I 2022/06/09 11:05:00 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/27829-silence
I 2022/06/09 11:05:00 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
I 2022/06/09 11:05:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[OIDepe_26NP5 (1735154938694074368)]} 0 0
I 2022/06/09 11:05:00 REJECTED https://www.criterion.com/films/27829-silence - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/27829-silence aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=henzell-perry, STACKING TIME = 2, PARSING TIME = 125
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/films/fbe03f271e488f76de9836e9624a7526/arb7E8i8LJCGfMweLGxCwCClH5YVAI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=moore-michael-1, 224192 bytes
I 2022/06/09 11:05:00 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse?director=moore-michael-1, STACKING TIME = 1, PARSING TIME = 74
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/films/78a86bc12fbed832f0b341609a22fa52/lun1ptGstEhxOhmc24pORDXEVkN2Ve_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=henzell-perry
I 2022/06/09 11:05:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Odhdim_26NP5 (1735154938872332288)]} 0 2
I 2022/06/09 11:05:00 Fulltext indexing: Odhdim_26NP5 https://www.criterion.com/shop/browse/list?director=henzell-perry
I 2022/06/09 11:05:00 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=henzell-perry [Odhdim_26NP5]
Description: Perry Henzell films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12461 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:00 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=moore-michael-1
I 2022/06/09 11:05:00 Fulltext indexing: OawGcm_26NP5 https://www.criterion.com/shop/browse?director=moore-michael-1
I 2022/06/09 11:05:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[OawGcm_26NP5 (1735154938936295424)]} 0 2
I 2022/06/09 11:05:00 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse?director=moore-michael-1 [OawGcm_26NP5]
Description: Michael Moore films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12430 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:00 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:05:00 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=jaglom-henry, 224731 bytes
I 2022/06/09 11:05:00 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=jaglom-henry, STACKING TIME = 1, PARSING TIME = 89
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/films/ebedf2335fa256b63d4e645e8c508f12/A4UT1QlIsS1FGC0qJQAS21FqwPqpNn_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1864-79581df36332f7a1f027b81311c9e0f9/j8fKXLhpRU7m0dGwebBdwZLgrXaO26_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=jaglom-henry
I 2022/06/09 11:05:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[OCZQCm_26NP5 (1735154939431223296)]} 0 2
I 2022/06/09 11:05:00 Fulltext indexing: OCZQCm_26NP5 https://www.criterion.com/shop/browse/list?director=jaglom-henry
I 2022/06/09 11:05:00 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse/list?director=jaglom-henry [OCZQCm_26NP5]
Description: Henry Jaglom films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12515 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:00 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nguyen-jon, 224200 bytes
I 2022/06/09 11:05:00 HTCACHE storing content of url https://www.criterion.com/current/posts/6396-competition-highs-and-lows, 92734 bytes
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=nguyen-jon, STACKING TIME = 1, PARSING TIME = 34
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 467, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
|
||
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/films/45ae4aaeb01b65d3788e09d553527a0d/O5urSQ3UPS5ELUhh25uHIS7M5ri1p3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 SWITCHBOARD CRAWL: ADDED 116 LINKS FROM https://www.criterion.com/current/posts/6396-competition-highs-and-lows, STACKING TIME = 12, PARSING TIME = 97
|
||
I 2022/06/09 11:05:00 REJECTED https://www.telegraph.co.uk/films/0/matthias-maxime-review-crisp-sweet-canadian-coming-of-age-tale/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://variety.com/2019/film/reviews/oh-mercy-review-1203223481/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://news.yahoo.com/bong-song-double-act-behind-koreas-cannes-victory-211807398.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.festival-cannes.com/en/festival/films/le-jeune-ahmed - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.locarnofestival.ch/pardo/pardo-live/today-at-festival/2019/05/Excellence_SONG.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://time.com/5592027/cannes-review-pedro-almodovar-pain-and-glory/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://criterion-production.s3.amazonaws.com/BlcZzr94BSUcAb10XEIg1TLf9mkCQW.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.hollywoodreporter.com/review/sibyl-review-1212046 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.filmcomment.com/blog/film-of-the-week-oh-mercy/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/embed/YcHB6eE3I1k?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED http://www.semainedelacritique.com/en/edition/2019/movie/nuestras-madres - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.festival-cannes.com/en/festival/films/sibyl - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://film.avclub.com/robert-pattinson-and-willem-dafoe-get-lighthouse-fever-1834913555 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://cineuropa.org/en/newsdetail/373264/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://thefilmstage.com/reviews/frankie-review-cannes-isabelle-huppert-ira-sachs/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED http://www.anothergaze.com/celine-sciammas-portrait-de-la-jeune-fille-en-feu-portrait-lady-fire-explores-boundlessness-poetic-love-cannes-lesbian-feminist/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.filmcomment.com/blog/cannes-interview-adele-haenel/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:00 REJECTED https://www.youtube.com/embed/iPNfFRZjgkE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.filmcomment.com/blog/cannes-interview-mati-diop/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.thedailybeast.com/young-ahmed-a-disturbing-portrait-of-an-islamic-terrorist-invades-cannes - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-mektoub-my-love-intermezzo-abdellatif-kechiche-20190523-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://twitter.com/jessicakiang/status/1131133907896815617 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://film.avclub.com/guessing-the-winners-and-picking-our-own-at-the-end-o-1835011696 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/it-must-be-heaven - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.nytimes.com/2019/05/25/movies/cannes-film-festival-winners-parasite.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/embed/uS-2B8Vl_fA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://variety.com/2019/film/reviews/frankie-review-isabelle-huppert-1203220585/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://filmmakermagazine.com/107581-cannes-2019-dispatch-6-parasite-once-upon-a-time-in-hollywood-mektoub-my-love-intermezzo/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.theguardian.com/film/2019/may/20/portrait-of-a-lady-on-fire-review-celine-sciamma - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED http://www.lefilmfrancais.com/cinema/142135/cannes2019-le-tableau-final-des-etoiles-de-la-critique-palmometre - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.instagram.com/p/BxrbBkFhvq4/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/mektoub-my-love-intermezzo - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/roubaix-une-lumiere - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.nytimes.com/2019/05/24/movies/cannes-almodovar-kechiche.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://twitter.com/CriterionDaily/status/1132302591977807872 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://lwlies.com/festivals/young-ahmed-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://twitter.com/CriterionDaily/status/1132318367287787521 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://variety.com/2019/film/reviews/cannes-film-review-matthias-maxime-1203223223/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/portrait-de-la-jeune-fille-en-feu - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/the-distance-between-us-and-the-sky - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://variety.com/2019/film/reviews/sibyl-review-1203225243/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/il-traditore - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/matthias-et-maxime - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.indiewire.com/2019/05/the-traitor-review-marco-bellocchio-cannes-1202144226/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.indiewire.com/2019/05/mektoub-my-love-intermezzo-unsimulated-sex-alcohol-report-abdellatif-kechiche-1202144998/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-13-trying-times - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.festival-cannes.com/en/festival/films/frankie - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://thefilmstage.com/reviews/cannes-review-with-parasite-bong-joon-ho-delivers-an-electrifying-assessment-of-social-stratification/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/embed/ssxK8FboFo4?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.screendaily.com/reviews/the-traitor-cannes-review/5139679.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=nguyen-jon
|
||
I 2022/06/09 11:05:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[NnlDam_26NP5 (1735154939833876480)]} 0 2
|
||
I 2022/06/09 11:05:01 Fulltext indexing: NnlDam_26NP5 https://www.criterion.com/shop/browse/list?director=nguyen-jon
|
||
I 2022/06/09 11:05:01 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=nguyen-jon [NnlDam_26NP5]
|
||
Description: Jon Nguyen films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12487 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:01 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 467, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:05:01 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/6396-competition-highs-and-lows
|
||
I 2022/06/09 11:05:01 Fulltext indexing: NWKbAG_26NP5 https://www.criterion.com/current/posts/6396-competition-highs-and-lows
|
||
I 2022/06/09 11:05:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[NWKbAG_26NP5 (1735154939941879808)]} 0 4
|
||
I 2022/06/09 11:05:01 SWITCHBOARD *Indexed 1291 words in URL https://www.criterion.com/current/posts/6396-competition-highs-and-lows [NWKbAG_26NP5]
|
||
Description: Competition Highs and Lows | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 18189 bytes |
|
||
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:01 HTCACHE storing content of url https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril, 76601 bytes
|
||
I 2022/06/09 11:05:01 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril, STACKING TIME = 1, PARSING TIME = 7
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/95246b965f81274a17549dcbe2b4f694.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/a73036c1a053afe9981265d73ad0222e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-baby-cart-in-peril?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/3de10aea9022f1adb4b52927aeadd2da.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://itunes.apple.com/us/movie/id1170328408?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.amazon.com/dp/B01MXHGDKK - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/c7f130a16b24d863d6a58d45b4220e8d/pYeWiMfBSfKQvp81wNDqhKe22Q1FBj_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril
|
||
I 2022/06/09 11:05:01 Fulltext indexing: NLL08e_26NP5 https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril
|
||
I 2022/06/09 11:05:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[NLL08e_26NP5 (1735154940066660352)]} 0 1
|
||
I 2022/06/09 11:05:01 SWITCHBOARD *Indexed 338 words in URL https://www.criterion.com/films/28725-lone-wolf-and-cub-baby-cart-in-peril [NLL08e_26NP5]
|
||
Description: Lone Wolf and Cub: Baby Cart in Peril (1972) | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3782 bytes |
|
||
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:01 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 466, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
|
||
I 2022/06/09 11:05:01 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=babenco-hector, 224795 bytes
|
||
I 2022/06/09 11:05:01 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 466, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:05:01 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=babenco-hector, STACKING TIME = 3, PARSING TIME = 197
|
||
I 2022/06/09 11:05:01 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1961-3b7b159a9ae3d500459e38e69c96a917/9P9MeXzolFQyY5OhK2XcwZ02e0Y0ZC_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 REJECTED https://s3.amazonaws.com/criterion-production/films/adc5dbcfd3feab7fb5ebe9ce4b103691/oTbeMEU4djNsFgw76dm1476k4EGUjO_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:01 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=babenco-hector
|
||
I 2022/06/09 11:05:01 Fulltext indexing: M-Eb9m_26NP5 https://www.criterion.com/shop/browse/list?director=babenco-hector
|
||
I 2022/06/09 11:05:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[M-Eb9m_26NP5 (1735154940651765760)]} 0 2
|
||
I 2022/06/09 11:05:01 SWITCHBOARD *Indexed 1204 words in URL https://www.criterion.com/shop/browse/list?director=babenco-hector [M-Eb9m_26NP5]
|
||
Description: Héctor Babenco films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12540 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:02 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 466, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
|
||
I 2022/06/09 11:05:02 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=micheaux-oscar, 224777 bytes
|
||
I 2022/06/09 11:05:02 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=micheaux-oscar, STACKING TIME = 1, PARSING TIME = 32
|
||
I 2022/06/09 11:05:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/e8c19690806d171f7e3ef5c667737ca9/v6zAyHvnNeorIDi62bcCZ2NTMmOwHJ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=micheaux-oscar
|
||
I 2022/06/09 11:05:02 Fulltext indexing: Mp1dxm_26NP5 https://www.criterion.com/shop/browse/list?director=micheaux-oscar
|
||
I 2022/06/09 11:05:02 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 469, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
|
||
I 2022/06/09 11:05:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Mp1dxm_26NP5 (1735154941071196160)]} 0 3
|
||
I 2022/06/09 11:05:02 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse/list?director=micheaux-oscar [Mp1dxm_26NP5]
|
||
Description: Oscar Micheaux films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12529 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:02 HTCACHE storing content of url https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades, 76467 bytes
|
||
I 2022/06/09 11:05:02 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bogdanovich-peter, 224781 bytes
|
||
I 2022/06/09 11:05:02 SWITCHBOARD CRAWL: ADDED 67 LINKS FROM https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades, STACKING TIME = 14, PARSING TIME = 21
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/9412bc6d3169c80870d1fe3af4c95bc5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://itunes.apple.com/us/movie/id1169837282?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/lone-wolf-and-cub-baby-cart-to-hades?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1772-05f7b23e0d4044cfa66e916e5ef7a8df/atLwCDI4fWZrMlIpaGL1YTLyiGq7ND_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/86bd8d74849a8d3e70dde7c17521c24b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/bdccef4372055a6d7eba1d9e48d671e0/KbfBzW7rIf0GLzZ6mfOiZKBiAPrKFM_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/images/7742-355bb7d45a9482fc6c2ac3db60ad9495/lone_wolf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/5957d97193290193070c833ec84b7d44/sVs3Xn4WILtQAfnENtt8DNTR4s5D7l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.amazon.com/dp/B01M6EAG9R - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/bd8e88e276dc6d474902734b8d0faf45.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/images/7830-f315cd91efe00a972e37be56081e1dbf/LWC_designs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/images/7773-ee7702bf1312d6cd3550dbfc426265bc/Current_LW_C1-grab65_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades
|
||
I 2022/06/09 11:05:02 Fulltext indexing: MMTW2e_26NP5 https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades
|
||
I 2022/06/09 11:05:02 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bogdanovich-peter, STACKING TIME = 1, PARSING TIME = 38
|
||
I 2022/06/09 11:05:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/e3337688841b6e5c3e22796b3a889e02/feBBtCtzScE9oOvniyPVCbTsG34x2V_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1864-79581df36332f7a1f027b81311c9e0f9/j8fKXLhpRU7m0dGwebBdwZLgrXaO26_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[MMTW2e_26NP5 (1735154941329145856)]} 0 17
|
||
I 2022/06/09 11:05:02 SWITCHBOARD *Indexed 335 words in URL https://www.criterion.com/films/28724-lone-wolf-and-cub-baby-cart-to-hades [MMTW2e_26NP5]
|
||
Description: Lone Wolf and Cub: Baby Cart to Hades (1972) | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3758 bytes |
|
||
LinkStorageTime: 19 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:02 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 504, robots.delay = 0, ((waitig = 252) - (timeSinceLastAccess = 11)) = 241
|
||
I 2022/06/09 11:05:02 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bogdanovich-peter
|
||
I 2022/06/09 11:05:02 Fulltext indexing: MhnEmm_26NP5 https://www.criterion.com/shop/browse/list?director=bogdanovich-peter
|
||
I 2022/06/09 11:05:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[MhnEmm_26NP5 (1735154941403594752)]} 0 2
|
||
I 2022/06/09 11:05:02 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse/list?director=bogdanovich-peter [MhnEmm_26NP5]
|
||
Description: Peter Bogdanovich films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12521 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:02 HTCACHE storing content of url https://www.criterion.com/films/29404-lettres-d-amour, 70654 bytes
|
||
I 2022/06/09 11:05:02 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/29404-lettres-d-amour, STACKING TIME = 1, PARSING TIME = 7
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1816-e188e2102b63387dbe23fc67edb6beea/DWwbQG5RL4lfG3HYvO9uUZ78amems4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9fbdd9a4cfc783daf7ad2275333b3a90.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/6524a1ed52ee915b2a7148ac59ab1881.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/45c628fc5070f6fee6f29ef34ca518e9.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f56a6a3cf85f93e5dc84efb56fd8100e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/films/09ef0ec590d8fcbf5ea01259ae3ba9b9/3OZ3L2WPoJxZgumbeOigOW0aKsXxdJ_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/29d1fe954024975e5cd88f6e316f6823.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/lettres-d-amour?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/84f60757a2466574aa9497b56c5b2ec1.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/dbd64381dc74a65e71a743b59a63b9c6.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://s3.amazonaws.com/criterion-production/images/9518-74f3b68e17d1c132c4fb7dd0555570cc/Current_29404id_015_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:02 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/films/29404-lettres-d-amour
|
||
I 2022/06/09 11:05:02 Fulltext indexing: MHDG8e_26NP5 https://www.criterion.com/films/29404-lettres-d-amour
|
||
I 2022/06/09 11:05:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[MHDG8e_26NP5 (1735154941601775616)]} 0 2
|
||
I 2022/06/09 11:05:02 SWITCHBOARD *Indexed 270 words in URL https://www.criterion.com/films/29404-lettres-d-amour [MHDG8e_26NP5]
|
||
Description: Lettres d’amour (1942) | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 3061 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:02 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 484, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 8)) = 242
|
||
I 2022/06/09 11:05:03 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 484, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:05:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=adolphson-edvin, 224809 bytes
|
||
I 2022/06/09 11:05:03 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=adolphson-edvin, STACKING TIME = 1, PARSING TIME = 19
|
||
I 2022/06/09 11:05:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/bb3d92ac28b94e8cbad1c169e15230eb/gxgY0bIbXgkuyFw4Zs4uwe1uD5PxnV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=adolphson-edvin
|
||
I 2022/06/09 11:05:03 Fulltext indexing: L7wf3m_26NP5 https://www.criterion.com/shop/browse/list?director=adolphson-edvin
|
||
I 2022/06/09 11:05:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[L7wf3m_26NP5 (1735154942056857600)]} 0 3
|
||
I 2022/06/09 11:05:03 SWITCHBOARD *Indexed 1198 words in URL https://www.criterion.com/shop/browse/list?director=adolphson-edvin [L7wf3m_26NP5]
|
||
Description: Edvin Adolphson films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12524 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lelouch-claude, 224291 bytes
|
||
I 2022/06/09 11:05:03 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 500, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
|
||
I 2022/06/09 11:05:03 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=lelouch-claude, STACKING TIME = 1, PARSING TIME = 29
|
||
I 2022/06/09 11:05:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=lelouch-claude
|
||
I 2022/06/09 11:05:03 Fulltext indexing: L4Iihm_26NP5 https://www.criterion.com/shop/browse/list?director=lelouch-claude
|
||
I 2022/06/09 11:05:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[L4Iihm_26NP5 (1735154942343118848)]} 0 2
|
||
I 2022/06/09 11:05:03 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=lelouch-claude [L4Iihm_26NP5]
|
||
Description: Claude Lelouch films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12584 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:03 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 500, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
|
||
I 2022/06/09 11:05:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=sjoeberg-alf, 225777 bytes
|
||
I 2022/06/09 11:05:03 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=sjoeberg-alf, STACKING TIME = 1, PARSING TIME = 21
|
||
I 2022/06/09 11:05:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/f05e3d8b8d48f95dff100aa47f9f61ed/98HE82POwakYBdneI0UmYxj3CmWRyN_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/493d910d7cfcce1d103f6354027d6d37/NT4wqOdozXSzODYv04w2NGpHYiqc23_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1777-0bf5a031bd21c3d35f64323b27f49d77/BYXas4KPZ66EXCahX6uErMaxE3Xs3e_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:03 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=sjoeberg-alf
|
||
I 2022/06/09 11:05:03 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
|
||
I 2022/06/09 11:05:03 Fulltext indexing: Lfk-Dm_26NP5 https://www.criterion.com/shop/browse/list?director=sjoeberg-alf
|
||
I 2022/06/09 11:05:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Lfk-Dm_26NP5 (1735154942616797184)]} 0 4
|
||
I 2022/06/09 11:05:03 SWITCHBOARD *Indexed 1212 words in URL https://www.criterion.com/shop/browse/list?director=sjoeberg-alf [Lfk-Dm_26NP5]
|
||
Description: Alf Sjöberg films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12566 bytes |
|
||
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:03 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 515, robots.delay = 0, ((waitig = 257) - (timeSinceLastAccess = 12)) = 245
|
||
I 2022/06/09 11:05:03 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:05:03 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@4cf853f4[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wt(7.7.3):c176:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772696001}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wu(7.7.3):C8:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772698712}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wv(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772703827}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:05:03 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:05:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=smith-kevin, 224168 bytes
I 2022/06/09 11:05:03 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=smith-kevin, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:05:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://s3.amazonaws.com/criterion-production/films/7219028d5161c33275211a71074f5f4a/ma1iHSdXi2xvx6cDYrynn8vKKNkODq_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:03 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=smith-kevin
I 2022/06/09 11:05:03 Fulltext indexing: LI70Dm_26NP5 https://www.criterion.com/shop/browse/list?director=smith-kevin
I 2022/06/09 11:05:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[LI70Dm_26NP5 (1735154942864261120)]} 0 4
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=smith-kevin [LI70Dm_26NP5]
Description: Kevin Smith films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12472 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:04 HostQueue forcing crawl-delay of 251 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 522, robots.delay = 0, ((waitig = 261) - (timeSinceLastAccess = 10)) = 251
I 2022/06/09 11:05:04 HTCACHE storing content of url https://www.criterion.com/films/27624-the-white-angel, 73839 bytes
I 2022/06/09 11:05:04 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/films/27624-the-white-angel, STACKING TIME = 1, PARSING TIME = 11
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/c5fe70d9457505b773db808a0aa1bbae.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/images/3719-34e86eb3abe14685658f5613efd3b146/matarazzo_nobodyschildren-4_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/the-white-angel?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/224ad3784ebfb34f98b1af628337f3da/gf5q2Dxvw2rDGLoNCNOnF3L53EUKqK_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/fd022ca340ea50b496b2555066d48d57.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f0ef8434f807ab5082a6a0b63f2111a4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1803-6a8b76d7af61cbee31aced4a8191a85a/zKIrZpwHhlb5ETRsoB0UujdI9OwYFz_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/b59efc718fc3309a4eb76255310280b8/MpKeZ33lims6VNmUQPeUZ06u1HCgd5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/c3671b7d05dd992c80de898da6f724a8/iKAAnLTwUhBFo0X62zBb8ijm258Sey_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/185ba19b70895755d4b0dce4cd86460c.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/572b757a5f9510a89da713901d2a6d39.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6475-/lZ5EyHLWra1qaKr6A01KvgMRozwhlx_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/e033d65a68d10236f332c06ce3725b59/5Pvpc7nsRpgo8lu3IQVlAeheME1g9w_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/6ab1e62fe3a341659643ec9abd877ef8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/films/27624-the-white-angel
I 2022/06/09 11:05:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[LD6nNe_26NP5 (1735154943084462080)]} 0 2
I 2022/06/09 11:05:04 Fulltext indexing: LD6nNe_26NP5 https://www.criterion.com/films/27624-the-white-angel
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 312 words in URL https://www.criterion.com/films/27624-the-white-angel [LD6nNe_26NP5]
Description: The White Angel (1955) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3438 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:04 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=clayton-jack, 224160 bytes
I 2022/06/09 11:05:04 HostQueue forcing crawl-delay of 253 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 525, robots.delay = 0, ((waitig = 262) - (timeSinceLastAccess = 9)) = 253
I 2022/06/09 11:05:04 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=clayton-jack, STACKING TIME = 1, PARSING TIME = 89
I 2022/06/09 11:05:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/4a2579a876453c44391878218c0ff019/BXl2PKj5T4lPvbzJpQb8AQ49rf6uBh_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=clayton-jack
I 2022/06/09 11:05:04 Fulltext indexing: LFVvMm_26NP5 https://www.criterion.com/shop/browse/list?director=clayton-jack
I 2022/06/09 11:05:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[LFVvMm_26NP5 (1735154943366529024)]} 0 2
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 1186 words in URL https://www.criterion.com/shop/browse/list?director=clayton-jack [LFVvMm_26NP5]
Description: Jack Clayton films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12460 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:04 HostQueue forcing crawl-delay of 252 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 525, robots.delay = 0, ((waitig = 262) - (timeSinceLastAccess = 10)) = 252
I 2022/06/09 11:05:04 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=macpherson-kenneth, 224746 bytes
I 2022/06/09 11:05:04 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=macpherson-kenneth, STACKING TIME = 1, PARSING TIME = 25
I 2022/06/09 11:05:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/films/64a7765172f162a06129a504d81dcc64/RKbm2Glxv6ZMeQjvZyFuk4mZkZanvo_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1873-f368b5675c9f0eb398727cb9a03b6e7b/vDeZBqbuBzw7Bpsb3KEK6hBnY6zACK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 HTCACHE storing content of url https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling, 65581 bytes
I 2022/06/09 11:05:04 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling, STACKING TIME = 5, PARSING TIME = 5
I 2022/06/09 11:05:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://s3.amazonaws.com/criterion-production/images/4280-0b082dc4055ce557083e17481a378dd9/current_2_302_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:04 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 506, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 12)) = 241
I 2022/06/09 11:05:04 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=macpherson-kenneth
I 2022/06/09 11:05:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[K59zXm_26NP5 (1735154943846776832)]} 0 2
I 2022/06/09 11:05:04 Fulltext indexing: K59zXm_26NP5 https://www.criterion.com/shop/browse/list?director=macpherson-kenneth
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse/list?director=macpherson-kenneth [K59zXm_26NP5]
Description: Kenneth Macpherson films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12509 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:04 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling
I 2022/06/09 11:05:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[KWQVsG_26NP5 (1735154943872991232)]} 0 1
I 2022/06/09 11:05:04 Fulltext indexing: KWQVsG_26NP5 https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling
I 2022/06/09 11:05:04 SWITCHBOARD *Indexed 438 words in URL https://www.criterion.com/current/posts/442-the-hours-and-times-kurosawa-and-the-art-of-epic-storytelling [KWQVsG_26NP5]
Description: The Hours and Times: Kurosawa and the Art of Epic Storytelling | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5422 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:05 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 506, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 10)) = 243
I 2022/06/09 11:05:05 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=pfleghar-michael, 224306 bytes
I 2022/06/09 11:05:05 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=pfleghar-michael, STACKING TIME = 1, PARSING TIME = 67
I 2022/06/09 11:05:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/my_criterion/4426-jonathan-keogh
I 2022/06/09 11:05:05 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/account/my-criterion
I 2022/06/09 11:05:05 REJECTED https://www.criterion.com/my_criterion/4426-jonathan-keogh - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/my_criterion/4426-jonathan-keogh to https://www.criterion.com/account/my-criterion placed on crawler queue for double-check
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[J8jGN6_26NP5 (1735154944212729856)]} 0 6
I 2022/06/09 11:05:05 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=pfleghar-michael
I 2022/06/09 11:05:05 Fulltext indexing: Ksx0Cm_26NP5 https://www.criterion.com/shop/browse/list?director=pfleghar-michael
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Ksx0Cm_26NP5 (1735154944287178752)]} 0 2
I 2022/06/09 11:05:05 SWITCHBOARD *Indexed 1186 words in URL https://www.criterion.com/shop/browse/list?director=pfleghar-michael [Ksx0Cm_26NP5]
Description: Michael Pfleghar films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12582 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:05 HostQueue forcing crawl-delay of 254 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 529, robots.delay = 0, ((waitig = 264) - (timeSinceLastAccess = 10)) = 254
I 2022/06/09 11:05:05 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=chabrol-claude, 224703 bytes
I 2022/06/09 11:05:05 HTCACHE storing content of url https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe, 73730 bytes
I 2022/06/09 11:05:05 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=chabrol-claude, STACKING TIME = 1, PARSING TIME = 98
I 2022/06/09 11:05:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/films/56ba3e5cf5c30c4ca37d11dbc92db30c/TIrcSypBNof95PXcIsPiciN0HD59y4_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/films/e2ec7ab6b825e43071466bfb89fe7fff/wOzbQfzoXkpgxILMzKBKRMOwT7xfb7_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
|
||
I 2022/06/09 11:05:05 SWITCHBOARD CRAWL: ADDED 65 LINKS FROM https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe, STACKING TIME = 1, PARSING TIME = 21
|
||
I 2022/06/09 11:05:05 REJECTED https://www.telegraph.co.uk/films/0/little-joe-ben-whishaw-falls-mutant-flower-chilly-sci-fi-fable/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-chang-pedro-almodovar-ken-loach-jessica-hausner-20190518-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.festival-cannes.com/en/festival/films/little-joe - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.indiewire.com/2019/05/little-joe-review-cannes-1202142527/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://variety.com/2019/film/news/jessica-hausner-little-joe-cannes-film-festival-interview-1203219442/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/posts/7823-tribeca-2022 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/little-joe-jessica-hausner-sci-fi-plant-horror-drama - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/embed/S7ihx84V1q4?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/posts/7819-early-summer-reading - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/series/did-you-see-this - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-production.s3.amazonaws.com/eHPF8Mm6sTdbJvdVijP8UW2JzbXnuB.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/embed/I4cdpfJ-k5A?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://film.avclub.com/postwar-drama-and-an-unnerving-spin-on-a-sci-fi-classic-1834865079 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://deadline.com/2019/05/jessica-hausner-little-joe-cannes-interview-news-1202610970/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/category/1-on-film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-5-haitian-zombis-insidious-plants-takashi-miike - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/series/cannes-2019 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/posts/7822-irma-vep-revamp - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/category/20-the-daily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/author/654-david-hudson - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://criterion-v2.herokuapp.com/current/posts/7818-american-neorealism-now - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:05 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=chabrol-claude
|
||
I 2022/06/09 11:05:05 Fulltext indexing: KTPXcm_26NP5 https://www.criterion.com/shop/browse/list?director=chabrol-claude
|
||
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[KTPXcm_26NP5 (1735154944731774976)]} 0 2
|
||
I 2022/06/09 11:05:05 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=chabrol-claude [KTPXcm_26NP5]
|
||
Description: Claude Chabrol films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12508 bytes |
|
||
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kDluwa_26NP5 (1735154944745406464)]} 0 0
|
||
I 2022/06/09 11:05:05 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/account/my-criterion
|
||
I 2022/06/09 11:05:05 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/login
|
||
I 2022/06/09 11:05:05 REJECTED https://www.criterion.com/account/my-criterion - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/account/my-criterion aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:34:07Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
|
||
I 2022/06/09 11:05:05 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe
|
||
I 2022/06/09 11:05:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Jk4sYG_26NP5 (1735154944763232256)]} 0 1
|
||
I 2022/06/09 11:05:05 Fulltext indexing: Jk4sYG_26NP5 https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe
|
||
I 2022/06/09 11:05:05 SWITCHBOARD *Indexed 537 words in URL https://www.criterion.com/current/posts/6374-jessica-hausner-s-little-joe [Jk4sYG_26NP5]
|
||
Description: Jessica Hausner’s Little Joe | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 6523 bytes |
|
||
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:05 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
|
||
I 2022/06/09 11:05:06 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/current/top-10-lists/216-paul-dano-s-top-10
|
||
I 2022/06/09 11:05:06 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/current/posts
|
||
I 2022/06/09 11:05:06 REJECTED https://www.criterion.com/current/top-10-lists/216-paul-dano-s-top-10 - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/current/top-10-lists/216-paul-dano-s-top-10 aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:34:26Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
|
||
I 2022/06/09 11:05:06 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[JfDyUG_26NP5 (1735154945000210432)]} 0 2
|
||
I 2022/06/09 11:05:06 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 11)) = 248
|
||
I 2022/06/09 11:05:06 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
|
||
I 2022/06/09 11:05:06 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke, 224223 bytes
|
||
I 2022/06/09 11:05:06 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke, STACKING TIME = 1, PARSING TIME = 71
|
||
I 2022/06/09 11:05:06 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://s3.amazonaws.com/criterion-production/films/84f05f64e1f280532d29c921e32e79f9/TpWnFHWJR7iUJtg686Ei4gSe78oZWc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:06 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 12)) = 244
|
||
I 2022/06/09 11:05:06 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke
|
||
I 2022/06/09 11:05:06 Fulltext indexing: Iuksgm_26NP5 https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke
|
||
I 2022/06/09 11:05:06 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Iuksgm_26NP5 (1735154945803419648)]} 0 2
|
||
I 2022/06/09 11:05:06 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=kinugasa-teinosuke [Iuksgm_26NP5]
|
||
Description: Teinosuke Kinugasa films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12509 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:06 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 513, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 10)) = 246
|
||
I 2022/06/09 11:05:07 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=to-johnnie, 224108 bytes
|
||
I 2022/06/09 11:05:07 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=to-johnnie, STACKING TIME = 1, PARSING TIME = 18
|
||
I 2022/06/09 11:05:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://s3.amazonaws.com/criterion-production/films/78f00702358370de10b7256ded97d10b/qh2QGOHZiI77jVyFWnv9ex9XhAUTy0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=to-johnnie
|
||
I 2022/06/09 11:05:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[IuaxXm_26NP5 (1735154946180907008)]} 0 2
|
||
I 2022/06/09 11:05:07 Fulltext indexing: IuaxXm_26NP5 https://www.criterion.com/shop/browse?director=to-johnnie
|
||
I 2022/06/09 11:05:07 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse?director=to-johnnie [IuaxXm_26NP5]
|
||
Description: Johnnie To films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12402 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:07 HTCACHE storing content of url https://www.criterion.com/current/author/208-david-chute, 50989 bytes
|
||
I 2022/06/09 11:05:07 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/author/208-david-chute, STACKING TIME = 1, PARSING TIME = 4
|
||
I 2022/06/09 11:05:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[H3dbaG_26NP5 (1735154946289958912)]} 0 48
|
||
I 2022/06/09 11:05:07 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/author/208-david-chute
|
||
I 2022/06/09 11:05:07 Fulltext indexing: H3dbaG_26NP5 https://www.criterion.com/current/author/208-david-chute
|
||
I 2022/06/09 11:05:07 SWITCHBOARD *Indexed 156 words in URL https://www.criterion.com/current/author/208-david-chute [H3dbaG_26NP5]
|
||
Description: David Chute | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 1826 bytes |
|
||
LinkStorageTime: 49 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:07 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 502, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 8)) = 243
|
||
I 2022/06/09 11:05:07 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=allegret-marc, 224653 bytes
|
||
I 2022/06/09 11:05:07 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=allegret-marc, STACKING TIME = 1, PARSING TIME = 20
|
||
I 2022/06/09 11:05:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://s3.amazonaws.com/criterion-production/films/28e497421d0e485cece5e9269c16af35/ZfUeJFt1aesMSRGbLbQqTuhBztrEGP_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1766-0926bfc8985e1333759badc8421feb20/CcaaWhABn32RGpmaY0QfWfQzx5pNgV_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:07 REJECTED https://www.criterion.com/current/posts/Braden%20King%20https:/twitter.com/bradenking/status/1478847388223692801 - no response body (http return code = 404)
|
||
I 2022/06/09 11:05:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[HX_aXG_26NP5 (1735154946424176640)]} 0 0
|
||
I 2022/06/09 11:05:07 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=allegret-marc
|
||
I 2022/06/09 11:05:07 Fulltext indexing: IJMy1m_26NP5 https://www.criterion.com/shop/browse/list?director=allegret-marc
|
||
I 2022/06/09 11:05:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[IJMy1m_26NP5 (1735154946470313984)]} 0 2
|
||
I 2022/06/09 11:05:07 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=allegret-marc [IJMy1m_26NP5]
|
||
Description: Marc Allégret films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12470 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:07 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 508, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 10)) = 244
|
||
I 2022/06/09 11:05:07 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 508, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 10)) = 244
|
||
I 2022/06/09 11:05:08 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 508, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 11)) = 243
|
||
I 2022/06/09 11:05:08 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=penn-arthur, 224275 bytes
|
||
I 2022/06/09 11:05:08 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=penn-arthur, STACKING TIME = 1, PARSING TIME = 32
|
||
I 2022/06/09 11:05:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi, 226453 bytes
|
||
I 2022/06/09 11:05:08 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 521, robots.delay = 0, ((waitig = 260) - (timeSinceLastAccess = 18)) = 242
|
||
I 2022/06/09 11:05:08 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi, STACKING TIME = 6, PARSING TIME = 55
|
||
I 2022/06/09 11:05:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/c02c4aa609824030e40216497743497f/dhf3ReLOhyiAk1XzOwejbjQYO9yibb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 HTCACHE storing content of url https://www.criterion.com/current/posts/1504-everlasting-process, 69206 bytes
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1791-33ac56c0ab6c46473fbb827201db6455/cl4pDTM4ZYZyfvJyUFlcDrISlIuq1Z_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/3cec2eb98c004b821d42cc6e14bfb0fc/MjYLPC6QAnciKmRDONHLk59AXcIvqR_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/e62f1e61b5f52c2aaeabaeceaf58b629/BenTqN2hpuF2PKWN8v0M0BZkEMiLAM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/8f2f7b3ac4527fdb2bc4d5dd0c4edc6a/REhK9I9DGXP9cIB0AMU5avMjepHy2I_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/1504-everlasting-process, STACKING TIME = 6, PARSING TIME = 19
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/films/f4a75fa9e4e3797bba7a179dd774d412/n4PstJirHzLRuR7mdBBOY7ntCG6Suo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://ericskillman.blogspot.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://s3.amazonaws.com/criterion-production/images/4578-2d849516a06bdf1cedc75cbe45b9686a/current_samsmyth_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 REJECTED https://samsmyth.blogspot.com/2010/06/process-everlasting-moments-dvd-cover.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:08 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=penn-arthur
|
||
I 2022/06/09 11:05:08 Fulltext indexing: G54Oum_26NP5 https://www.criterion.com/shop/browse/list?director=penn-arthur
|
||
I 2022/06/09 11:05:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[G54Oum_26NP5 (1735154947648913408)]} 0 4
|
||
I 2022/06/09 11:05:08 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=penn-arthur [G54Oum_26NP5]
|
||
Description: Arthur Penn films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12581 bytes |
|
||
LinkStorageTime: 11 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:08 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi
|
||
I 2022/06/09 11:05:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[GycWEm_26NP5 (1735154947727556608)]} 0 2
|
||
I 2022/06/09 11:05:08 Fulltext indexing: GycWEm_26NP5 https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi
|
||
I 2022/06/09 11:05:08 SWITCHBOARD *Indexed 1212 words in URL https://www.criterion.com/shop/browse/list?director=shimizu-hiroshi [GycWEm_26NP5]
|
||
Description: Hiroshi Shimizu films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12665 bytes |
|
||
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:08 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/posts/1504-everlasting-process
|
||
I 2022/06/09 11:05:08 Fulltext indexing: Go4m6G_26NP5 https://www.criterion.com/current/posts/1504-everlasting-process
|
||
I 2022/06/09 11:05:08 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
|
||
I 2022/06/09 11:05:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Go4m6G_26NP5 (1735154947744333824)]} 0 1
|
||
I 2022/06/09 11:05:08 SWITCHBOARD *Indexed 254 words in URL https://www.criterion.com/current/posts/1504-everlasting-process [Go4m6G_26NP5]
|
||
Description: Everlasting Process | Current | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 2937 bytes |
|
||
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:08 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
|
||
I 2022/06/09 11:05:08 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
|
||
I 2022/06/09 11:05:08 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@bac5c720[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wt(7.7.3):c176:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772696001}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wj(7.7.3):c145:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772667004}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wu(7.7.3):C8:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772698712}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wv(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772703827}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ww(7.7.3):C19:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772708817}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
|
||
I 2022/06/09 11:05:08 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
|
||
I 2022/06/09 11:05:08 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 11)) = 248
|
||
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=w-pabst-g, 226393 bytes
|
||
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=w-pabst-g, STACKING TIME = 1, PARSING TIME = 21
|
||
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/7d539514e0694356b704de0bd0985ddc/PDCF8VbHXZoKJ3Lhx3IvzOzNlg83oB_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/01f2b09b21d5479965b2b93422ba1072/tUKPob5t1DjOrGRVmJEEfRaalbiOzy_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/8cff4ed5c749a90d25f50667ab1908d2/erzB7rRyF3un8YavvACbHzygC7P0lm_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/61a18b721d7da3c80336baf91a7bd9f7/vjFC4gmkYHWrhV09dgqGRyHtLcJ5Rp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=w-pabst-g
|
||
I 2022/06/09 11:05:09 Fulltext indexing: GggVHm_26NP5 https://www.criterion.com/shop/browse/list?director=w-pabst-g
|
||
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[GggVHm_26NP5 (1735154948258136064)]} 0 5
|
||
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 1213 words in URL https://www.criterion.com/shop/browse/list?director=w-pabst-g [GggVHm_26NP5]
|
||
Description: G. W. Pabst films on Disc and Streaming | The Criterion Collection
|
||
MimeType: text/html | Charset: UTF-8 | Size: 12667 bytes |
|
||
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
|
||
I 2022/06/09 11:05:09 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 523, robots.delay = 0, ((waitig = 261) - (timeSinceLastAccess = 12)) = 249
|
||
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=franju-georges, 224696 bytes
|
||
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=franju-georges, STACKING TIME = 0, PARSING TIME = 20
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/fc3c1f281c7c268b95d6ccfcb9c09753/09SjT40uvff2VmvuzTfllqpz5yKg2o_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/257d1602c6a7da83cb3a70db7349bbaf/OXzoGLF8Og7Ffj3X2nJk35Q9nqpSa1_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/current/posts/5808-fall-festival-starters, 75140 bytes
|
||
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 69 LINKS FROM https://www.criterion.com/current/posts/5808-fall-festival-starters, STACKING TIME = 6, PARSING TIME = 10
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED http://film.britishcouncil.org/the-souvenir - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.independent.co.uk/arts-entertainment/films/features/matthias-schoenaerts-interview-film-racer-and-the-jailbird-terrence-malick-a8442311.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.esquire.com/entertainment/movies/a21068561/suspiria-teaser-trailer/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/embed/3uGIEY7tdg8?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED http://www.labiennale.org/en/news/restored-films-venezia-classici - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://deadline.com/2016/11/alfonso-cuaron-movie-fight-crew-mexico-city-police-1201847622/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/embed/PSoRx87OO6k?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED http://www.labiennale.org/en/news/first-man-damien-chazelle-opening-film-75th-venice-film-festival - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/eRQoFkgvhx4BSKGEehVCLrMDahjA17.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.filmlinc.org/nyff2018/daily/alfonso-cuarons-roma-announced-as-nyff56-centerpiece/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.nytimes.com/2018/07/18/movies/alfonso-cuarns-roma-new-york-film-festival.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED http://cineuropa.org/en/newsdetail/357023/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://variety.com/2018/film/festivals/first-man-damien-chazelle-ryan-gosling-venice-film-festival-opening-night-1202877318/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://variety.com/2018/film/news/the-sisters-brothers-suspiria-my-brilliant-friend-venice-film-festival-1202878014/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED http://www.labiennale.org/en/news/pre-opening-event-75th-festival-tuesday-28-august - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|
||
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/embed/Gj2oli0MLSU?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://variety.com/2017/film/podcasts/playback-podcast-damien-chazelle-la-la-land-1201963282/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=franju-georges
I 2022/06/09 11:05:09 Fulltext indexing: FdOYmm_26NP5 https://www.criterion.com/shop/browse/list?director=franju-georges
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FdOYmm_26NP5 (1735154948501405696)]} 0 6
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=franju-georges [FdOYmm_26NP5]
Description: Georges Franju films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12510 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/5808-fall-festival-starters
I 2022/06/09 11:05:09 Fulltext indexing: FWPlsG_26NP5 https://www.criterion.com/current/posts/5808-fall-festival-starters
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FWPlsG_26NP5 (1735154948544397312)]} 0 2
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 577 words in URL https://www.criterion.com/current/posts/5808-fall-festival-starters [FWPlsG_26NP5]
Description: Fall Festival Starters | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7404 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 HostQueue forcing crawl-delay of 250 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 520, robots.delay = 0, ((waitig = 260) - (timeSinceLastAccess = 10)) = 250
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/films/28017-the-wicked-lady, 70959 bytes
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 61 LINKS FROM https://www.criterion.com/films/28017-the-wicked-lady, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/614becb858a85ddb393d426b5743891d.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/044cdbf494d19764a30df44d01451bc7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/bd2452146ea73af63f76a550204841f9/4yJd8DqIYDNQ5jaoJpFxnPPkbFjdAG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/images/3913-13e0f359ffe35a4c4e0598e2e9db3246/madonnaof7moons_1432_003_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/772b7e5558887c5daed761fe8fd4153d/Y7bOdy6FLY9xSaO7A52u4izRfBkmmC_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/3b1076b25c261c70b6e88b3a91e91b01.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/74be83402097d3f4c5d9e7331de31471/43YJwgdfANxgafSJNaTDUwGxR8Gait_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.amazon.com/dp/B00JJH2I3W - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e13cb552465c40bea2e1f7459f42e861.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://itunes.apple.com/us/movie/id835386915?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1812-2bdeb818c107e95738f59894990c22b2/oTM1jWGYaWx6KHPFFGsXiyVmbdpCPi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/the-wicked-lady?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/b59efc718fc3309a4eb76255310280b8/MpKeZ33lims6VNmUQPeUZ06u1HCgd5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/films/28017-the-wicked-lady
I 2022/06/09 11:05:09 Fulltext indexing: FLju_e_26NP5 https://www.criterion.com/films/28017-the-wicked-lady
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FLju_e_26NP5 (1735154948663934976)]} 0 1
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 286 words in URL https://www.criterion.com/films/28017-the-wicked-lady [FLju_e_26NP5]
Description: The Wicked Lady (1945) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3024 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 HTCACHE storing content of url https://www.criterion.com/films/28019-madonna-of-the-seven-moons, 71648 bytes
I 2022/06/09 11:05:09 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 504, robots.delay = 0, ((waitig = 252) - (timeSinceLastAccess = 7)) = 245
I 2022/06/09 11:05:09 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/films/28019-madonna-of-the-seven-moons, STACKING TIME = 1, PARSING TIME = 65
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/33a18e5b6c729a18d7c79c6c5510f3a5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5516ffe44b57e6de93b7d80b7c0a7936.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/images/3913-13e0f359ffe35a4c4e0598e2e9db3246/madonnaof7moons_1432_003_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5f3050443408e30c95355064b9d16259.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f448ba6e5e6976d113fbadfdbf654f1a.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/cbbe2b461ec0a30f180e6745ee9577e7/Oyu16EuKreAQCBgp029vYl7Nx6NV86_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1812-2bdeb818c107e95738f59894990c22b2/oTM1jWGYaWx6KHPFFGsXiyVmbdpCPi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/bd2452146ea73af63f76a550204841f9/4yJd8DqIYDNQ5jaoJpFxnPPkbFjdAG_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.amazon.com/dp/B00JE58M84 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.criterionchannel.com/madonna-of-the-seven-moons?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ae888c07c9f50bca9b2df4ca8ab675b8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/224ad3784ebfb34f98b1af628337f3da/gf5q2Dxvw2rDGLoNCNOnF3L53EUKqK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://itunes.apple.com/us/movie/id826939330?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 REJECTED https://s3.amazonaws.com/criterion-production/films/e033d65a68d10236f332c06ce3725b59/5Pvpc7nsRpgo8lu3IQVlAeheME1g9w_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:09 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/films/28019-madonna-of-the-seven-moons
I 2022/06/09 11:05:09 Fulltext indexing: FLa3He_26NP5 https://www.criterion.com/films/28019-madonna-of-the-seven-moons
I 2022/06/09 11:05:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FLa3He_26NP5 (1735154948956487680)]} 0 3
I 2022/06/09 11:05:09 SWITCHBOARD *Indexed 289 words in URL https://www.criterion.com/films/28019-madonna-of-the-seven-moons [FLa3He_26NP5]
Description: Madonna of the Seven Moons (1945) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3000 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:09 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 504, robots.delay = 0, ((waitig = 252) - (timeSinceLastAccess = 11)) = 241
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer, 76386 bytes
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer, STACKING TIME = 1, PARSING TIME = 10
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/f4a75fa9e4e3797bba7a179dd774d412/n4PstJirHzLRuR7mdBBOY7ntCG6Suo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/aaceb4cad8621ca9617f4c00b8ad4748/5xi0GwA3BbtdOq3TOnIIC17roOR2Pu_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/images/6801-6a0c932a832bfd1bc395b0884eecdc17/Screen_Shot_2016-03-09_at_2.18.04_PM_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/ccf8a3f2353002103ef420fd02fe2585/cE4nJ2rcnsqFoXZOGdTHQz1j9zLv3e_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/images/6640-e6a74f730289b5b381e1e83111c845dc/livmax_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/af3a424e036ce6064ba4c3b884c82128/cwe2k8wIo3C0zHyHipEgHMC2IsVcXq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED http://www.filmcomment.com/blog/interview-jan-troell/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books, 68580 bytes
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books, STACKING TIME = 1, PARSING TIME = 15
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer
I 2022/06/09 11:05:10 Fulltext indexing: FCB1VG_26NP5 https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer
I 2022/06/09 11:05:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FCB1VG_26NP5 (1735154949346557952)]} 0 4
I 2022/06/09 11:05:10 SWITCHBOARD *Indexed 450 words in URL https://www.criterion.com/current/posts/3965-jan-troell-enduring-film-pioneer [FCB1VG_26NP5]
Description: Jan Troell, Enduring Film Pioneer | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5455 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:10 SWITCHBOARD Excluded 25 words in URL https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books
I 2022/06/09 11:05:10 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:05:10 Fulltext indexing: Enn3sG_26NP5 https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books
I 2022/06/09 11:05:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Enn3sG_26NP5 (1735154949379063808)]} 0 2
I 2022/06/09 11:05:10 SWITCHBOARD *Indexed 274 words in URL https://www.criterion.com/current/posts/1061-on-it-s-impossible-to-learn-to-plowby-reading-books [Enn3sG_26NP5]
Description: On It’s Impossible to Learn to Plowby Reading Books | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3654 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:10 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 489, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema, 177782 bytes
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema, STACKING TIME = 2, PARSING TIME = 13
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/current/posts/801-the-lacemaker, 67077 bytes
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/801-the-lacemaker, STACKING TIME = 1, PARSING TIME = 10
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 473, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:10 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema
I 2022/06/09 11:05:10 Fulltext indexing: Em7eeG_26NP5 https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema
I 2022/06/09 11:05:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Em7eeG_26NP5 (1735154950026035200)]} 0 14
I 2022/06/09 11:05:10 SWITCHBOARD *Indexed 1527 words in URL https://www.criterion.com/current/posts/397-kill-rebel-samurai-cinema [Em7eeG_26NP5]
Description: Kill!: Rebel Samurai Cinema | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 24560 bytes |
LinkStorageTime: 16 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:10 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/801-the-lacemaker
I 2022/06/09 11:05:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[EjPnwG_26NP5 (1735154950051201024)]} 0 1
I 2022/06/09 11:05:10 Fulltext indexing: EjPnwG_26NP5 https://www.criterion.com/current/posts/801-the-lacemaker
I 2022/06/09 11:05:10 SWITCHBOARD *Indexed 432 words in URL https://www.criterion.com/current/posts/801-the-lacemaker [EjPnwG_26NP5]
Description: The Lacemaker | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5669 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:10 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:05:10 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:05:10 HTCACHE storing content of url https://www.criterion.com/films/31785-once-upon-a-time-in-china-v, 68838 bytes
I 2022/06/09 11:05:10 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/LegWC4s0uQ2V7gyLB5vUk8PKN3soHqTiGHJRtfCs.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1987-beb71f216e96d1ff2d0f8231f5b8b975/44LVkvftLRcr5paF4enJfBFTe5mI2c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/yzWM7KVvJrSi1rjOfBNHtiCAotHAYB6AMYoqRyZq.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/31785-once-upon-a-time-in-china-v, STACKING TIME = 6, PARSING TIME = 14
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/e8d4cd2c5dd1b2541a3c8325a6d1805f/W5gKGPvtkm5evOJ9devGYL935KMAdy_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/b796fc21a57558358eb7f9e54fa5e6d0/2WXT8ULgPXXbikn39pHMz1m7dVbtt7_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/tXUnJYbZkEw00gUBu3JqZMEsT5PUfzfkaYukXytQ.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/wSkXZAgGlsAZuwZjPkBwJNtFTlGtpESWZyGxXSIq.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/YHKHgqSXtNOG2QX3R98RPaZwblXrPHoxSOe6oSL1.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://s3.amazonaws.com/criterion-production/films/cd4ae6bdbaea9fd9c9aff1d69f924bc4/5wErYoFwVfkciAfnpRbFIhPqv7tIC5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:10 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/tL2y6baWLC9qRe0Xa6hlsVKbZKpfXMQNnF5JjZjn.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/films/31785-once-upon-a-time-in-china-v
I 2022/06/09 11:05:11 Fulltext indexing: EXDJ_e_26NP5 https://www.criterion.com/films/31785-once-upon-a-time-in-china-v
I 2022/06/09 11:05:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[EXDJ_e_26NP5 (1735154950220021760)]} 0 4
I 2022/06/09 11:05:11 SWITCHBOARD *Indexed 301 words in URL https://www.criterion.com/films/31785-once-upon-a-time-in-china-v [EXDJ_e_26NP5]
Description: Once Upon a Time in China V (1994) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3005 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:11 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:05:11 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:05:11 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:11 HTCACHE storing content of url https://www.criterion.com/current/posts/3605-three-reasons-the-bridge, 65954 bytes
I 2022/06/09 11:05:11 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/3605-three-reasons-the-bridge, STACKING TIME = 1, PARSING TIME = 9
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.youtube.com/embed/AqAQsYmu_1Q?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/films/dc0c6d72367b18727c846047f2b39cb6/JOpfLu2e0pJYqYoJ67UvpPXUgKcIP1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/posts/3605-three-reasons-the-bridge
I 2022/06/09 11:05:11 Fulltext indexing: DrKO6G_26NP5 https://www.criterion.com/current/posts/3605-three-reasons-the-bridge
I 2022/06/09 11:05:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DrKO6G_26NP5 (1735154950776815616)]} 0 2
I 2022/06/09 11:05:11 SWITCHBOARD *Indexed 217 words in URL https://www.criterion.com/current/posts/3605-three-reasons-the-bridge [DrKO6G_26NP5]
Description: Three Reasons: The Bridge | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2536 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:11 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=clouse-robert, 225276 bytes
I 2022/06/09 11:05:11 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=clouse-robert, STACKING TIME = 4, PARSING TIME = 82
I 2022/06/09 11:05:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/films/294fc27b4aa7b43a4f34fbce39a90e89/HVUGoVCQ8abK5Srs4VPKUfbR66QTG1_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1957-f90d4c48a2f932ffe7df386499f9477e/73k4EkSiXEfsdi097fieFBGdb39vlg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/films/d3ef5d03f0388465ff6d95625ee4e504/TPEJFhfV5tnvG022UXCwt2LOHhis7v_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=clouse-robert
I 2022/06/09 11:05:11 Fulltext indexing: DsMcPm_26NP5 https://www.criterion.com/shop/browse/list?director=clouse-robert
I 2022/06/09 11:05:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DsMcPm_26NP5 (1735154951059931136)]} 0 4
I 2022/06/09 11:05:11 SWITCHBOARD *Indexed 1203 words in URL https://www.criterion.com/shop/browse/list?director=clouse-robert [DsMcPm_26NP5]
Description: Robert Clouse films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12567 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:11 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=soukis-robert, 224790 bytes
I 2022/06/09 11:05:11 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 467, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:05:11 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=soukis-robert, STACKING TIME = 5, PARSING TIME = 36
I 2022/06/09 11:05:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/films/a40089af5a99e65997b7ab84b63c23e6/xwkezz5WNkgtavKsf03aC9CxPq4gTb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:11 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1809-a4a8b84c4cbcababe9073629fd726b50/S4JbdHupEZur0VszttgXmG8hjRdrG2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=soukis-robert
I 2022/06/09 11:05:12 Fulltext indexing: DneMom_26NP5 https://www.criterion.com/shop/browse/list?director=soukis-robert
I 2022/06/09 11:05:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DneMom_26NP5 (1735154951328366592)]} 0 2
I 2022/06/09 11:05:12 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=soukis-robert [DneMom_26NP5]
Description: Robert Soukis films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12525 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:12 HTCACHE storing content of url https://www.criterion.com/current/author/859-sean-gilman, 49572 bytes
I 2022/06/09 11:05:12 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/859-sean-gilman, STACKING TIME = 0, PARSING TIME = 7
I 2022/06/09 11:05:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DXC_GG_26NP5 (1735154951369261056)]} 0 1
I 2022/06/09 11:05:12 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/author/859-sean-gilman
I 2022/06/09 11:05:12 Fulltext indexing: DXC_GG_26NP5 https://www.criterion.com/current/author/859-sean-gilman
I 2022/06/09 11:05:12 SWITCHBOARD *Indexed 126 words in URL https://www.criterion.com/current/author/859-sean-gilman [DXC_GG_26NP5]
Description: Sean Gilman | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1479 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:12 HostQueue forcing crawl-delay of 166 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 458, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 84)) = 166
I 2022/06/09 11:05:12 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=dieterle-william, 224237 bytes
I 2022/06/09 11:05:12 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=dieterle-william, STACKING TIME = 1, PARSING TIME = 31
I 2022/06/09 11:05:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/films/711e752ce9b5c42995bb463693a0f371/RN7rkLPEIs51hWPn9yes29Zlzsj0cW_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 463, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:05:12 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=dieterle-william
I 2022/06/09 11:05:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DeGpkm_26NP5 (1735154951782400000)]} 0 2
I 2022/06/09 11:05:12 Fulltext indexing: DeGpkm_26NP5 https://www.criterion.com/shop/browse/list?director=dieterle-william
I 2022/06/09 11:05:12 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=dieterle-william [DeGpkm_26NP5]
Description: William Dieterle films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12494 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:12 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 463, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:05:12 HTCACHE storing content of url https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child, 70747 bytes
I 2022/06/09 11:05:12 REJECTED https://criterion-production.s3.amazonaws.com/quRlGcDfNCBO6BhcNNb1VaeKiJjpie.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.screendaily.com/reviews/zombi-child-cannes-review/5139570.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://film.avclub.com/more-zombies-and-a-new-downer-from-a-past-cannes-winne-1834839786 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://filmmakermagazine.com/107533-cannes-2019-dispatch-2-bacurau-zombi-child/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 SWITCHBOARD CRAWL: ADDED 60 LINKS FROM https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child, STACKING TIME = 9, PARSING TIME = 16
I 2022/06/09 11:05:12 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=cline-edward, 224209 bytes
I 2022/06/09 11:05:12 REJECTED https://www.telegraph.co.uk/films/0/zombi-child-review-disquieting-tale-voodoo-colonialism-la-francaise/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-5-haitian-zombis-insidious-plants-takashi-miike - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.quinzaine-realisateurs.com/en/film/zombi-child/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.hollywoodreporter.com/review/zombi-child-review-1210505 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 SWITCHBOARD Excluded 23 words in URL https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child
I 2022/06/09 11:05:12 Fulltext indexing: DI69TG_26NP5 https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child
I 2022/06/09 11:05:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DI69TG_26NP5 (1735154952223850496)]} 0 3
I 2022/06/09 11:05:12 SWITCHBOARD *Indexed 504 words in URL https://www.criterion.com/current/posts/6378-bertrand-bonello-s-zombi-child [DI69TG_26NP5]
Description: Bertrand Bonello’s Zombi Child | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6025 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:12 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=cline-edward, STACKING TIME = 1, PARSING TIME = 114
I 2022/06/09 11:05:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://s3.amazonaws.com/criterion-production/films/ed560ce125d981a74da5f0b112c643c4/L3qXe0Ml9IhoWPs1K4gtHCLOCKSxMe_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:12 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 461, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:05:13 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=cline-edward
I 2022/06/09 11:05:13 Fulltext indexing: DPM8Wm_26NP5 https://www.criterion.com/shop/browse/list?director=cline-edward
I 2022/06/09 11:05:13 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[DPM8Wm_26NP5 (1735154952318222336)]} 0 2
I 2022/06/09 11:05:13 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=cline-edward [DPM8Wm_26NP5]
Description: Edward Cline films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12499 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:05:13 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=zemeckis-robert, 224178 bytes
I 2022/06/09 11:05:13 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=zemeckis-robert, STACKING TIME = 1, PARSING TIME = 29
I 2022/06/09 11:05:13 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:13 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:05:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
|