crawler/DATA/LOG/yacy119.log
2025-03-26 09:12:37 +09:00

5484 lines
1.0 MiB

I 2022/06/09 11:03:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/8a356a6e467b1afa1e71357294fa98ab.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6639-/Dq2KgcCxAhBaHCaklmh2ymdWM4bXfs_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.criterionchannel.com/a-brief-history-of-time?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/59391d0914ee5e1f9ae246722f7b2d0e/84gqckqQjOBKLal72bAP241JlCOFPs_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/058efdfc5c225efceab5482e14293a25.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://itunes.apple.com/us/movie/id805754216?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/films/28559-a-brief-history-of-time
I 2022/06/09 11:03:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[C_gOFe_26NP5 (1735154841854935040)]} 0 2
I 2022/06/09 11:03:27 Fulltext indexing: C_gOFe_26NP5 https://www.criterion.com/films/28559-a-brief-history-of-time
I 2022/06/09 11:03:27 SWITCHBOARD *Indexed 382 words in URL https://www.criterion.com/films/28559-a-brief-history-of-time [C_gOFe_26NP5]
Description: A Brief History of Time (1991) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5011 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:27 HTCACHE storing content of url https://www.criterion.com/shop/collection/186-pier-paolo-pasolini/list, 56883 bytes
I 2022/06/09 11:03:27 SWITCHBOARD CRAWL: ADDED 50 LINKS FROM https://www.criterion.com/shop/collection/186-pier-paolo-pasolini/list, STACKING TIME = 1, PARSING TIME = 5
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/ee8a1a60af6f29665de6356f7feb9e3a/FFe8SUEMJEMlJw100DkEOXVejyDsha_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/8d41805aabbf9c68049033a9e54fc4ca/5HBkbTpi2BcdDfPwUmTIH76T5jR9jA_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/e1710ec4789ca8b72c4fe33f3c448330/ZGHNHmAeEKny73AkgfjtzVBra8ZZDH_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/04fdbad7f9a7fc61fe5d6c7261e46845/sJaIpk06p5yy7DpmKUVe39P4KDuYSi_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/65d5e2b831a35d5d9834dc81f2cdf43d/q4Zt7iiONKEVKdkAQb9yvCDgXOT8BG_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/22e6497dc50a3771d97e6cee35b58ad1/cXPrZSzVV5Zs7cKMSLjmXBckfyGiN9_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/56166014193fe00f622115685dba2487/g6y2g4zZHy2J3caJizYcgZBfh7FnQ9_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 REJECTED https://s3.amazonaws.com/criterion-production/films/c1dca25b53f05bf28a029306a986f9be/8hOAp0htPTJ4M1LATpfXoCZKwxOu7y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:27 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/shop/collection/186-pier-paolo-pasolini/list
I 2022/06/09 11:03:27 Fulltext indexing: C-Y8Nm_26NP5 https://www.criterion.com/shop/collection/186-pier-paolo-pasolini/list
I 2022/06/09 11:03:27 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[C-Y8Nm_26NP5 (1735154842002784256)]} 0 1
I 2022/06/09 11:03:27 SWITCHBOARD *Indexed 174 words in URL https://www.criterion.com/shop/collection/186-pier-paolo-pasolini/list [C-Y8Nm_26NP5]
Description: Pier Paolo Pasolini | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2013 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:27 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 347, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:03:28 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 347, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:28 HTCACHE storing content of url https://www.criterion.com/current/posts/5678-2-or-3-things-i-know-about-godzilla, 82262 bytes
I 2022/06/09 11:03:28 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 345, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 6)) = 244
I 2022/06/09 11:03:28 REJECTED https://www.filmstruck.com/us/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://criterion-production.s3.amazonaws.com/dJ8vMmmNV59H1EeHBQy24FY4BXnc2C.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://criterion-production.s3.amazonaws.com/KaLp83rs7MOMOpdBh1ao1o3vyFH0g1.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/5678-2-or-3-things-i-know-about-godzilla, STACKING TIME = 6, PARSING TIME = 14
I 2022/06/09 11:03:28 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://s3.amazonaws.com/criterion-production/films/f0260853d1a5a44039eabd2bbcd9af56/vKeL3Z8JQS8CcpVGf71hCklwYsRZ4G_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 LOADER CRAWLER Redirection detected ('HTTP/1.1 301 Moved Permanently') for URL https://www.criterion.com/current/posts/
I 2022/06/09 11:03:28 LOADER CRAWLER ..Redirecting request to: http://www.criterion.com/current/posts
I 2022/06/09 11:03:28 HostQueue opened HostQueue /root/yacy/DATA/INDEX/webportal/QUEUES/CrawlerCoreStacks/www.criterion.com-#HUGk5Z.80 with 0 urls.
I 2022/06/09 11:03:28 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/5678-2-or-3-things-i-know-about-godzilla
I 2022/06/09 11:03:28 Fulltext indexing: C9dBGG_26NP5 https://www.criterion.com/current/posts/5678-2-or-3-things-i-know-about-godzilla
I 2022/06/09 11:03:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[C9dBGG_26NP5 (1735154842706378752)]} 0 3
I 2022/06/09 11:03:28 SWITCHBOARD *Indexed 1022 words in URL https://www.criterion.com/current/posts/5678-2-or-3-things-i-know-about-godzilla [C9dBGG_26NP5]
Description: 2 or 3 Things I Know About Godzilla | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14667 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:28 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=dunham-lena, 224161 bytes
I 2022/06/09 11:03:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[C5zpxG_26NP5 (1735154842759856128)]} 0 0
I 2022/06/09 11:03:28 REJECTED https://www.criterion.com/current/posts/ - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/current/posts/ to http://www.criterion.com/current/posts placed on crawler queue for double-check
I 2022/06/09 11:03:28 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=dunham-lena, STACKING TIME = 0, PARSING TIME = 21
I 2022/06/09 11:03:28 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://s3.amazonaws.com/criterion-production/films/2881929c283b8fb2cb0fe1d8c05bc189/vosQefD5ERUUYpVjTbTIBXTl4TX8vQ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:28 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=dunham-lena
I 2022/06/09 11:03:28 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:03:28 Fulltext indexing: C9j_nm_26NP5 https://www.criterion.com/shop/browse/list?director=dunham-lena
I 2022/06/09 11:03:28 HostBalancer (re-)initialized the round-robin queue; 2 hosts.
I 2022/06/09 11:03:28 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[C9j_nm_26NP5 (1735154842862616576)]} 0 50
I 2022/06/09 11:03:28 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=dunham-lena [C9j_nm_26NP5]
Description: Lena Dunham films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12477 bytes |
LinkStorageTime: 54 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:28 HostQueue forcing crawl-delay of 167 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 352, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 83)) = 167
I 2022/06/09 11:03:28 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:03:28 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@7a92f3a2[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772539925}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vz(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604529}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w0(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604627}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w1(7.7.3):C16:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, 
os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772608698}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:03:28 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:03:28 HostBalancer (re-)initialized the round-robin queue; 2 hosts.
I 2022/06/09 11:03:28 LOADER Forcing sleep of 233 ms for host www.criterion.com
I 2022/06/09 11:03:28 HostQueue forcing crawl-delay of 224 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 352, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 26)) = 224
I 2022/06/09 11:03:29 HTCACHE storing content of url https://www.criterion.com/current/posts/2328-a-tale-of-two-blobs, 71331 bytes
I 2022/06/09 11:03:29 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/2328-a-tale-of-two-blobs, STACKING TIME = 1, PARSING TIME = 10
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/images/4929-c623b53271f132fdc57e7352dce71b76/Blob_Current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.youtube.com/embed/GODDLgM1gKo?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED http://thecolonialtheatre.com/category/events/blobfest/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/6790c92a200a349fa2918b59929a5b7c/SxJFUmRYI29BBqY4Im7uNTWIYo2pJi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/2328-a-tale-of-two-blobs
I 2022/06/09 11:03:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[C3q8EG_26NP5 (1735154843377467392)]} 0 2
I 2022/06/09 11:03:29 Fulltext indexing: C3q8EG_26NP5 https://www.criterion.com/current/posts/2328-a-tale-of-two-blobs
I 2022/06/09 11:03:29 SWITCHBOARD *Indexed 296 words in URL https://www.criterion.com/current/posts/2328-a-tale-of-two-blobs [C3q8EG_26NP5]
Description: A Tale of Two Blobs | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3255 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:29 LOADER Forcing sleep of 249 ms for host www.criterion.com
I 2022/06/09 11:03:29 HostQueue forcing crawl-delay of 170 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 348, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 81)) = 169
I 2022/06/09 11:03:29 LOADER CRAWLER Redirection detected ('HTTP/1.1 301 Moved Permanently') for URL http://www.criterion.com/current/posts
I 2022/06/09 11:03:29 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/current/posts
I 2022/06/09 11:03:29 REJECTED http://www.criterion.com/current/posts - cannot load: load error - CRAWLER Redirect of URL=http://www.criterion.com/current/posts aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:34:26Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:03:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[w55JSIHUGk5Z (1735154843400536064)]} 0 0
I 2022/06/09 11:03:29 LOADER Forcing sleep of 68 ms for host www.criterion.com
I 2022/06/09 11:03:29 LOADER Forcing sleep of 58 ms for host www.criterion.com
I 2022/06/09 11:03:29 LOADER Forcing sleep of 47 ms for host www.criterion.com
I 2022/06/09 11:03:29 LOADER Forcing sleep of 36 ms for host www.criterion.com
I 2022/06/09 11:03:29 LOADER Forcing sleep of 26 ms for host www.criterion.com
I 2022/06/09 11:03:29 LOADER Forcing sleep of 15 ms for host www.criterion.com
I 2022/06/09 11:03:29 LOADER Forcing sleep of 4 ms for host www.criterion.com
I 2022/06/09 11:03:29 HostQueue forcing crawl-delay of 248 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 348, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 2)) = 248
I 2022/06/09 11:03:29 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=sirk-douglas, 225342 bytes
I 2022/06/09 11:03:29 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=sirk-douglas, STACKING TIME = 1, PARSING TIME = 24
I 2022/06/09 11:03:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/e033d65a68d10236f332c06ce3725b59/5Pvpc7nsRpgo8lu3IQVlAeheME1g9w_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/ef45f7fd0ccef1e881abc80c6b24ff60/GgZDqyYrxCYlRn5FxwXi15SB4qyUTU_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/a782f16d09b72735c821138834714959/9C6jYX9HITuLUZsznj9IvSTDeliVF7_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=sirk-douglas
I 2022/06/09 11:03:29 Fulltext indexing: C5US4m_26NP5 https://www.criterion.com/shop/browse/list?director=sirk-douglas
I 2022/06/09 11:03:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[C5US4m_26NP5 (1735154843875540992)]} 0 2
I 2022/06/09 11:03:29 SWITCHBOARD *Indexed 1201 words in URL https://www.criterion.com/shop/browse/list?director=sirk-douglas [C5US4m_26NP5]
Description: Douglas Sirk films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12620 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:29 HTCACHE storing content of url https://www.criterion.com/current/posts/2025-in-praise-of-karloff-the-uncanny, 110106 bytes
I 2022/06/09 11:03:29 SWITCHBOARD CRAWL: ADDED 57 LINKS FROM https://www.criterion.com/current/posts/2025-in-praise-of-karloff-the-uncanny, STACKING TIME = 6, PARSING TIME = 16
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/3b3f5d2ed5284982b3c8ad89529a0004/hcXpUtUe5U86AeRXF6U4kP5WhAfxFK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 355, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/79db9535ede5c086db79d6ecdd52ee38/PdAnbV2oK5GcJBQSQ7Xxd480zUoIZg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 HTCACHE storing content of url https://www.criterion.com/films/28587-zatoichi-in-desperation, 70467 bytes
I 2022/06/09 11:03:29 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/films/28587-zatoichi-in-desperation, STACKING TIME = 5, PARSING TIME = 10
I 2022/06/09 11:03:29 REJECTED https://itunes.apple.com/us/movie/id731150883?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/images/4059-f72dad77dca95364d1d29c5143879ade/zatoichi_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/images/7534-8c4ef090a24dba6f61af4126a673cd93/Current_28304id_013_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/173480d6f89e5fb1fe8fab2f8e7ccc5c/rlnW3TLUrVZJcJ8ZyWDnE0avCnaEnx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.amazon.com/dp/B00G4HYNQU - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/67b1624d36723adc095a547dbab9a4ea/thEX8E524itp1BxKsD1fZYPogk6S3g_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/5452396f7aaef64efffa8fd81a21816d/D6xjbVqepRZC3vkCg6udVJ3cp7ryXM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/15d92efb60fb6a1c407d937dfa1e70cc.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/46814ce3abeb4b2311fb8543f8cccfc9/RxLBoxAZHDluyMr4dol7kupZYOuvpz_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/2025-in-praise-of-karloff-the-uncanny
I 2022/06/09 11:03:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ChAZkG_26NP5 (1735154844051701760)]} 0 3
I 2022/06/09 11:03:29 Fulltext indexing: ChAZkG_26NP5 https://www.criterion.com/current/posts/2025-in-praise-of-karloff-the-uncanny
I 2022/06/09 11:03:29 SWITCHBOARD *Indexed 871 words in URL https://www.criterion.com/current/posts/2025-in-praise-of-karloff-the-uncanny [ChAZkG_26NP5]
Description: In Praise of Karloff the Uncanny | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 11065 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:29 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/films/28587-zatoichi-in-desperation
I 2022/06/09 11:03:29 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CyDCme_26NP5 (1735154844063236096)]} 0 1
I 2022/06/09 11:03:29 Fulltext indexing: CyDCme_26NP5 https://www.criterion.com/films/28587-zatoichi-in-desperation
I 2022/06/09 11:03:29 SWITCHBOARD *Indexed 271 words in URL https://www.criterion.com/films/28587-zatoichi-in-desperation [CyDCme_26NP5]
Description: Zatoichi in Desperation (1972) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2821 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:29 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 354, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:29 HTCACHE storing content of url https://www.criterion.com/current/posts/2026--my-mind-was-always-on-the-commoners-shindo-on-kuroneko-in-his-body-of-work, 134619 bytes
I 2022/06/09 11:03:29 SWITCHBOARD CRAWL: ADDED 57 LINKS FROM https://www.criterion.com/current/posts/2026--my-mind-was-always-on-the-commoners-shindo-on-kuroneko-in-his-body-of-work, STACKING TIME = 4, PARSING TIME = 13
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/films/173480d6f89e5fb1fe8fab2f8e7ccc5c/rlnW3TLUrVZJcJ8ZyWDnE0avCnaEnx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7727-/qx2k3PIjLfkrgAFY3PEEkGgPA1fCk5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/images/3767-b1b5b9ec8646eb134004b2c83e9914b9/current_1005_064_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/tout_image/7735-/kECvBjKyZyFG0JI5nP3XVFfNN0ztz2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/tout_image/7760-/OaHu0IpnoI9bUBJ6FlnbJ8HwWBgVc1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:29 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7769-/NBQbLbxo8yFqiUVeRMXqV1qjES6eSF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/2026--my-mind-was-always-on-the-commoners-shindo-on-kuroneko-in-his-body-of-work
I 2022/06/09 11:03:30 Fulltext indexing: CibP3G_26NP5 https://www.criterion.com/current/posts/2026--my-mind-was-always-on-the-commoners-shindo-on-kuroneko-in-his-body-of-work
I 2022/06/09 11:03:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CibP3G_26NP5 (1735154844443869184)]} 0 5
I 2022/06/09 11:03:30 SWITCHBOARD *Indexed 860 words in URL https://www.criterion.com/current/posts/2026--my-mind-was-always-on-the-commoners-shindo-on-kuroneko-in-his-body-of-work [CibP3G_26NP5]
Description: “My Mind Was Always on the Commoners”: Shindo on Kuroneko in His Body of Work | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 16190 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:30 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 358, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:03:30 HTCACHE storing content of url https://www.criterion.com/current/posts/2385-chef-du-cinema-my-man-godfrey, 91772 bytes
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7820-/YKRO4oMLJouJhzJ0UVxMAsXpd2O4sF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED http://chefducinema.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://s3.amazonaws.com/criterion-production/images/4946-2bcab35000daa3f9821ae8aa04ba13d9/godfrey_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7799-/BcSOGzRAmQVrNktlZ6a8juCv8YVP7g_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://s3.amazonaws.com/criterion-production/films/faae05a1b170149a490b96749d940a7f/BeabnGRtbsbWPi0gQYY7xJClHmZjMg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 SWITCHBOARD CRAWL: ADDED 58 LINKS FROM https://www.criterion.com/current/posts/2385-chef-du-cinema-my-man-godfrey, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7821-/DIxDSXUzZ7yk0yAo3JyIs4aQdkSJqq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.youtube.com/embed/SOI3dHzg2KE?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/tout_image/7812-/cfFE90t4MLOwAR88lqDIdl2lvUw6SX_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/2385-chef-du-cinema-my-man-godfrey
I 2022/06/09 11:03:30 Fulltext indexing: CkBEJG_26NP5 https://www.criterion.com/current/posts/2385-chef-du-cinema-my-man-godfrey
I 2022/06/09 11:03:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CkBEJG_26NP5 (1735154844601155584)]} 0 2
I 2022/06/09 11:03:30 SWITCHBOARD *Indexed 697 words in URL https://www.criterion.com/current/posts/2385-chef-du-cinema-my-man-godfrey [CkBEJG_26NP5]
Description: Chef du Cinema: My Man Godfrey | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 9656 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:30 HTCACHE storing content of url https://www.criterion.com/current/author/206-geoffrey-macnab, 54974 bytes
I 2022/06/09 11:03:30 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/current/author/206-geoffrey-macnab, STACKING TIME = 1, PARSING TIME = 11
I 2022/06/09 11:03:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=nichols-mike, 224150 bytes
I 2022/06/09 11:03:30 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 390, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:30 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/author/206-geoffrey-macnab
I 2022/06/09 11:03:30 Fulltext indexing: ClQ64G_26NP5 https://www.criterion.com/current/author/206-geoffrey-macnab
I 2022/06/09 11:03:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ClQ64G_26NP5 (1735154844805627904)]} 0 1
I 2022/06/09 11:03:30 SWITCHBOARD *Indexed 194 words in URL https://www.criterion.com/current/author/206-geoffrey-macnab [ClQ64G_26NP5]
Description: Geoffrey Macnab | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3021 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:30 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=nichols-mike, STACKING TIME = 1, PARSING TIME = 26
I 2022/06/09 11:03:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://s3.amazonaws.com/criterion-production/films/c1cb7c7c93760075005158d586b67d45/ace3Y8tk9zZ6RU58hIflxVsxZUIA2B_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=nichols-mike
I 2022/06/09 11:03:30 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ClI52m_26NP5 (1735154844897902592)]} 0 2
I 2022/06/09 11:03:30 Fulltext indexing: ClI52m_26NP5 https://www.criterion.com/shop/browse/list?director=nichols-mike
I 2022/06/09 11:03:30 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=nichols-mike [ClI52m_26NP5]
Description: Mike Nichols films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12462 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:30 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 390, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:30 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=gutierrez-alea-tomas, 224268 bytes
I 2022/06/09 11:03:30 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=gutierrez-alea-tomas, STACKING TIME = 1, PARSING TIME = 24
I 2022/06/09 11:03:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://s3.amazonaws.com/criterion-production/films/ab7f9848b3ca887189e1062a54bc1cf8/KsP2XYjmdLyOg5TnhB6uZpsw2DhU95_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 HTCACHE storing content of url https://www.criterion.com/current/posts/1003-the-sounds-of-the-last-emperor, 60710 bytes
I 2022/06/09 11:03:30 HTCACHE storing content of url https://www.criterion.com/shop/collection/88-america-america/list, 77697 bytes
I 2022/06/09 11:03:30 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 447, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 7)) = 243
I 2022/06/09 11:03:30 SWITCHBOARD CRAWL: ADDED 39 LINKS FROM https://www.criterion.com/current/posts/1003-the-sounds-of-the-last-emperor, STACKING TIME = 6, PARSING TIME = 14
I 2022/06/09 11:03:30 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:30 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 SWITCHBOARD CRAWL: ADDED 88 LINKS FROM https://www.criterion.com/shop/collection/88-america-america/list, STACKING TIME = 6, PARSING TIME = 27
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1862-a6d27bb1690cceebe0e316b9d42ad371/Mk6tOX6R5Dhlgj3yJMCixqZ8WNp7XK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/0209c2e9a550dc761a71347df9f9dfec/eVnquBJJylhxw6k2m7RFM1HnrId6Su_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/b2bb5976af7cadce150efc34535ee8eb/rWqt8eIuJvh8LS4tAQYKV6gOl4bnHs_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/2eab6c19498ccc9c45709af90f0cc5da/POE0rbVpiFeXo7erOISf126v663x9J_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/6facf4e6621b173ff79555f9eddb9f27/8g5alAbUvCYJg7YGwVq1B3KHU7OrN4_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/f2abc1a7be5b54ca95c85cba8ea0dec5/B54vSU2FueuGdEg0nrnhNaAlZcXU9P_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/7d7a1f90e727ed3ed3a484e831a1aff1/A6RNDXhKE0ae9792X8Ws0pzvNS4XFi_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/c701f80ae220c9948a01832f1b96bb49/pXj4meWTEKkclkBiVCrtId5yTzyGKL_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/8f6204686eb4d6748b8694fb686ccae8/TH4q5H01qsQUmk7G5kJDx7gvxk0gFZ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/14e0f640159eab2ffad751c2576b03ad/wVg4QTiXtwJdDc0Bj69D5j8CrAhn8C_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/859cb0d19e424453f8d3634b401bb8c3/XyQ5pkrk4GrUByTTdbK4LQRskAytRF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/64093139506eeea73f973aee07e3f9ec/ZTvHQrwEUsJIjlvtHztiZ6g0UCH85m_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/b098e03f9c272b78b726f827d2c440c2/ZcXKzw1dgfIfnHaFlCBogSemi3gRyC_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/ee7cacd47c70cec49f345ca16baecdd8/Y5xkElKWgW3k44qZO5BSCnqu6fKHYG_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/aa6732f5649c11b23429c37280a91c06/6ks3CBn0gVkR60pKAQuy3Y0JrMwghK_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/14949519d4ef697f4b757e439e028365/SBqr45UQLhBtTuqrdKLZXOMhaVElXM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/f7b1ce8ef4021bfdce6c4b86f77b185a/v3fOxFyrvjrGkDDtXktcmZTM2CcaU0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/58c740124a15f1c79dd1c68bbedfc605/kT0FaLFVzZ8HUvpqsoKIi98fEdZL8i_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/78a86bc12fbed832f0b341609a22fa52/lun1ptGstEhxOhmc24pORDXEVkN2Ve_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/406c5bfd8089fa0d386cb7bdc5850df9/EdQqw8cU4kk7owg1vm6AUfvvV8Fju8_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/737b414279e7a0df660ac086e87c64ae/kEmt8wcBIShHoV7oEBzALnZY2uYuim_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/1f597146f3b839ef4db1a723d16b0e33/idUm9r72QpdtWLq5FaUa4rXfC68W65_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/a5e3fd6f03567fe6a67f8b3b9eaa867d/mjZx83OMMuc168S7NOnc4FdO4ZI1rL_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/508da13d74af7209190380bec79002ab/KzkULFHCmg1KGSgM6OAm6LQUONSDWJ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/3ab3e39d4fbbf677631b93028f6045ea/uKI5bkkhVMW2pC5keD0U3WaykHKvqW_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/95ebf87290f77e1410d8a873b03dc1b7/0W8nS4IPFCEVH4hoZmxy3fa5RzjrTs_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/0a9d8cafec1031c98436007b77553d6f/8RnJTFRJr7Ho5oNBlduI3C9erCpY1t_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/f5361612515871553628e26305ac00a5/yc3y38kQLI8Mz6XaqTG0KhvdViYpyZ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/677caf3a735a98d9b25dda5866e02425/AviAMLQTTTBrVvmfpiblXCNpkIDwdP_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/bb12c3d02a5fd5ecfeaa59b6c83033b1/Bw7MZjZOFvVBdeObsJ8awPzqsIbH5i_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/ae35595d33e11e2c7c1009d865635cfa/6M5V8AX281t3tnvWTMI3J6hG0Fa33Z_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/ee06eb98a3f4ddfae7322b95916708d6/hIa0cGCGSYfRfpArTPaFotv3ycmxc8_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/81aefdc48e19d12c4fa0664e7f575b59/ZwhEP5AiwjZhejZ0ytK4Ot53TtIMWF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/faae05a1b170149a490b96749d940a7f/BeabnGRtbsbWPi0gQYY7xJClHmZjMg_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/da73b74372f154774aea78cafa6ea7f0/dlUrVNiTxRucktTqhZcCqtPJDakTS6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/3467090198c948e9fe375d217ce8a10f/nsKgsPFKsXYHSq1dakh5q0P2MUwFel_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/efd916dcd675095973e44745477f9b21/xUcTSIdAupkO5KPIrrvCkfJpbLO4jP_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/fddbe4548b28e34ca3d6080678695a35/MjPzEfOGPaJf4LBIPQuCWr1ZVy7zt3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/7cabb0ef9ddb181882c3ea186935cafc/c2bhsytxhguD846PNcIr9m5mvMwcdi_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/9676ccbf8da26b964e0339ec979c1815/krYPZfP7VVBB2wRzYqXJaw2tJtzPR4_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/7577ed3ce6bab96d5193c4b744e0098a/1xw1IyFisGyE1qcyYVvxGWy8nUZv89_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/496b3eb80993646a4767cf95a924582a/RBaE1DWa6XGV5DLWc4PrervcjpdC94_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/b8d150ed20e90cb009b9cd92f6b838a2/OkOUDcVNqdlmsGnZOh3S4rYkZfKhfF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/783be5c175b4ef821d6711057a9150b0/UOWfWBRkOsrUikiEyC54Jaz5tEOKl5_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/597124f416f05a040868b747ef5dbb39/Yg8FHf5KpTrN5RJ01e73Y5kuxWAI3m_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/0c501a3c29a18782d925bfd55fee19c3/feNpxjpTQNQM7gYDZsB0KNItFGQASQ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=gutierrez-alea-tomas
I 2022/06/09 11:03:31 Fulltext indexing: CvhmHm_26NP5 https://www.criterion.com/shop/browse/list?director=gutierrez-alea-tomas
I 2022/06/09 11:03:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CvhmHm_26NP5 (1735154845425336320)]} 0 6
I 2022/06/09 11:03:31 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=gutierrez-alea-tomas [CvhmHm_26NP5]
Description: Tomás Gutiérrez Alea films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12497 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:31 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/1003-the-sounds-of-the-last-emperor
I 2022/06/09 11:03:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CbUx2G_26NP5 (1735154845446307840)]} 0 1
I 2022/06/09 11:03:31 Fulltext indexing: CbUx2G_26NP5 https://www.criterion.com/current/posts/1003-the-sounds-of-the-last-emperor
I 2022/06/09 11:03:31 SWITCHBOARD *Indexed 176 words in URL https://www.criterion.com/current/posts/1003-the-sounds-of-the-last-emperor [CbUx2G_26NP5]
Description: The Sounds of The Last Emperor | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2119 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:31 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/shop/collection/88-america-america/list
I 2022/06/09 11:03:31 Fulltext indexing: CkFvdm_26NP5 https://www.criterion.com/shop/collection/88-america-america/list
I 2022/06/09 11:03:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CkFvdm_26NP5 (1735154845474619392)]} 0 3
I 2022/06/09 11:03:31 SWITCHBOARD *Indexed 457 words in URL https://www.criterion.com/shop/collection/88-america-america/list [CkFvdm_26NP5]
Description: America, America | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4230 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:31 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 447, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:31 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 447, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:31 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=okamoto-kihachi, 226387 bytes
I 2022/06/09 11:03:31 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=okamoto-kihachi, STACKING TIME = 1, PARSING TIME = 46
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/a68a3eb9174b4c4eae09eb80909bfc2a/QSNBDikYheP1Q5VMWv2DjEmrooILwQ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/4b25766f5b49a52427263008b7c8280f/D1oQctY2FyV7C8jLs1Xnr5OrWOFouD_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1881-2e14142b9750bb21d87343a64d586df6/XnAxRRg69cY4XvRmU3g74mQrwW7gE9_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/4d342f93b7630bcae0798f9c95240e05/7HOFeSGbcoM4k98Pl4W27BUAN2uH3b_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 HTCACHE storing content of url https://www.criterion.com/films/3831-the-pearls-of-the-crown, 72280 bytes
I 2022/06/09 11:03:31 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=okamoto-kihachi
I 2022/06/09 11:03:31 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 471, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 9)) = 241
I 2022/06/09 11:03:31 SWITCHBOARD CRAWL: ADDED 60 LINKS FROM https://www.criterion.com/films/3831-the-pearls-of-the-crown, STACKING TIME = 10, PARSING TIME = 13
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/1524-/aAo34zbAGl7GMcJYwIHoiarpIwnoR9_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/the-pearls-of-the-crown?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/efd531e4f833ecf6ed4864afa507c11e/11QuJYuU3oyeIXl1PO6quna7UKCBH2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/a1eaa5dba4cf9838112f9fe70f96143c.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/images/8124-9553df09fe42f024e1191b571cdb3e1f/sacha_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/655106c6e5c5cb1345e565eaf218815e/H76zJ42x9gqvXla7tZzQflVDhzJYJz_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 Fulltext indexing: B7RIwm_26NP5 https://www.criterion.com/shop/browse/list?director=okamoto-kihachi
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/c36e3cd8f2611cc263cee1c634636a94.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/e9ab4ffc13bec4967aaa7d3eee07511b/8xmrLVUekb35HLv4dnVLCJ8vehvltt_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/94cf0c82e7df5bb8c09439168218d976/Ll0cFvXAawJsbIRpykHOXGFgRm4n1c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[B7RIwm_26NP5 (1735154846221205504)]} 0 8
I 2022/06/09 11:03:31 SWITCHBOARD *Indexed 1213 words in URL https://www.criterion.com/shop/browse/list?director=okamoto-kihachi [B7RIwm_26NP5]
Description: Kihachi Okamoto films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12639 bytes |
LinkStorageTime: 19 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:31 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e1acd1238b684c8c3b3a53c55ef97043.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1798-8d9b98e7cf85202e997b51d4ca11eb93/wY65gvPf1AlhK0jQI0nQhMrk94bjn8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e833674cc1d13bdfd087c0e662bf73d3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=clair-rene, 225828 bytes
I 2022/06/09 11:03:31 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/films/3831-the-pearls-of-the-crown
I 2022/06/09 11:03:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[COnuHe_26NP5 (1735154846278877184)]} 0 1
I 2022/06/09 11:03:31 Fulltext indexing: COnuHe_26NP5 https://www.criterion.com/films/3831-the-pearls-of-the-crown
I 2022/06/09 11:03:31 SWITCHBOARD *Indexed 314 words in URL https://www.criterion.com/films/3831-the-pearls-of-the-crown [COnuHe_26NP5]
Description: The Pearls of the Crown (1937) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3374 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:31 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=clair-rene, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:03:31 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/218404ac7edf68ec3ec120877941b76b/5HG0y4GpYvncVrHeqefNSqEglA9uLl_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/5f4fc2e5d3ebab58daaae1dc5d73a1ac/N7iNQnZu33LYmihkfPYU9o0Un9XM6k_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/38de1d92192de3a5d761faf4e6c2ecb7/blafBy2lxntqX5n4hNZ9BGKhVMOFIP_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 REJECTED https://s3.amazonaws.com/criterion-production/films/308caf2319dae6175adc209d54e5caab/XC3YJYaagMcer4KuJo2wO84GLxShsz_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:31 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=clair-rene
I 2022/06/09 11:03:31 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CWm9jm_26NP5 (1735154846379540480)]} 0 2
I 2022/06/09 11:03:31 Fulltext indexing: CWm9jm_26NP5 https://www.criterion.com/shop/browse/list?director=clair-rene
I 2022/06/09 11:03:31 SWITCHBOARD *Indexed 1206 words in URL https://www.criterion.com/shop/browse/list?director=clair-rene [CWm9jm_26NP5]
Description: René Clair films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12624 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:32 HTCACHE storing content of url https://www.criterion.com/films/28311-zatoichi-and-the-chess-expert, 70874 bytes
I 2022/06/09 11:03:32 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/films/28311-zatoichi-and-the-chess-expert, STACKING TIME = 1, PARSING TIME = 9
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/6fede1f031c07b843ffa8965d47043f3/9QWkE37UXlpfhZrTIsaZHdWmooGJ1a_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/554a88f3e461b3af84a8d7c74395c982/ITGgOs4mQf6gDsezMZFForMyh7w2oR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/images/4059-f72dad77dca95364d1d29c5143879ade/zatoichi_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/images/7534-8c4ef090a24dba6f61af4126a673cd93/Current_28304id_013_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://itunes.apple.com/us/movie/id728079508?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/95aa03afbe874b8c4a758fad40b4758e/u5w5NxMwJtRaRc6Bpf6VadqJIB97YE_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/093ecfefeb495a1678abbbf3974746d4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 HostQueue forcing crawl-delay of 232 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 491, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 18)) = 232
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/8fa83fea9e1e866a11e31a00dbe58c97/L5ozj9f0PX4PQ864pbUt7PYTLfOySF_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.amazon.com/dp/B00GX5151Y - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 SWITCHBOARD Excluded 13 words in URL https://www.criterion.com/films/28311-zatoichi-and-the-chess-expert
I 2022/06/09 11:03:32 Fulltext indexing: Bkh5Je_26NP5 https://www.criterion.com/films/28311-zatoichi-and-the-chess-expert
I 2022/06/09 11:03:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Bkh5Je_26NP5 (1735154846509563904)]} 0 1
I 2022/06/09 11:03:32 SWITCHBOARD *Indexed 277 words in URL https://www.criterion.com/films/28311-zatoichi-and-the-chess-expert [Bkh5Je_26NP5]
Description: Zatoichi and the Chess Expert (1965) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2873 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:32 HTCACHE storing content of url https://www.criterion.com/boxsets/487-the-lower-depths, 70074 bytes
I 2022/06/09 11:03:32 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 511, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 10)) = 245
I 2022/06/09 11:03:32 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/boxsets/487-the-lower-depths, STACKING TIME = 2, PARSING TIME = 9
I 2022/06/09 11:03:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/18edf0e45ef525e94a2f216ec8c0ac0c/ssmWoOUy6rJctS1cNG45DNWrlpulXN_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/639c76ad9eed54963b04de2f446d2117.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/9dcecbfcb9d84e037bcdfbcd1e6b3dc9/Y4cPFzSZbE23XAdEGCQ2Ydszik9QwM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1869-e5c85aaf05b37b7eb3c006cf0e4733b6/oc21ci95QjfyEYXMwqohFekz0EFAbu_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e1b3ff6d9d4f53b0b2d47c23ba79eca3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/boxsets/487-the-lower-depths
I 2022/06/09 11:03:32 Fulltext indexing: CHmTj3_26NP5 https://www.criterion.com/boxsets/487-the-lower-depths
I 2022/06/09 11:03:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CHmTj3_26NP5 (1735154846797922304)]} 0 2
I 2022/06/09 11:03:32 SWITCHBOARD *Indexed 278 words in URL https://www.criterion.com/boxsets/487-the-lower-depths [CHmTj3_26NP5]
Description: The Lower Depths | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6656 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:32 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=zetterling-mai, 224307 bytes
I 2022/06/09 11:03:32 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=zetterling-mai, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:03:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 HTCACHE storing content of url https://www.criterion.com/current/author/809-jay-caspian-kang, 49406 bytes
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/809-jay-caspian-kang, STACKING TIME = 9, PARSING TIME = 3
I 2022/06/09 11:03:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=zetterling-mai
I 2022/06/09 11:03:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[CN-2xm_26NP5 (1735154846954160128)]} 0 2
I 2022/06/09 11:03:32 Fulltext indexing: CN-2xm_26NP5 https://www.criterion.com/shop/browse?director=zetterling-mai
I 2022/06/09 11:03:32 SWITCHBOARD *Indexed 1185 words in URL https://www.criterion.com/shop/browse?director=zetterling-mai [CN-2xm_26NP5]
Description: Mai Zetterling films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12523 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:32 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/author/809-jay-caspian-kang
I 2022/06/09 11:03:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BWTOGG_26NP5 (1735154846960451584)]} 0 0
I 2022/06/09 11:03:32 Fulltext indexing: BWTOGG_26NP5 https://www.criterion.com/current/author/809-jay-caspian-kang
I 2022/06/09 11:03:32 SWITCHBOARD *Indexed 121 words in URL https://www.criterion.com/current/author/809-jay-caspian-kang [BWTOGG_26NP5]
Description: Jay Caspian Kang | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1528 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:32 HTCACHE storing content of url https://www.criterion.com/current/posts/5561-in-search-of-ozu, 64802 bytes
I 2022/06/09 11:03:32 SWITCHBOARD CRAWL: ADDED 57 LINKS FROM https://www.criterion.com/current/posts/5561-in-search-of-ozu, STACKING TIME = 1, PARSING TIME = 8
I 2022/06/09 11:03:32 REJECTED https://player.vimeo.com/video/264628606?title=0&byline=0&portrait=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.filmstruck.com/us/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/99b641cebefcafe104ca69e00a87ca70/iIixQEvof48QNdsb65bsUrBM3McAIf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/a8ea8ea6c23887c9172372d22d52a87a/L1PCM8Z0TUWLZ3RLdDMcMOOvKvPGl2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.filmstruck.com/us/watch/bundle/1520001248 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 HostQueue forcing crawl-delay of 256 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 541, robots.delay = 0, ((waitig = 270) - (timeSinceLastAccess = 14)) = 256
I 2022/06/09 11:03:32 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/5561-in-search-of-ozu
I 2022/06/09 11:03:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[B1GDVG_26NP5 (1735154847030706176)]} 0 1
I 2022/06/09 11:03:32 Fulltext indexing: B1GDVG_26NP5 https://www.criterion.com/current/posts/5561-in-search-of-ozu
I 2022/06/09 11:03:32 SWITCHBOARD *Indexed 309 words in URL https://www.criterion.com/current/posts/5561-in-search-of-ozu [B1GDVG_26NP5]
Description: In Search of Ozu | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3783 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:32 HTCACHE storing content of url https://www.criterion.com/current/posts/3495-three-reasons-the-thin-blue-line, 65956 bytes
I 2022/06/09 11:03:32 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/3495-three-reasons-the-thin-blue-line, STACKING TIME = 4, PARSING TIME = 48
I 2022/06/09 11:03:32 REJECTED https://www.youtube.com/embed/MzZROKde8Nc?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/films/cc08b14f22e5cb4d4abc35e2bd1e76eb/duZpzWtzx94ixvPnyavkOi9YOZqWR6_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:32 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/posts/3495-three-reasons-the-thin-blue-line
I 2022/06/09 11:03:32 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BttKHG_26NP5 (1735154847135563776)]} 0 1
I 2022/06/09 11:03:32 Fulltext indexing: BttKHG_26NP5 https://www.criterion.com/current/posts/3495-three-reasons-the-thin-blue-line
I 2022/06/09 11:03:32 SWITCHBOARD *Indexed 219 words in URL https://www.criterion.com/current/posts/3495-three-reasons-the-thin-blue-line [BttKHG_26NP5]
Description: Three Reasons: The Thin Blue Line | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2558 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:32 HostQueue forcing crawl-delay of 265 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 551, robots.delay = 0, ((waitig = 275) - (timeSinceLastAccess = 10)) = 265
I 2022/06/09 11:03:33 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=lindberg-per, 224709 bytes
I 2022/06/09 11:03:33 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=oda-motoyoshi, 224739 bytes
I 2022/06/09 11:03:33 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=lindberg-per, STACKING TIME = 1, PARSING TIME = 49
I 2022/06/09 11:03:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 HostQueue forcing crawl-delay of 266 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 557, robots.delay = 0, ((waitig = 278) - (timeSinceLastAccess = 12)) = 266
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/films/490667c07df5c2a3e3b5e76219af4a20/IZXe9hzAvJvVLk1ZjjptbbtlVLDt8B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=oda-motoyoshi, STACKING TIME = 1, PARSING TIME = 98
I 2022/06/09 11:03:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1953-0d1595bafe2ba6d32c5f5e1bc6215555/bG4INWRMocURqMHZl7ZgyVgw1O47Tx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/films/12777a13abcdaa547d51c2273947ddff/mgBLNMreWObbgdQFJlszuoTPqR8Nay_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=lindberg-per
I 2022/06/09 11:03:33 HTCACHE storing content of url https://www.criterion.com/current/posts/675-early-summer, 101347 bytes
I 2022/06/09 11:03:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Be5Tjm_26NP5 (1735154847851741184)]} 0 3
I 2022/06/09 11:03:33 Fulltext indexing: Be5Tjm_26NP5 https://www.criterion.com/shop/browse?director=lindberg-per
I 2022/06/09 11:03:33 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse?director=lindberg-per [Be5Tjm_26NP5]
Description: Per Lindberg films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12482 bytes |
LinkStorageTime: 18 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:33 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/675-early-summer, STACKING TIME = 6, PARSING TIME = 24
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED http://www.davidbordwell.net/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 HostQueue forcing crawl-delay of 265 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 556, robots.delay = 0, ((waitig = 278) - (timeSinceLastAccess = 13)) = 265
I 2022/06/09 11:03:33 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=oda-motoyoshi
I 2022/06/09 11:03:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BTj1xm_26NP5 (1735154847981764608)]} 0 2
I 2022/06/09 11:03:33 Fulltext indexing: BTj1xm_26NP5 https://www.criterion.com/shop/browse?director=oda-motoyoshi
I 2022/06/09 11:03:33 SWITCHBOARD *Indexed 1203 words in URL https://www.criterion.com/shop/browse?director=oda-motoyoshi [BTj1xm_26NP5]
Description: Motoyoshi Oda films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12505 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:33 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/675-early-summer
I 2022/06/09 11:03:33 Fulltext indexing: BSwCUG_26NP5 https://www.criterion.com/current/posts/675-early-summer
I 2022/06/09 11:03:33 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BSwCUG_26NP5 (1735154848047824896)]} 0 4
I 2022/06/09 11:03:33 SWITCHBOARD *Indexed 742 words in URL https://www.criterion.com/current/posts/675-early-summer [BSwCUG_26NP5]
Description: Early Summer | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 9447 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:33 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:03:33 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 556, robots.delay = 0, ((waitig = 278) - (timeSinceLastAccess = 34)) = 244
I 2022/06/09 11:03:33 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=pietrangeli-antonio, 224204 bytes
I 2022/06/09 11:03:33 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:03:33 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@e5ba99d9[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772539925}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vz(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604529}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w0(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604627}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w1(7.7.3):C16:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, 
os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772608698}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w2(7.7.3):C25:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772613778}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:03:33 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:03:33 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=pietrangeli-antonio, STACKING TIME = 2, PARSING TIME = 35
I 2022/06/09 11:03:33 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 REJECTED https://s3.amazonaws.com/criterion-production/films/62e398f79d4f817613884e489db8046d/IE5uw8MoxPs6597htd7IqOPG2PjoLw_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:33 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=harvey-herk, 224164 bytes
I 2022/06/09 11:03:33 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=pietrangeli-antonio
I 2022/06/09 11:03:33 Fulltext indexing: BNqhtm_26NP5 https://www.criterion.com/shop/browse/list?director=pietrangeli-antonio
I 2022/06/09 11:03:34 HostQueue forcing crawl-delay of 266 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 555, robots.delay = 0, ((waitig = 277) - (timeSinceLastAccess = 11)) = 266
I 2022/06/09 11:03:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BNqhtm_26NP5 (1735154848485081088)]} 0 15
I 2022/06/09 11:03:34 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=pietrangeli-antonio [BNqhtm_26NP5]
Description: Antonio Pietrangeli films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12481 bytes |
LinkStorageTime: 79 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:34 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=harvey-herk, STACKING TIME = 6, PARSING TIME = 105
I 2022/06/09 11:03:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/51353d1bb3a1bb83a7f9f1b9f1844272/IGXFgI5PulNUIH2YZNE3IMg44hLAlZ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 HTCACHE storing content of url https://www.criterion.com/films/27795-total-balalaika-show, 77327 bytes
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/05c5af1846f326e136061c83fef2dca7/hB0X9GTfbMFvgc9rud2B9zxfQgb9p0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/8a85c0d507e555f8712f0c586f19289b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/93d3c285b01ddca7fdfc67da336c1192.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 SWITCHBOARD CRAWL: ADDED 74 LINKS FROM https://www.criterion.com/films/27795-total-balalaika-show, STACKING TIME = 5, PARSING TIME = 18
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/ecf3feaffeb272d4813d78b9a9df9b3c/x6inKLlvsx0ZpS2F8SDRT52VJjeqyK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/bb7a6b8c82556fa885e56f7977c1b52d/nEZYgB9dOznIjVMhAPzZcTqbTrqgZL_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/1bc8a77a9b2f7a58e889e08691f79d38/Y5ncLoucjsqMLboktsAudYE5HuJ5Eo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/f8bf41c3e8d2266f423881ceb3159429/58bZDer5maXJjg6GDgD8Tyrr6ZZAuT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/total-balalaika-show?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1805-20ea90ca5dd4192605c8592660180de0/mD3Hcvh7QvvNLghVU3Ba1T74LZ4YRB_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/images/3766-ff6aa079e27077432f53409f08cb425d/kaurismakicowboys_1484_006_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/3ed41f79deffcb3052099b02c9660e9b/zQOZgJoUsBEgi8arpM5w8aT22vGo6W_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/f6129c0a1ecee18ed998a3dbe42076a0/wTsLY6lBbjYapH6WoRr6PlVXIDc2Nz_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/dc8cf509446d1038b080ab3cb277c47b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/559-ccfb77f3950996a20e2ad32ad9066a0d/UiCzPtNaNpkbBnI6lC3l8ErTM2FwAf_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED http://janusfilms.com/lehavre - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/c090fb506860818a4f8693732acfd341.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/727bd0170b8dca242c2454a8b3b5e90e/VDcUDb6l8jh3wbelwVZdYDnkGEoumV_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/fdfbd230ca10df2fde7939762a75725c/hpudPXhRIeQlGJOhIPkDieYc1hfELO_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/1944-/m9WY253FRV16J8S15CDXMwajbtiT6A_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=harvey-herk
I 2022/06/09 11:03:34 Fulltext indexing: BDXism_26NP5 https://www.criterion.com/shop/browse/list?director=harvey-herk
I 2022/06/09 11:03:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BDXism_26NP5 (1735154848627687424)]} 0 2
I 2022/06/09 11:03:34 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=harvey-herk [BDXism_26NP5]
Description: Herk Harvey films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12471 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:34 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/films/27795-total-balalaika-show
I 2022/06/09 11:03:34 Fulltext indexing: BDObCe_26NP5 https://www.criterion.com/films/27795-total-balalaika-show
I 2022/06/09 11:03:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BDObCe_26NP5 (1735154848650756096)]} 0 2
I 2022/06/09 11:03:34 SWITCHBOARD *Indexed 425 words in URL https://www.criterion.com/films/27795-total-balalaika-show [BDObCe_26NP5]
Description: Total Balalaika Show (1994) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4726 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:34 HTCACHE storing content of url https://www.criterion.com/current/author/280-audie-bock, 58792 bytes
I 2022/06/09 11:03:34 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/current/author/280-audie-bock, STACKING TIME = 1, PARSING TIME = 4
I 2022/06/09 11:03:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/current/author/280-audie-bock
I 2022/06/09 11:03:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[A-IGkG_26NP5 (1735154848730447872)]} 0 1
I 2022/06/09 11:03:34 Fulltext indexing: A-IGkG_26NP5 https://www.criterion.com/current/author/280-audie-bock
I 2022/06/09 11:03:34 SWITCHBOARD *Indexed 266 words in URL https://www.criterion.com/current/author/280-audie-bock [A-IGkG_26NP5]
Description: Audie Bock | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3962 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:34 HostQueue forcing crawl-delay of 259 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 547, robots.delay = 0, ((waitig = 273) - (timeSinceLastAccess = 14)) = 259
I 2022/06/09 11:03:34 HostQueue forcing crawl-delay of 256 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 547, robots.delay = 0, ((waitig = 273) - (timeSinceLastAccess = 17)) = 256
I 2022/06/09 11:03:34 HTCACHE storing content of url https://www.criterion.com/films/18790, 74101 bytes
I 2022/06/09 11:03:34 SWITCHBOARD CRAWL: ADDED 69 LINKS FROM https://www.criterion.com/films/18790, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/1539-/jgdaeCEarNDG2GZvfUrUwN1vQDE6m4_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5e6aa22b24993346f75043b1d4d6d3e5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/21d6c647d6b04343bcb9a475c8bf161e/mCz7xlfXoonwsP48ZmChNuVOxzzHRT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/a566ad3ebd078fab9a7d5e9f38626175/IgwwjTBIA6xgGaE0g7KLJ1duHOzP47_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/170a20632c3aaea17daab26fe62ae0b4/KEnmrGEhKGIUoIrMqG2vVjSk0nkr1h_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1861-90438cf50ad15d3b8cdd632b9049c397/ZZY6GTioh4PUuW60COYrjYinoS4Hxx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/64-57d8407a56d539102cbf0ac7447b9cf1/QeUKe3B05Cr992HdqhMTzrsiRoaANx_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/192e9b31938fde709293949e1f8dbaed/bMI9dd5UlAm2oxcALuRFqMPNoysrzR_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/f43c8ae04f032cb63cb19009d307c5e6/8Fri0sBQEIHHHZ7FmnW5IrGGp5YB3z_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1799-fcaac95edb35eab1748fd0aa854780a9/uv1on3L75tq2TTgNYXA1xJJMo4tbSi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/8a0020e8eb3fe98d3908d89e788a06c6.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/48-c60c3029b09059cc159990d75bc5f43a/oLRKVBiIlV0WRkQXQkz0IhVhDBO5HV_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/the-most-beautiful?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/a2a79f79f7fe3eba194e592b1a92e19f.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/58e97ae4dd0b361bd131525faf284b78/K10x3FugjRS01iYKEXsLTr0OlDQk4P_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/5af42201428bc14061890e527459f9b3/5p0q5rUeJNtfNhq2clz4MoYx7nkIO4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/films/18790
I 2022/06/09 11:03:34 Fulltext indexing: A9D8te_26NP5 https://www.criterion.com/films/18790
I 2022/06/09 11:03:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[A9D8te_26NP5 (1735154849286193152)]} 0 2
I 2022/06/09 11:03:34 SWITCHBOARD *Indexed 365 words in URL https://www.criterion.com/films/18790 [A9D8te_26NP5]
Description: The Most Beautiful (1944) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4199 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:34 HTCACHE storing content of url https://www.criterion.com/films/28410-dry-summer, 72809 bytes
I 2022/06/09 11:03:34 SWITCHBOARD CRAWL: ADDED 57 LINKS FROM https://www.criterion.com/films/28410-dry-summer, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/dry-summer?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/84c3f8629cdce14f0862f0d33c39072b/3CZmgSEceYo5JIQEhtNqpxIMiCeKJP_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1976-ece4132f4abef8c4e7beb0a0edffc9a8/y26UyQwNxt4FguJSgQIZWpCNlLsjHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/images/4064-1ce618fd316bc27a23c930b2ed327ff0/current_toukiboukifg_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/d0f987c18dcf99b7aec3ff0e08890355.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/images/4068-a9562cb7b943f916ba336533dff257b0/drysummer_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/2246578d818804de9a010ec2f6761940/LS7nGzD3CGCHvbfnW7CFToCn83HJRf_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/c3671b7d05dd992c80de898da6f724a8/iKAAnLTwUhBFo0X62zBb8ijm258Sey_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://s3.amazonaws.com/criterion-production/films/49e3fce60e807242669361b7f18cfaf4/wmTcBChKCL1gZNBagJAHXx22sGKgKw_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:34 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/films/28410-dry-summer
I 2022/06/09 11:03:34 Fulltext indexing: A8FCme_26NP5 https://www.criterion.com/films/28410-dry-summer
I 2022/06/09 11:03:34 HostQueue forcing crawl-delay of 258 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 541, robots.delay = 0, ((waitig = 270) - (timeSinceLastAccess = 12)) = 258
I 2022/06/09 11:03:34 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[A8FCme_26NP5 (1735154849365884928)]} 0 2
I 2022/06/09 11:03:34 SWITCHBOARD *Indexed 310 words in URL https://www.criterion.com/films/28410-dry-summer [A8FCme_26NP5]
Description: Dry Summer (1964) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3377 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:35 HostQueue forcing crawl-delay of 260 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 541, robots.delay = 0, ((waitig = 270) - (timeSinceLastAccess = 11)) = 259
I 2022/06/09 11:03:35 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=robinson-bruce, 224826 bytes
I 2022/06/09 11:03:35 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=robinson-bruce, STACKING TIME = 2, PARSING TIME = 22
I 2022/06/09 11:03:35 REJECTED https://s3.amazonaws.com/criterion-production/films/979e33ba0898b1efdb6edc222952fb46/7n9ZlJZc3CMZ8Mi0VjmApO7EHyXcU5_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://s3.amazonaws.com/criterion-production/films/5716fff29aa63160f69f236998a86920/yxrkSh76S9jOMNCqNuz6shfs83VGUS_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 HostQueue forcing crawl-delay of 254 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 540, robots.delay = 0, ((waitig = 270) - (timeSinceLastAccess = 16)) = 254
I 2022/06/09 11:03:35 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=robinson-bruce
I 2022/06/09 11:03:35 Fulltext indexing: A4_5fm_26NP5 https://www.criterion.com/shop/browse/list?director=robinson-bruce
I 2022/06/09 11:03:35 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[A4_5fm_26NP5 (1735154849984544768)]} 0 2
I 2022/06/09 11:03:35 SWITCHBOARD *Indexed 1198 words in URL https://www.criterion.com/shop/browse/list?director=robinson-bruce [A4_5fm_26NP5]
Description: Bruce Robinson films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12572 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:35 HostQueue forcing crawl-delay of 260 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 540, robots.delay = 0, ((waitig = 270) - (timeSinceLastAccess = 10)) = 260
I 2022/06/09 11:03:35 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=daves-delmer, 224708 bytes
I 2022/06/09 11:03:35 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=g-alejandro, 224233 bytes
I 2022/06/09 11:03:35 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=daves-delmer, STACKING TIME = 1, PARSING TIME = 117
I 2022/06/09 11:03:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://s3.amazonaws.com/criterion-production/films/ab1d9b806f1efe28878526241f632b9a/Zm61wXSmEUBWzzoMzmJOGYPvQ8Cg5v_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://s3.amazonaws.com/criterion-production/films/95a86570565f93d4441bb107100f4bd6/dDeQTXWZt5xPqkN4JlGErbbrORF7zI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 HostQueue forcing crawl-delay of 259 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 540, robots.delay = 0, ((waitig = 270) - (timeSinceLastAccess = 11)) = 259
I 2022/06/09 11:03:35 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=g-alejandro, STACKING TIME = 5, PARSING TIME = 47
I 2022/06/09 11:03:35 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://s3.amazonaws.com/criterion-production/films/4183334bba36a2aa385a8c7d3915e524/mp16EB371UByNpzZ8trEQghLx1EiZv_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:35 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=daves-delmer
I 2022/06/09 11:03:35 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AyLeRm_26NP5 (1735154850582233088)]} 0 2
I 2022/06/09 11:03:35 Fulltext indexing: AyLeRm_26NP5 https://www.criterion.com/shop/browse/list?director=daves-delmer
I 2022/06/09 11:03:35 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=daves-delmer [AyLeRm_26NP5]
Description: Delmer Daves films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12541 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:36 HTCACHE storing content of url https://www.criterion.com/boxsets/776-eclipse-series-25-basil-dearden-s-london-underground, 70886 bytes
I 2022/06/09 11:03:36 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/boxsets/776-eclipse-series-25-basil-dearden-s-london-underground, STACKING TIME = 1, PARSING TIME = 12
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/47895a8d9f9825f27dd5e5b436e628f3/U7eXTA8UQTq8hDiPiZwdl6RZ1mDTfh_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/1f5b233ba37803f74eb2167af55dc09a/9345reYU8uP8BxYacHK3UqP3O4G7Yw_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f706ce2a0bafe7ece26e5840e3a867f7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/dbf9889ad5057bc17dc11a1919858c20/i94IOMepX6MIU71e3PeoPBePD1vPAq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/a7b467da0d8abcbae1ebfc27812e8d17.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/f9f40f91716edad79af50b28c2faf086.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/19b5cd82f60dde2929e6d55373478be8.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1801-f20f53bb95aa5f59c515e4ee49da6b9a/6qXfc82MIMrd3BPizcOOdvAgk7LkrV_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/67a6541aeef54defbf3f02417c245d6b/dPnyXu4t7cVJv5uTdTLwRc3u4WD4Uz_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=g-alejandro
I 2022/06/09 11:03:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Aw9Fym_26NP5 (1735154850754199552)]} 0 1
I 2022/06/09 11:03:36 Fulltext indexing: Aw9Fym_26NP5 https://www.criterion.com/shop/browse/list?director=g-alejandro
I 2022/06/09 11:03:36 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=g-alejandro [Aw9Fym_26NP5]
Description: Alejandro G. Iñárritu films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12503 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:36 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/boxsets/776-eclipse-series-25-basil-dearden-s-london-underground
I 2022/06/09 11:03:36 HostQueue forcing crawl-delay of 257 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 535, robots.delay = 0, ((waitig = 267) - (timeSinceLastAccess = 10)) = 257
I 2022/06/09 11:03:36 Fulltext indexing: ArkIw3_26NP5 https://www.criterion.com/boxsets/776-eclipse-series-25-basil-dearden-s-london-underground
I 2022/06/09 11:03:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ArkIw3_26NP5 (1735154850789851136)]} 0 2
I 2022/06/09 11:03:36 SWITCHBOARD *Indexed 365 words in URL https://www.criterion.com/boxsets/776-eclipse-series-25-basil-dearden-s-london-underground [ArkIw3_26NP5]
Description: Eclipse Series 25: Basil Dearden’s London Underground | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5992 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:36 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=hughes-allen, 224187 bytes
I 2022/06/09 11:03:36 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=hughes-allen, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/75010780ef0a9ccbdad848daa6019c69/xwHKNInOOeEk23oh2D3AWtoc9nMZRb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=hughes-allen
I 2022/06/09 11:03:36 Fulltext indexing: Ar-GLm_26NP5 https://www.criterion.com/shop/browse/list?director=hughes-allen
I 2022/06/09 11:03:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Ar-GLm_26NP5 (1735154850924068864)]} 0 2
I 2022/06/09 11:03:36 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=hughes-allen [Ar-GLm_26NP5]
Description: Allen Hughes films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12480 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:36 HTCACHE storing content of url https://www.criterion.com/current/posts/5429-the-darkness-of-war-in-wooden-crosses, 68083 bytes
I 2022/06/09 11:03:36 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/5429-the-darkness-of-war-in-wooden-crosses, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:03:36 REJECTED https://www.filmstruck.com/us/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.filmstruck.com/us/watch/bundle/1520001209 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://player.vimeo.com/video/257548137 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/119cf6abd03957cb617d43f1ba762223/dgadXeBrbXuAf3cGdp9lrLVymRuiSd_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 HostQueue forcing crawl-delay of 251 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 532, robots.delay = 0, ((waitig = 266) - (timeSinceLastAccess = 15)) = 251
I 2022/06/09 11:03:36 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/current/posts/5429-the-darkness-of-war-in-wooden-crosses
I 2022/06/09 11:03:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AlAdYG_26NP5 (1735154851089743872)]} 0 1
I 2022/06/09 11:03:36 Fulltext indexing: AlAdYG_26NP5 https://www.criterion.com/current/posts/5429-the-darkness-of-war-in-wooden-crosses
I 2022/06/09 11:03:36 SWITCHBOARD *Indexed 273 words in URL https://www.criterion.com/current/posts/5429-the-darkness-of-war-in-wooden-crosses [AlAdYG_26NP5]
Description: The Darkness of War in Wooden Crosses | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3478 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:36 HTCACHE storing content of url https://www.criterion.com/films/28306-zatoichi-s-flashing-sword, 70986 bytes
I 2022/06/09 11:03:36 HostQueue forcing crawl-delay of 259 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 529, robots.delay = 0, ((waitig = 264) - (timeSinceLastAccess = 6)) = 258
I 2022/06/09 11:03:36 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/films/28306-zatoichi-s-flashing-sword, STACKING TIME = 1, PARSING TIME = 10
I 2022/06/09 11:03:36 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/1cf96ec4973659b9df05f3f2d8f6e8a7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.amazon.com/dp/B004D1F1W4 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/images/4059-f72dad77dca95364d1d29c5143879ade/zatoichi_current_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/images/7534-8c4ef090a24dba6f61af4126a673cd93/Current_28304id_013_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://itunes.apple.com/us/movie/id724049363?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/06aae5d548f169e22473a1560b8af40b/C9uzPZ2an3M9AoDXjTI7aMH8KBOYDU_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/04e10116294528ec2da38cb6c0a5b044/3pKnBkuWZlWe0VWIvtR6D92XRuWPU2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/dd560c50567da0e7848c98570c7eaed8/Vrl4qmICJyQhToXmjIGg0AL98jK6QM_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 REJECTED https://s3.amazonaws.com/criterion-production/films/e3a8b87cb8cbc3285b3eb00da28a693a/dyeVpIcW7gE7OWYsi3kOh9W2fHMXIs_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:36 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/films/28306-zatoichi-s-flashing-sword
I 2022/06/09 11:03:36 Fulltext indexing: AjKK4e_26NP5 https://www.criterion.com/films/28306-zatoichi-s-flashing-sword
I 2022/06/09 11:03:36 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AjKK4e_26NP5 (1735154851394879488)]} 0 3
I 2022/06/09 11:03:36 SWITCHBOARD *Indexed 281 words in URL https://www.criterion.com/films/28306-zatoichi-s-flashing-sword [AjKK4e_26NP5]
Description: Zatoichi’s Flashing Sword (1964) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2877 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:36 HTCACHE storing content of url https://www.criterion.com/current/posts/2314-summer-with-monika-summer-dreaming, 79970 bytes
I 2022/06/09 11:03:36 HostQueue forcing crawl-delay of 261 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 526, robots.delay = 0, ((waitig = 263) - (timeSinceLastAccess = 2)) = 261
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/06ca64e64f53a32a330eb79247c5dd6b/hpbmGuJ0K6p12f6Y2bcZSk0BaKAUs1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 SWITCHBOARD CRAWL: ADDED 56 LINKS FROM https://www.criterion.com/current/posts/2314-summer-with-monika-summer-dreaming, STACKING TIME = 4, PARSING TIME = 8
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/images/4924-4955207a3083f6cddcb8c737c6ea62a3/current_394_060_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/2314-summer-with-monika-summer-dreaming
I 2022/06/09 11:03:37 Fulltext indexing: Ae8IxG_26NP5 https://www.criterion.com/current/posts/2314-summer-with-monika-summer-dreaming
I 2022/06/09 11:03:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Ae8IxG_26NP5 (1735154851749298176)]} 0 3
I 2022/06/09 11:03:37 SWITCHBOARD *Indexed 942 words in URL https://www.criterion.com/current/posts/2314-summer-with-monika-summer-dreaming [Ae8IxG_26NP5]
Description: Summer with Monika: Summer Dreaming | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14727 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:37 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 526, robots.delay = 0, ((waitig = 263) - (timeSinceLastAccess = 14)) = 249
I 2022/06/09 11:03:37 HTCACHE storing content of url https://www.criterion.com/films/218, 94881 bytes
I 2022/06/09 11:03:37 SWITCHBOARD CRAWL: ADDED 87 LINKS FROM https://www.criterion.com/films/218, STACKING TIME = 2, PARSING TIME = 88
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/posts/3048-206945a9d9280c8d7f6c0c6da7897294/Jules_Jim_3r_Feature_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/3b542aa4345014825677034e88a86e6d/n8W6S9aQt25iXGj6YmmCmbuqM3Pz1k_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.amazon.com/dp/B004D376RA - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/1171-f203339ee3e385e0b4ca5c06493bc15e/GEmaOGGxbmOk1JqIrI6nv5xO1IlpoK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/posts/3051-d8d84455c211bc356a564a780f6ea619/FRANCOIS_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/98a574fceaf7129521ca226ad6781179/0r8k1qr8ovKte9kj3fQR1ji3nXxYG6_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/3131ec3285f60f1acc9c90c8b42a1d5f/RofcqCXxGhvqtfNczxIk9a9ijuZSCI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/98a574fceaf7129521ca226ad6781179/0r8k1qr8ovKte9kj3fQR1ji3nXxYG6_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/bfc8b1f1add9c0479f15115114ca1bca/mPun3IS9hLJyKvz5XEMII63cGgWfUo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/88-f80e6163654229c9479d58b818d2ba7b/C5nTskqzT7oQYGUHFFFOopLwIaABDN_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/jules-and-jim?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/images/7779-e3a680779cc510eda911dc18874c8385/Current_268id_078_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://itunes.apple.com/us/movie/id590835029?at=10layR - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7744-/3t1gZA95qhyh4URI6lfaq5raIcUvxV_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/533a7a11e79b504496c8d89ff63d0712/dFTLRh4U6evoXeAnqd56rWH3nCUx32_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/images/3648-6383b98d5c31119045013207a89b18c2/current_698_201_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/62-f80e6163654229c9479d58b818d2ba7b/5c4wBF6FZO4bp4iYRZJtfqX2PbgN4t_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/6419fdfbf3442b70b449e2780ae6ac04/Jli0DzbsLXV9c4V1sPirW4n2Lw5FR8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 HTCACHE storing content of url https://www.criterion.com/current/posts/1334-tv-on-the-radio, 70195 bytes
I 2022/06/09 11:03:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/e2503ca1f726cccd07242a6f9b09d465/7FscZpQOPu1MA6NZgaw4USztuWuS2n_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/1667b51f99cf2f225e99c56c72726fa3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/242af177ce3fb237f966b86ff2fa6d7e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/carousel-files/998ced07f289c248e0e45ee8fa284d93.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6556-/7Lo8IrNQB92tB5vJ9lBgrQYchbZIQZ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/1334-tv-on-the-radio, STACKING TIME = 6, PARSING TIME = 11
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/9fbc0bbe06284eca9d7de4fcf1158ed7/P38oZJKFuJuNUi7ZRNaHJIH6WkAKQZ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 521, robots.delay = 0, ((waitig = 260) - (timeSinceLastAccess = 14)) = 246
I 2022/06/09 11:03:37 REJECTED https://www.wnyc.org/shows/lopate/episodes/2009/12/17/segments/146395 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 SWITCHBOARD Excluded 23 words in URL https://www.criterion.com/films/218
I 2022/06/09 11:03:37 Fulltext indexing: Abkeme_26NP5 https://www.criterion.com/films/218
I 2022/06/09 11:03:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Abkeme_26NP5 (1735154852241080320)]} 0 6
I 2022/06/09 11:03:37 SWITCHBOARD *Indexed 566 words in URL https://www.criterion.com/films/218 [Abkeme_26NP5]
Description: Jules and Jim (1962) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 8395 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:37 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/current/posts/1334-tv-on-the-radio
I 2022/06/09 11:03:37 Fulltext indexing: AYxMYG_26NP5 https://www.criterion.com/current/posts/1334-tv-on-the-radio
I 2022/06/09 11:03:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AYxMYG_26NP5 (1735154852258906112)]} 0 1
I 2022/06/09 11:03:37 SWITCHBOARD *Indexed 288 words in URL https://www.criterion.com/current/posts/1334-tv-on-the-radio [AYxMYG_26NP5]
Description: TV on the Radio | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3207 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:37 HTCACHE storing content of url https://www.criterion.com/current/posts/4694-scoring-silent-hitchcock, 67196 bytes
I 2022/06/09 11:03:37 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/4694-scoring-silent-hitchcock, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/images/8558-d38a5109c61e9513c05b725c02f249e4/lodger2_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.youtube.com/embed/m2rwxdUKt68 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/films/6d943264d831d3159db1baf3a2c2b230/7B0Jv1Tlc6O5qskDxKzmjmtnUu4gfA_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:37 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 517, robots.delay = 0, ((waitig = 258) - (timeSinceLastAccess = 12)) = 246
I 2022/06/09 11:03:37 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/4694-scoring-silent-hitchcock
I 2022/06/09 11:03:37 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AVr1hG_26NP5 (1735154852500078592)]} 0 1
I 2022/06/09 11:03:37 Fulltext indexing: AVr1hG_26NP5 https://www.criterion.com/current/posts/4694-scoring-silent-hitchcock
I 2022/06/09 11:03:37 SWITCHBOARD *Indexed 312 words in URL https://www.criterion.com/current/posts/4694-scoring-silent-hitchcock [AVr1hG_26NP5]
Description: Scoring Silent Hitchcock | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3491 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:38 HTCACHE storing content of url https://www.criterion.com/current/posts/6589-cluny-brown-the-joys-of-plumbing, 82438 bytes
I 2022/06/09 11:03:38 SWITCHBOARD CRAWL: ADDED 58 LINKS FROM https://www.criterion.com/current/posts/6589-cluny-brown-the-joys-of-plumbing, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/films/70e6c5bc30a6ed381e5d4d0683bec561/6bFWIQhM625dDBlb8pwoRt3svKDhP7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://criterion-production.s3.amazonaws.com/7ZJyB6W7a68LYSPObKq7skmISdXC5Y.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://criterion-production.s3.amazonaws.com/CbmDesOBMoc8ZD27QFddOyKYR9oLWq.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6589-/izRUQqIou1oYxRqasmHa40JjZHtpYN_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 514, robots.delay = 0, ((waitig = 257) - (timeSinceLastAccess = 12)) = 245
I 2022/06/09 11:03:38 HTCACHE storing content of url https://www.criterion.com/current/author/302-adrian-danks, 48669 bytes
I 2022/06/09 11:03:38 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 11)) = 244
I 2022/06/09 11:03:38 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/302-adrian-danks, STACKING TIME = 11, PARSING TIME = 205
I 2022/06/09 11:03:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/6589-cluny-brown-the-joys-of-plumbing
I 2022/06/09 11:03:38 Fulltext indexing: AUacTG_26NP5 https://www.criterion.com/current/posts/6589-cluny-brown-the-joys-of-plumbing
I 2022/06/09 11:03:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AUacTG_26NP5 (1735154853166972928)]} 0 3
I 2022/06/09 11:03:38 SWITCHBOARD *Indexed 1087 words in URL https://www.criterion.com/current/posts/6589-cluny-brown-the-joys-of-plumbing [AUacTG_26NP5]
Description: Cluny Brown: The Joys of Plumbing | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 15460 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:38 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/current/author/302-adrian-danks
I 2022/06/09 11:03:38 Fulltext indexing: AQwQcG_26NP5 https://www.criterion.com/current/author/302-adrian-danks
I 2022/06/09 11:03:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AQwQcG_26NP5 (1735154853173264384)]} 0 1
I 2022/06/09 11:03:38 SWITCHBOARD *Indexed 119 words in URL https://www.criterion.com/current/author/302-adrian-danks [AQwQcG_26NP5]
Description: Adrian Danks | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1434 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:38 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 14)) = 241
I 2022/06/09 11:03:38 HTCACHE storing content of url https://www.criterion.com/current/posts/2-coup-de-torchon, 90192 bytes
I 2022/06/09 11:03:38 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/2-coup-de-torchon, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:38 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 506, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 12)) = 241
I 2022/06/09 11:03:38 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/2-coup-de-torchon
I 2022/06/09 11:03:38 Fulltext indexing: AKdVLG_26NP5 https://www.criterion.com/current/posts/2-coup-de-torchon
I 2022/06/09 11:03:38 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AKdVLG_26NP5 (1735154853624152064)]} 0 3
I 2022/06/09 11:03:38 SWITCHBOARD *Indexed 578 words in URL https://www.criterion.com/current/posts/2-coup-de-torchon [AKdVLG_26NP5]
Description: Coup de torchon | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7147 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:38 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:03:39 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:03:39 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@f72c0480[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772539925}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vz(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604529}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w0(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604627}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w1(7.7.3):C16:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, 
os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772608698}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w2(7.7.3):C25:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772613778}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w3(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772619037}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:03:39 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:03:39 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ray-nicholas, 225292 bytes
I 2022/06/09 11:03:39 HostQueue forcing crawl-delay of 194 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 509, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 61)) = 193
I 2022/06/09 11:03:39 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=ray-nicholas, STACKING TIME = 6, PARSING TIME = 86
I 2022/06/09 11:03:39 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/df93d59f964f578be1c4ed9db26df611/LHDGeFoUAUtHamDnXwv012TKgg62mL_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/7b61cc952472acaf45e25d18ecda3a6c/AvkrECEvwDaYkrM6obhacxPy1j0Nwy_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/d4d406ae48cd0fbfc78216e2efb5bf43/NEeX7phTkN3xjrd1deXb0RmokIXDG3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=ray-nicholas
I 2022/06/09 11:03:39 Fulltext indexing: AOFqZm_26NP5 https://www.criterion.com/shop/browse/list?director=ray-nicholas
I 2022/06/09 11:03:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AOFqZm_26NP5 (1735154854025756672)]} 0 4
I 2022/06/09 11:03:39 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=ray-nicholas [AOFqZm_26NP5]
Description: Nicholas Ray films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12587 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:39 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 509, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 10)) = 244
I 2022/06/09 11:03:39 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=peckinpah-sam, 224125 bytes
I 2022/06/09 11:03:39 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=peckinpah-sam, STACKING TIME = 5, PARSING TIME = 25
I 2022/06/09 11:03:39 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/beca22ceb97188c09bd8ef6090c6bca7/XQmMDjG2VLEmmHY6Y6TslMJBOi1w2p_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:03:39 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:03:39 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 10)) = 245
I 2022/06/09 11:03:39 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=peckinpah-sam
I 2022/06/09 11:03:39 Fulltext indexing: AI1OVm_26NP5 https://www.criterion.com/shop/browse?director=peckinpah-sam
I 2022/06/09 11:03:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AI1OVm_26NP5 (1735154854423166976)]} 0 5
I 2022/06/09 11:03:39 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse?director=peckinpah-sam [AI1OVm_26NP5]
Description: Sam Peckinpah films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12412 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:39 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:03:39 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=fukuda-jun, 226493 bytes
I 2022/06/09 11:03:39 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/shop/browse?director=fukuda-jun, STACKING TIME = 1, PARSING TIME = 22
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/2e9d099057c503535d8c9314a6a56293/YCZYqbdZEnOETDz9YttzHFy11p3gH6_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/7cf358ae483b929adbb8016808ede9ea/fy6RjXMYWsx6hrOnVfu8CH09U1IWa5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/7097d0cec67e4882cb538b53d8b72ec0/5Rx66b23nRm2oZXgaE98MZKJODK1Vo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/f311e3fffb5e84072caffb39d66b32da/t97YUGHe6GAj7GsXWJ0fIl99zhrgWb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1953-0d1595bafe2ba6d32c5f5e1bc6215555/bG4INWRMocURqMHZl7ZgyVgw1O47Tx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://s3.amazonaws.com/criterion-production/films/ddbb0db3267c57bf94cf0dc41f9c1e87/fJJtBeYp3LO4WrD9clkj4lAn0IzGG6_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:39 HostQueue forcing crawl-delay of 227 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 512, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 29)) = 227
I 2022/06/09 11:03:39 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=fukuda-jun
I 2022/06/09 11:03:39 Fulltext indexing: AIuPxm_26NP5 https://www.criterion.com/shop/browse?director=fukuda-jun
I 2022/06/09 11:03:39 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AIuPxm_26NP5 (1735154854715719680)]} 0 3
I 2022/06/09 11:03:39 SWITCHBOARD *Indexed 1212 words in URL https://www.criterion.com/shop/browse?director=fukuda-jun [AIuPxm_26NP5]
Description: Jun Fukuda films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12605 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:39 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=arnold-jack, 224147 bytes
I 2022/06/09 11:03:40 HTCACHE storing content of url https://www.criterion.com/current/posts/2522-the-golden-age-of-sonny-fox, 76069 bytes
I 2022/06/09 11:03:40 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=arnold-jack, STACKING TIME = 1, PARSING TIME = 94
I 2022/06/09 11:03:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/38afc8a08b5c31eabd9e75a4cd1cee08/fdD4Lk4KeFdh5gGU3xwYImxc9kzoO6_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/current/posts/2522-the-golden-age-of-sonny-fox, STACKING TIME = 5, PARSING TIME = 15
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/9fbc0bbe06284eca9d7de4fcf1158ed7/P38oZJKFuJuNUi7ZRNaHJIH6WkAKQZ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED http://www.myfoxny.com/story/19646059/sonny-fox - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED http://www.westport-news.com/news/article/A-Sonny-walk-down-memory-lane-for-Fox-s-3889579.php - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED http://www.emmys.tv/events/2012/10/fox-tales-sonny-fox - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED http://www.paleycenter.org/2012-fall-sonny-fox - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED http://sonnyfoxtv.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.youtube.com/embed/_Kt-SzifHeY?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/images/5013-792dc67de674d5d474d51e4da0afefaf/GATV_CURRENT_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 512, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 12)) = 244
I 2022/06/09 11:03:40 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=arnold-jack
I 2022/06/09 11:03:40 Fulltext indexing: AF1yEm_26NP5 https://www.criterion.com/shop/browse?director=arnold-jack
I 2022/06/09 11:03:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AF1yEm_26NP5 (1735154854964232192)]} 0 2
I 2022/06/09 11:03:40 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse?director=arnold-jack [AF1yEm_26NP5]
Description: Jack Arnold films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12397 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:40 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/2522-the-golden-age-of-sonny-fox
I 2022/06/09 11:03:40 Fulltext indexing: AEzf9G_26NP5 https://www.criterion.com/current/posts/2522-the-golden-age-of-sonny-fox
I 2022/06/09 11:03:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AEzf9G_26NP5 (1735154854993592320)]} 0 2
I 2022/06/09 11:03:40 SWITCHBOARD *Indexed 373 words in URL https://www.criterion.com/current/posts/2522-the-golden-age-of-sonny-fox [AEzf9G_26NP5]
Description: The Golden Age of Sonny Fox | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4178 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:40 HTCACHE storing content of url https://www.criterion.com/films/27674-the-moment-of-truth, 80492 bytes
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/the-moment-of-truth?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 SWITCHBOARD CRAWL: ADDED 64 LINKS FROM https://www.criterion.com/films/27674-the-moment-of-truth, STACKING TIME = 4, PARSING TIME = 8
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/images/8506-2c8496e99ec5ab870f0acfdac1d7ad23/rodeo3_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9bb932c27a6c273fb0ca6f8187680237.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/b8be211db9c95b290da6ca9de447393a/3zf1EegMQAQ4OdftMWSASg8Ez9Dg3J_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/3bfeccc88c1706a5dd401df39582d7cf/eM5j469VIjNvWJjFs4OGOnOpTBbpXJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/images/3807-c1f3d8e13fdde1b3bbbe151d670c0d7e/current_341_080thumb_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5927ac2a9f8c4f35186a67a56463f3d5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/1147-17eeb8b0f2c8187af38e96c3beb52ed4/FVUsWstpKyefYMhht9ykKlxvcIcWoI_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/665a35f49d9ed9ee5e377fe2e56ae5d5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5e008e1b7775e0fb580178fbde8296d0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/c7d69afbd3078ae53535d17aa48ae881/cQSfBqFPiI3Kzcku6zJ9wB5Ysb01OD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/8cc71f23f230fe161df0a179a2f01231/7ssZeQ2jRKasoS7tOXcxzgfpIfQUW2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/fc3e150892b4045531192b33da6a25ea.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/37424cddb21112e6d495322578edb355.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/films/27674-the-moment-of-truth
I 2022/06/09 11:03:40 Fulltext indexing: AAtJpe_26NP5 https://www.criterion.com/films/27674-the-moment-of-truth
I 2022/06/09 11:03:40 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[AAtJpe_26NP5 (1735154855076429824)]} 0 2
I 2022/06/09 11:03:40 SWITCHBOARD *Indexed 378 words in URL https://www.criterion.com/films/27674-the-moment-of-truth [AAtJpe_26NP5]
Description: The Moment of Truth (1965) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4620 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:40 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 12)) = 243
I 2022/06/09 11:03:40 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 10)) = 245
I 2022/06/09 11:03:40 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=fukuda-jun, 226988 bytes
I 2022/06/09 11:03:40 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 512, robots.delay = 0, ((waitig = 256) - (timeSinceLastAccess = 11)) = 245
I 2022/06/09 11:03:40 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/7097d0cec67e4882cb538b53d8b72ec0/5Rx66b23nRm2oZXgaE98MZKJODK1Vo_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1953-0d1595bafe2ba6d32c5f5e1bc6215555/bG4INWRMocURqMHZl7ZgyVgw1O47Tx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/ddbb0db3267c57bf94cf0dc41f9c1e87/fJJtBeYp3LO4WrD9clkj4lAn0IzGG6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 SWITCHBOARD CRAWL: ADDED 48 LINKS FROM https://www.criterion.com/shop/browse/list?director=fukuda-jun, STACKING TIME = 3, PARSING TIME = 67
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/2e9d099057c503535d8c9314a6a56293/YCZYqbdZEnOETDz9YttzHFy11p3gH6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/f311e3fffb5e84072caffb39d66b32da/t97YUGHe6GAj7GsXWJ0fIl99zhrgWb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 REJECTED https://s3.amazonaws.com/criterion-production/films/7cf358ae483b929adbb8016808ede9ea/fy6RjXMYWsx6hrOnVfu8CH09U1IWa5_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:40 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=arnold-jack, 224197 bytes
I 2022/06/09 11:03:41 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=fukuda-jun
I 2022/06/09 11:03:41 Fulltext indexing: q495-m_26NP5 https://www.criterion.com/shop/browse/list?director=fukuda-jun
I 2022/06/09 11:03:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[q495-m_26NP5 (1735154855892221952)]} 0 7
I 2022/06/09 11:03:41 SWITCHBOARD *Indexed 1217 words in URL https://www.criterion.com/shop/browse/list?director=fukuda-jun [q495-m_26NP5]
Description: Jun Fukuda films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12711 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:41 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=arnold-jack, STACKING TIME = 1, PARSING TIME = 49
I 2022/06/09 11:03:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/films/38afc8a08b5c31eabd9e75a4cd1cee08/fdD4Lk4KeFdh5gGU3xwYImxc9kzoO6_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 HTCACHE storing content of url https://www.criterion.com/current/posts/703-le-deuxi-me-souffle-after-the-fall, 87153 bytes
I 2022/06/09 11:03:41 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 511, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 13)) = 242
I 2022/06/09 11:03:41 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/703-le-deuxi-me-souffle-after-the-fall, STACKING TIME = 8, PARSING TIME = 25
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/images/4361-324f30b2800c9833ecd88db6b0a49963/img_current_776_014_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=arnold-jack
I 2022/06/09 11:03:41 Fulltext indexing: qW1Nlm_26NP5 https://www.criterion.com/shop/browse/list?director=arnold-jack
I 2022/06/09 11:03:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[qW1Nlm_26NP5 (1735154856129200128)]} 0 4
I 2022/06/09 11:03:41 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=arnold-jack [qW1Nlm_26NP5]
Description: Jack Arnold films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12473 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:41 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/703-le-deuxi-me-souffle-after-the-fall
I 2022/06/09 11:03:41 Fulltext indexing: GdLlSG_26NP5 https://www.criterion.com/current/posts/703-le-deuxi-me-souffle-after-the-fall
I 2022/06/09 11:03:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[GdLlSG_26NP5 (1735154856221474816)]} 0 3
I 2022/06/09 11:03:41 SWITCHBOARD *Indexed 966 words in URL https://www.criterion.com/current/posts/703-le-deuxi-me-souffle-after-the-fall [GdLlSG_26NP5]
Description: Le deuxième souffle: After the Fall | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 15618 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:41 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 511, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 10)) = 245
I 2022/06/09 11:03:41 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=peckinpah-sam, 224159 bytes
I 2022/06/09 11:03:41 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=peckinpah-sam, STACKING TIME = 1, PARSING TIME = 25
I 2022/06/09 11:03:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/films/beca22ceb97188c09bd8ef6090c6bca7/XQmMDjG2VLEmmHY6Y6TslMJBOi1w2p_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=peckinpah-sam
I 2022/06/09 11:03:41 Fulltext indexing: RUogvm_26NP5 https://www.criterion.com/shop/browse/list?director=peckinpah-sam
I 2022/06/09 11:03:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[RUogvm_26NP5 (1735154856500396032)]} 0 2
I 2022/06/09 11:03:41 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=peckinpah-sam [RUogvm_26NP5]
Description: Sam Peckinpah films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12476 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:41 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 514, robots.delay = 0, ((waitig = 257) - (timeSinceLastAccess = 10)) = 247
I 2022/06/09 11:03:41 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=erksan-metin, 224660 bytes
I 2022/06/09 11:03:41 HTCACHE storing content of url https://www.criterion.com/current/posts/582-twenty-four-eyes-growing-pains, 119093 bytes
I 2022/06/09 11:03:41 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=erksan-metin, STACKING TIME = 1, PARSING TIME = 25
I 2022/06/09 11:03:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/films/49e3fce60e807242669361b7f18cfaf4/wmTcBChKCL1gZNBagJAHXx22sGKgKw_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1976-ece4132f4abef8c4e7beb0a0edffc9a8/y26UyQwNxt4FguJSgQIZWpCNlLsjHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/images/4335-d1e2c8224e9a824d3574ad68a178381d/img_current_518_011_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:41 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/582-twenty-four-eyes-growing-pains, STACKING TIME = 1, PARSING TIME = 29
I 2022/06/09 11:03:41 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=erksan-metin
I 2022/06/09 11:03:41 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Isn-lm_26NP5 (1735154856739471360)]} 0 2
I 2022/06/09 11:03:41 Fulltext indexing: Isn-lm_26NP5 https://www.criterion.com/shop/browse?director=erksan-metin
I 2022/06/09 11:03:41 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse?director=erksan-metin [Isn-lm_26NP5]
Description: Metin Erksan films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12461 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:41 HTCACHE storing content of url https://www.criterion.com/current/posts/788-the-burmese-harp, 90444 bytes
I 2022/06/09 11:03:41 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 9)) = 246
I 2022/06/09 11:03:42 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/582-twenty-four-eyes-growing-pains
I 2022/06/09 11:03:42 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/788-the-burmese-harp, STACKING TIME = 5, PARSING TIME = 15
I 2022/06/09 11:03:42 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[IW8SgG_26NP5 (1735154856919826432)]} 0 3
I 2022/06/09 11:03:42 Fulltext indexing: IW8SgG_26NP5 https://www.criterion.com/current/posts/582-twenty-four-eyes-growing-pains
I 2022/06/09 11:03:42 SWITCHBOARD *Indexed 944 words in URL https://www.criterion.com/current/posts/582-twenty-four-eyes-growing-pains [IW8SgG_26NP5]
Description: Twenty-Four Eyes: Growing Pains | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 13164 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:42 SWITCHBOARD Excluded 25 words in URL https://www.criterion.com/current/posts/788-the-burmese-harp
I 2022/06/09 11:03:42 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[qtPG6G_26NP5 (1735154856970158080)]} 0 2
I 2022/06/09 11:03:42 Fulltext indexing: qtPG6G_26NP5 https://www.criterion.com/current/posts/788-the-burmese-harp
I 2022/06/09 11:03:42 SWITCHBOARD *Indexed 570 words in URL https://www.criterion.com/current/posts/788-the-burmese-harp [qtPG6G_26NP5]
Description: The Burmese Harp | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7458 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:42 HTCACHE storing content of url https://www.criterion.com/current/posts/1149-pigs-and-battleships-feeding-frenzy, 120177 bytes
I 2022/06/09 11:03:42 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/current/posts/1149-pigs-and-battleships-feeding-frenzy, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:03:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://s3.amazonaws.com/criterion-production/images/4441-4c9e147750aa92efcad69c991f660f69/current_992_fg_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 357, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:42 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/1149-pigs-and-battleships-feeding-frenzy
I 2022/06/09 11:03:42 Fulltext indexing: kcFibG_26NP5 https://www.criterion.com/current/posts/1149-pigs-and-battleships-feeding-frenzy
I 2022/06/09 11:03:42 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[kcFibG_26NP5 (1735154857222864896)]} 0 3
I 2022/06/09 11:03:42 SWITCHBOARD *Indexed 1009 words in URL https://www.criterion.com/current/posts/1149-pigs-and-battleships-feeding-frenzy [kcFibG_26NP5]
Description: Pigs and Battleships: Feeding Frenzy | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 14120 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:42 HostQueue forcing crawl-delay of 205 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 357, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 45)) = 205
I 2022/06/09 11:03:42 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=oda-motoyoshi, 224748 bytes
I 2022/06/09 11:03:42 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=oda-motoyoshi, STACKING TIME = 0, PARSING TIME = 20
I 2022/06/09 11:03:42 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1953-0d1595bafe2ba6d32c5f5e1bc6215555/bG4INWRMocURqMHZl7ZgyVgw1O47Tx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://s3.amazonaws.com/criterion-production/films/12777a13abcdaa547d51c2273947ddff/mgBLNMreWObbgdQFJlszuoTPqR8Nay_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:42 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=oda-motoyoshi
I 2022/06/09 11:03:42 Fulltext indexing: V2wQxm_26NP5 https://www.criterion.com/shop/browse/list?director=oda-motoyoshi
I 2022/06/09 11:03:42 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 372, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:42 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[V2wQxm_26NP5 (1735154857676898304)]} 0 3
I 2022/06/09 11:03:42 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=oda-motoyoshi [V2wQxm_26NP5]
Description: Motoyoshi Oda films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12506 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:43 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 372, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:43 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lindberg-per, 224737 bytes
I 2022/06/09 11:03:43 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=lindberg-per, STACKING TIME = 1, PARSING TIME = 29
I 2022/06/09 11:03:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/films/490667c07df5c2a3e3b5e76219af4a20/IZXe9hzAvJvVLk1ZjjptbbtlVLDt8B_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=lindberg-per
I 2022/06/09 11:03:43 Fulltext indexing: Q9l0pm_26NP5 https://www.criterion.com/shop/browse/list?director=lindberg-per
I 2022/06/09 11:03:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Q9l0pm_26NP5 (1735154858094231552)]} 0 3
I 2022/06/09 11:03:43 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse/list?director=lindberg-per [Q9l0pm_26NP5]
Description: Per Lindberg films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12494 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:43 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 412, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:43 HTCACHE storing content of url https://www.criterion.com/current/posts/633-the-browning-version, 81990 bytes
I 2022/06/09 11:03:43 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=zetterling-mai, 224312 bytes
I 2022/06/09 11:03:43 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/633-the-browning-version, STACKING TIME = 1, PARSING TIME = 68
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=zetterling-mai, STACKING TIME = 0, PARSING TIME = 41
I 2022/06/09 11:03:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/633-the-browning-version
I 2022/06/09 11:03:43 Fulltext indexing: izmiqG_26NP5 https://www.criterion.com/current/posts/633-the-browning-version
I 2022/06/09 11:03:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[izmiqG_26NP5 (1735154858410901504)]} 0 7
I 2022/06/09 11:03:43 SWITCHBOARD *Indexed 835 words in URL https://www.criterion.com/current/posts/633-the-browning-version [izmiqG_26NP5]
Description: The Browning Version | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 11556 bytes |
LinkStorageTime: 16 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:43 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 415, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:43 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=zetterling-mai
I 2022/06/09 11:03:43 Fulltext indexing: KIZvRm_26NP5 https://www.criterion.com/shop/browse/list?director=zetterling-mai
I 2022/06/09 11:03:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[KIZvRm_26NP5 (1735154858500030464)]} 0 2
I 2022/06/09 11:03:43 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=zetterling-mai [KIZvRm_26NP5]
Description: Mai Zetterling films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12597 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:43 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=katsu-shintaro, 224716 bytes
I 2022/06/09 11:03:43 HTCACHE storing content of url https://www.criterion.com/current/posts/3103-errol-morris-on-stephen-hawking, 67966 bytes
I 2022/06/09 11:03:43 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 405, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 8)) = 242
I 2022/06/09 11:03:43 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=katsu-shintaro, STACKING TIME = 1, PARSING TIME = 74
I 2022/06/09 11:03:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/films/67b1624d36723adc095a547dbab9a4ea/thEX8E524itp1BxKsD1fZYPogk6S3g_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/3103-errol-morris-on-stephen-hawking, STACKING TIME = 6, PARSING TIME = 16
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/films/5199593d0fcdff78d678ad5ac1745fa9/vZ1yJIDRTUcAEmQs90SEEQ7tQvHCqu_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.youtube.com/embed/d7FBb_B5J5c?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:43 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=katsu-shintaro
I 2022/06/09 11:03:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[FQ3-6m_26NP5 (1735154858883809280)]} 0 2
I 2022/06/09 11:03:43 Fulltext indexing: FQ3-6m_26NP5 https://www.criterion.com/shop/browse?director=katsu-shintaro
I 2022/06/09 11:03:43 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse?director=katsu-shintaro [FQ3-6m_26NP5]
Description: Shintaro Katsu films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12499 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:43 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/current/posts/3103-errol-morris-on-stephen-hawking
I 2022/06/09 11:03:43 Fulltext indexing: J1XniG_26NP5 https://www.criterion.com/current/posts/3103-errol-morris-on-stephen-hawking
I 2022/06/09 11:03:43 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[J1XniG_26NP5 (1735154858903732224)]} 0 1
I 2022/06/09 11:03:43 SWITCHBOARD *Indexed 242 words in URL https://www.criterion.com/current/posts/3103-errol-morris-on-stephen-hawking [J1XniG_26NP5]
Description: Errol Morris on Stephen Hawking | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2925 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:44 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 405, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:03:44 HTCACHE storing content of url https://www.criterion.com/current/posts/4366-on-robeson-moonlight-shines-an-lgbtq-podcast-from-film-comment, 77033 bytes
I 2022/06/09 11:03:44 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=christian-jaque, 224185 bytes
I 2022/06/09 11:03:44 REJECTED http://www.indiewire.com/2016/12/isle-of-dogs-wes-anderson-stop-motion-first-footage-1201761330/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED http://lolajournal.com/7/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED http://www.bfi.org.uk/news-opinion/news-bfi/features/paul-robeson-singer-actor-activist - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://mubi.com/notebook/posts/as-if-their-light-could-define-us-jean-louis-schefer-s-the-ordinary-man-of-cinema - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.fandor.com/keyframe/time-movie-freeze-frame - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://w.soundcloud.com/player/?url=https%3A//api.soundcloud.com/tracks/298843045&auto_play=false&hide_related=false&show_comments=true&show_user=true&show_reposts=false&visual=true - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED http://www.indiewire.com/2016/12/best-movies-2016-critic-poll-results-1201757008/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED http://www.filmcomment.com/blog/film-comment-podcast-lgbtq-representation/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://thefilmstage.com/features/rossy-de-palma-on-trusting-pedro-almodovar-julieta-and-being-inspired-by-women/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 SWITCHBOARD CRAWL: ADDED 66 LINKS FROM https://www.criterion.com/current/posts/4366-on-robeson-moonlight-shines-an-lgbtq-podcast-from-film-comment, STACKING TIME = 5, PARSING TIME = 16
I 2022/06/09 11:03:44 REJECTED http://www.villagevoice.com/film/how-barry-jenkins-turned-the-misery-and-beauty-of-the-queer-black-experience-into-the-years-best-movie-9478791 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://s3.amazonaws.com/criterion-production/images/1127-256ef2ebc2c864f1be6f18064a30f022/curent_borderline297_028_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED http://www.villagevoice.com/filmpoll - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://s3.amazonaws.com/criterion-production/films/69a755ba1d29f769584b674d4114ac40/gGzssRokuyhN2VwbxmWfgZJVIq2EM7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://newrepublic.com/article/139373/sad-timely-return-killing-america - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:03:44 HostQueue forcing crawl-delay of 194 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 385, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 56)) = 194
I 2022/06/09 11:03:44 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/4366-on-robeson-moonlight-shines-an-lgbtq-podcast-from-film-comment
I 2022/06/09 11:03:44 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=christian-jaque, STACKING TIME = 20, PARSING TIME = 81
I 2022/06/09 11:03:44 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://s3.amazonaws.com/criterion-production/films/655106c6e5c5cb1345e565eaf218815e/H76zJ42x9gqvXla7tZzQflVDhzJYJz_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:44 Fulltext indexing: cU_kSG_26NP5 https://www.criterion.com/current/posts/4366-on-robeson-moonlight-shines-an-lgbtq-podcast-from-film-comment
I 2022/06/09 11:03:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[cU_kSG_26NP5 (1735154859370348544)]} 0 15
I 2022/06/09 11:03:44 SWITCHBOARD *Indexed 377 words in URL https://www.criterion.com/current/posts/4366-on-robeson-moonlight-shines-an-lgbtq-podcast-from-film-comment [cU_kSG_26NP5]
Description: On Robeson, Moonlight Shines, an LGBTQ podcast from Film Comment | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4166 bytes |
LinkStorageTime: 16 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:44 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:03:44 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@611e6c6e[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772539925}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vz(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604529}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w0(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604627}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w1(7.7.3):C16:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, 
os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772608698}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w2(7.7.3):C25:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772613778}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w3(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772619037}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w4(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772619585}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w5(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772619747}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w6(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772624408}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:03:44 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:03:44 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=christian-jaque
I 2022/06/09 11:03:44 Fulltext indexing: -XyNmm_26NP5 https://www.criterion.com/shop/browse/list?director=christian-jaque
I 2022/06/09 11:03:44 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[-XyNmm_26NP5 (1735154859478351872)]} 0 2
I 2022/06/09 11:03:44 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=christian-jaque [-XyNmm_26NP5]
Description: Christian-Jaque films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12469 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:44 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 385, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:44 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 385, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:44 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=skjoldbjaerg-erik, 224183 bytes
I 2022/06/09 11:03:44 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=skjoldbjaerg-erik, STACKING TIME = 0, PARSING TIME = 24
I 2022/06/09 11:03:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://s3.amazonaws.com/criterion-production/films/609ba3283ee6e7a688a8f332948af460/UQs28Jv5Fz6nuVrbcLXq70jhwKzxXV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bardem-juan-antonio, 224211 bytes
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 413, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:45 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=bardem-juan-antonio, STACKING TIME = 3, PARSING TIME = 78
I 2022/06/09 11:03:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://s3.amazonaws.com/criterion-production/films/012ec32830c466da41184e9b7ae850b9/j8Biresgpv45Yek1l8voqqAyV3xkcT_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=skjoldbjaerg-erik
I 2022/06/09 11:03:45 Fulltext indexing: nWHtkm_26NP5 https://www.criterion.com/shop/browse/list?director=skjoldbjaerg-erik
I 2022/06/09 11:03:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nWHtkm_26NP5 (1735154860243812352)]} 0 6
I 2022/06/09 11:03:45 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=skjoldbjaerg-erik [nWHtkm_26NP5]
Description: Erik Skjoldbjærg films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12477 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:45 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=bardem-juan-antonio
I 2022/06/09 11:03:45 Fulltext indexing: WirMJm_26NP5 https://www.criterion.com/shop/browse/list?director=bardem-juan-antonio
I 2022/06/09 11:03:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[WirMJm_26NP5 (1735154860320358400)]} 0 2
I 2022/06/09 11:03:45 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=bardem-juan-antonio [WirMJm_26NP5]
Description: Juan Antonio Bardem films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12475 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:45 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 413, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:45 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=brice-monte, 224227 bytes
I 2022/06/09 11:03:45 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=murray-anderson-john, 224223 bytes
I 2022/06/09 11:03:45 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=brice-monte, STACKING TIME = 1, PARSING TIME = 105
I 2022/06/09 11:03:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 434, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://s3.amazonaws.com/criterion-production/films/3a4a52811b630a9836c1b10cb2c55a38/1DZVBE8PnMfkggyvh5s9f7K2TSAiF0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=brice-monte
I 2022/06/09 11:03:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[D5TgXm_26NP5 (1735154860694700032)]} 0 1
I 2022/06/09 11:03:45 Fulltext indexing: D5TgXm_26NP5 https://www.criterion.com/shop/browse/list?director=brice-monte
I 2022/06/09 11:03:45 SWITCHBOARD *Indexed 1194 words in URL https://www.criterion.com/shop/browse/list?director=brice-monte [D5TgXm_26NP5]
Description: Monte Brice films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12509 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:45 SWITCHBOARD CRAWL: ADDED 42 LINKS FROM https://www.criterion.com/shop/browse/list?director=murray-anderson-john, STACKING TIME = 1, PARSING TIME = 43
I 2022/06/09 11:03:45 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://s3.amazonaws.com/criterion-production/films/15bb540b38dbdaa023601e7456c3eebe/J24VDct0pXh868DYmPMO0I2j9ucr1N_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:45 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 434, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:03:45 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=murray-anderson-john
I 2022/06/09 11:03:45 Fulltext indexing: Y1izLm_26NP5 https://www.criterion.com/shop/browse/list?director=murray-anderson-john
I 2022/06/09 11:03:45 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Y1izLm_26NP5 (1735154860942163968)]} 0 2
I 2022/06/09 11:03:45 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=murray-anderson-john [Y1izLm_26NP5]
Description: John Murray Anderson films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12501 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:46 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=neergaard-holm-olivia, 224293 bytes
I 2022/06/09 11:03:46 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 456, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 3)) = 247
I 2022/06/09 11:03:46 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kasdan-lawrence, 224181 bytes
I 2022/06/09 11:03:46 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=neergaard-holm-olivia, STACKING TIME = 1, PARSING TIME = 104
I 2022/06/09 11:03:46 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://s3.amazonaws.com/criterion-production/films/45ae4aaeb01b65d3788e09d553527a0d/O5urSQ3UPS5ELUhh25uHIS7M5ri1p3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kasdan-lawrence, STACKING TIME = 1, PARSING TIME = 46
I 2022/06/09 11:03:46 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://s3.amazonaws.com/criterion-production/films/ace687c73ff2a89b78e73f7ed313e796/2H53DXvnDSCbuN3PlD0MhvPBtWLagJ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=neergaard-holm-olivia
I 2022/06/09 11:03:46 Fulltext indexing: HW8TRm_26NP5 https://www.criterion.com/shop/browse/list?director=neergaard-holm-olivia
I 2022/06/09 11:03:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[HW8TRm_26NP5 (1735154861379420160)]} 0 2
I 2022/06/09 11:03:46 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=neergaard-holm-olivia [HW8TRm_26NP5]
Description: Olivia Neergaard-Holm films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12520 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:46 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 464, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 8)) = 242
I 2022/06/09 11:03:46 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=kasdan-lawrence
I 2022/06/09 11:03:46 Fulltext indexing: HvYuom_26NP5 https://www.criterion.com/shop/browse/list?director=kasdan-lawrence
I 2022/06/09 11:03:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[HvYuom_26NP5 (1735154861508395008)]} 0 2
I 2022/06/09 11:03:46 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=kasdan-lawrence [HvYuom_26NP5]
Description: Lawrence Kasdan films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12475 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:46 HTCACHE storing content of url https://www.criterion.com/current/posts/441-seduced-and-abandoned-honor-and-family, 78876 bytes
I 2022/06/09 11:03:46 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/441-seduced-and-abandoned-honor-and-family, STACKING TIME = 1, PARSING TIME = 12
I 2022/06/09 11:03:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=epstein-robert, 224203 bytes
I 2022/06/09 11:03:46 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/27735-torna
I 2022/06/09 11:03:46 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
I 2022/06/09 11:03:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[PsEXve_26NP5 (1735154861656244224)]} 0 0
I 2022/06/09 11:03:46 REJECTED https://www.criterion.com/films/27735-torna - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/27735-torna aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:03:46 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=epstein-robert, STACKING TIME = 1, PARSING TIME = 46
I 2022/06/09 11:03:46 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://s3.amazonaws.com/criterion-production/films/b218dc736aca04e70a9141fb76938977/iGVsfdDUhUrZ7Q8QTwV4HdWq9T9mQ3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:46 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/441-seduced-and-abandoned-honor-and-family
I 2022/06/09 11:03:46 Fulltext indexing: 7kQ-gG_26NP5 https://www.criterion.com/current/posts/441-seduced-and-abandoned-honor-and-family
I 2022/06/09 11:03:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[7kQ-gG_26NP5 (1735154861701332992)]} 0 3
I 2022/06/09 11:03:46 SWITCHBOARD *Indexed 809 words in URL https://www.criterion.com/current/posts/441-seduced-and-abandoned-honor-and-family [7kQ-gG_26NP5]
Description: Seduced and Abandoned: Honor and Family | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 11447 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:46 HostQueue forcing crawl-delay of 179 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 471, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 71)) = 179
I 2022/06/09 11:03:46 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=epstein-robert
I 2022/06/09 11:03:46 Fulltext indexing: fZAMOm_26NP5 https://www.criterion.com/shop/browse/list?director=epstein-robert
I 2022/06/09 11:03:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[fZAMOm_26NP5 (1735154861846036480)]} 0 2
I 2022/06/09 11:03:46 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=epstein-robert [fZAMOm_26NP5]
Description: Robert Epstein films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12476 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:46 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 471, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:46 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[EoKDLe_26NP5 (1735154862080917504)]} 0 0
I 2022/06/09 11:03:46 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/27839-phoenix
I 2022/06/09 11:03:46 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
I 2022/06/09 11:03:46 REJECTED https://www.criterion.com/films/27839-phoenix - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/27839-phoenix aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:03:47 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 471, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:47 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 471, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:47 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lang-fritz, 225868 bytes
I 2022/06/09 11:03:47 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=lang-fritz, STACKING TIME = 1, PARSING TIME = 29
I 2022/06/09 11:03:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://s3.amazonaws.com/criterion-production/films/54c721d65e42f080484dba5bb409daf0/j2sPOa4CCiMyJefg5DNy4uJraAbCCe_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://s3.amazonaws.com/criterion-production/films/1a5c01f75aef9b18d08f9ccbf5561972/zVKSQvVRaxcsqWeU0EVzXYRkFFxWSp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://s3.amazonaws.com/criterion-production/films/102d6709e0973307d33a352991ff721b/GqHBXoBeV4uWlYFG5OU3lnoYrFQ6iD_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 494, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:03:47 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=lang-fritz
I 2022/06/09 11:03:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[iuDX-m_26NP5 (1735154862800240640)]} 0 3
I 2022/06/09 11:03:47 Fulltext indexing: iuDX-m_26NP5 https://www.criterion.com/shop/browse/list?director=lang-fritz
I 2022/06/09 11:03:47 SWITCHBOARD *Indexed 1210 words in URL https://www.criterion.com/shop/browse/list?director=lang-fritz [iuDX-m_26NP5]
Description: Fritz Lang films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12631 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:47 HTCACHE storing content of url https://www.criterion.com/current/author/209-peter-brunette, 51714 bytes
I 2022/06/09 11:03:47 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/author/209-peter-brunette, STACKING TIME = 1, PARSING TIME = 8
I 2022/06/09 11:03:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/author/209-peter-brunette
I 2022/06/09 11:03:47 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zn7R-G_26NP5 (1735154862889369600)]} 0 1
I 2022/06/09 11:03:47 Fulltext indexing: zn7R-G_26NP5 https://www.criterion.com/current/author/209-peter-brunette
I 2022/06/09 11:03:47 SWITCHBOARD *Indexed 170 words in URL https://www.criterion.com/current/author/209-peter-brunette [zn7R-G_26NP5]
Description: Peter Brunette | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2010 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:47 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=neame-ronald, 225329 bytes
I 2022/06/09 11:03:47 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 499, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 3)) = 247
I 2022/06/09 11:03:47 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=neame-ronald, STACKING TIME = 1, PARSING TIME = 81
I 2022/06/09 11:03:47 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://s3.amazonaws.com/criterion-production/films/c6959a1673c21a4b3e6548fb30fef56a/8xozXlxosdeqIbRkrZGdsFsQqKBq5V_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://s3.amazonaws.com/criterion-production/films/418e66a9918f1828df51c24759694c93/fYx0hDxKnfUdjea1QAR3n18yKhbAJy_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://s3.amazonaws.com/criterion-production/films/43b80b5305b96f14302af3745135d219/LXsJdJHsKGIMUWbeeMrwqtsVXGAOlR_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:47 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=neame-ronald
I 2022/06/09 11:03:48 Fulltext indexing: qPaxVm_26NP5 https://www.criterion.com/shop/browse/list?director=neame-ronald
I 2022/06/09 11:03:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[qPaxVm_26NP5 (1735154863227011072)]} 0 2
I 2022/06/09 11:03:48 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse/list?director=neame-ronald [qPaxVm_26NP5]
Description: Ronald Neame films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12615 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:48 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 499, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:48 HTCACHE storing content of url https://www.criterion.com/current/posts/428-backyard-monsters-equinox-and-the-triumph-of-love, 98705 bytes
I 2022/06/09 11:03:48 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/428-backyard-monsters-equinox-and-the-triumph-of-love, STACKING TIME = 1, PARSING TIME = 16
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED http://www.stopmotionanimation.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=hondo-med, 224676 bytes
I 2022/06/09 11:03:48 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=hondo-med, STACKING TIME = 1, PARSING TIME = 95
I 2022/06/09 11:03:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1961-3b7b159a9ae3d500459e38e69c96a917/9P9MeXzolFQyY5OhK2XcwZ02e0Y0ZC_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/films/b1f5f414289850a75d37cf7c5606d8dd/bLwpos3fzsDT1qxVibPnN18FsBc63o_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 HTCACHE storing content of url https://www.criterion.com/current/posts/2789-my-golden-voyage-with-harryhausen, 72414 bytes
I 2022/06/09 11:03:48 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 481, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:48 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/2789-my-golden-voyage-with-harryhausen, STACKING TIME = 11, PARSING TIME = 15
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7820-/YKRO4oMLJouJhzJ0UVxMAsXpd2O4sF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/images/5124-118957ad44be99c7e125ffebcbabf76a/RH_CUrrent_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7799-/BcSOGzRAmQVrNktlZ6a8juCv8YVP7g_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7821-/DIxDSXUzZ7yk0yAo3JyIs4aQdkSJqq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/tout_image/7812-/cfFE90t4MLOwAR88lqDIdl2lvUw6SX_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/428-backyard-monsters-equinox-and-the-triumph-of-love
I 2022/06/09 11:03:48 Fulltext indexing: GVPBeG_26NP5 https://www.criterion.com/current/posts/428-backyard-monsters-equinox-and-the-triumph-of-love
I 2022/06/09 11:03:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[GVPBeG_26NP5 (1735154863664267264)]} 0 8
I 2022/06/09 11:03:48 SWITCHBOARD *Indexed 1633 words in URL https://www.criterion.com/current/posts/428-backyard-monsters-equinox-and-the-triumph-of-love [GVPBeG_26NP5]
Description: Backyard Monsters: Equinox and the Triumph of Love | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 26492 bytes |
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:48 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=hondo-med
I 2022/06/09 11:03:48 Fulltext indexing: 5K4cam_26NP5 https://www.criterion.com/shop/browse?director=hondo-med
I 2022/06/09 11:03:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[5K4cam_26NP5 (1735154863752347648)]} 0 2
I 2022/06/09 11:03:48 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse?director=hondo-med [5K4cam_26NP5]
Description: Med Hondo films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12474 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:48 HTCACHE storing content of url https://www.criterion.com/current/posts/273-before-the-beginning-was-the-word-stan-brakhage-s, 97765 bytes
I 2022/06/09 11:03:48 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/2789-my-golden-voyage-with-harryhausen
I 2022/06/09 11:03:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Gf_QoG_26NP5 (1735154863807922176)]} 0 1
I 2022/06/09 11:03:48 Fulltext indexing: Gf_QoG_26NP5 https://www.criterion.com/current/posts/2789-my-golden-voyage-with-harryhausen
I 2022/06/09 11:03:48 SWITCHBOARD *Indexed 581 words in URL https://www.criterion.com/current/posts/2789-my-golden-voyage-with-harryhausen [Gf_QoG_26NP5]
Description: My Golden Voyage with Harryhausen | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7111 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/273-before-the-beginning-was-the-word-stan-brakhage-s, STACKING TIME = 6, PARSING TIME = 20
I 2022/06/09 11:03:48 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 470, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:48 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/273-before-the-beginning-was-the-word-stan-brakhage-s
I 2022/06/09 11:03:48 Fulltext indexing: nN1FvG_26NP5 https://www.criterion.com/current/posts/273-before-the-beginning-was-the-word-stan-brakhage-s
I 2022/06/09 11:03:48 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[nN1FvG_26NP5 (1735154863920119808)]} 0 3
I 2022/06/09 11:03:48 SWITCHBOARD *Indexed 708 words in URL https://www.criterion.com/current/posts/273-before-the-beginning-was-the-word-stan-brakhage-s [nN1FvG_26NP5]
Description: Before the Beginning Was the Word:Stan Brakhages | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 9545 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:48 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 470, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:03:48 HTCACHE storing content of url https://www.criterion.com/current/posts/422-harlan-county-usa-no-neutrals-there, 120282 bytes
I 2022/06/09 11:03:48 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/422-harlan-county-usa-no-neutrals-there, STACKING TIME = 2, PARSING TIME = 12
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:48 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/422-harlan-county-usa-no-neutrals-there
I 2022/06/09 11:03:49 Fulltext indexing: S2_PKG_26NP5 https://www.criterion.com/current/posts/422-harlan-county-usa-no-neutrals-there
I 2022/06/09 11:03:49 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[S2_PKG_26NP5 (1735154864322772992)]} 0 4
I 2022/06/09 11:03:49 SWITCHBOARD *Indexed 1018 words in URL https://www.criterion.com/current/posts/422-harlan-county-usa-no-neutrals-there [S2_PKG_26NP5]
Description: Harlan County USA: No Neutrals There | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 13246 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:49 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 466, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:49 HTCACHE storing content of url https://www.criterion.com/current/posts/367-burden-of-dreams-in-dreams-begin-responsibilities, 117695 bytes
I 2022/06/09 11:03:49 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/367-burden-of-dreams-in-dreams-begin-responsibilities, STACKING TIME = 1, PARSING TIME = 9
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/images/4265-addce7bd88a30db399119072fee4d02c/img_current_287_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/367-burden-of-dreams-in-dreams-begin-responsibilities
I 2022/06/09 11:03:49 Fulltext indexing: wogbGG_26NP5 https://www.criterion.com/current/posts/367-burden-of-dreams-in-dreams-begin-responsibilities
I 2022/06/09 11:03:49 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[wogbGG_26NP5 (1735154864612179968)]} 0 4
I 2022/06/09 11:03:49 SWITCHBOARD *Indexed 999 words in URL https://www.criterion.com/current/posts/367-burden-of-dreams-in-dreams-begin-responsibilities [wogbGG_26NP5]
Description: Burden of Dreams: In Dreams Begin Responsibilities | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12602 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:49 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 463, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:03:49 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:03:49 HTCACHE storing content of url https://www.criterion.com/current/posts/4632-silent-ozu-on-the-potomac, 67906 bytes
I 2022/06/09 11:03:49 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/4632-silent-ozu-on-the-potomac, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/images/8446-e50fad3c67709884e744064453efab82/storyoffloatingweeds_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/films/d94b55b796fc63cd5dd36e29dd2d52cf/cTTGr3nonhyBnN87hE1AVpIaZ0QUkT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED http://www.asia.si.edu/events/films.asp - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/4632-silent-ozu-on-the-potomac
I 2022/06/09 11:03:49 Fulltext indexing: qk1JsG_26NP5 https://www.criterion.com/current/posts/4632-silent-ozu-on-the-potomac
I 2022/06/09 11:03:49 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[qk1JsG_26NP5 (1735154864742203392)]} 0 10
I 2022/06/09 11:03:49 SWITCHBOARD *Indexed 318 words in URL https://www.criterion.com/current/posts/4632-silent-ozu-on-the-potomac [qk1JsG_26NP5]
Description: Silent Ozu on the Potomac | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3532 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:49 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:03:49 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@9812b0e8[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vf(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772539925}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vz(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604529}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w0(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772604627}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w1(7.7.3):C16:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, 
os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772608698}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w2(7.7.3):C25:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772613778}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w3(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772619037}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w4(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772619585}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w5(7.7.3):C1:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772619747}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w6(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772624408}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w7(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, 
lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772629483}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:03:49 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:03:49 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 457, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:49 HTCACHE storing content of url https://www.criterion.com/boxsets/618-eclipse-series-16-alexander-korda-s-private-lives, 71267 bytes
I 2022/06/09 11:03:49 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/boxsets/618-eclipse-series-16-alexander-korda-s-private-lives, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:03:49 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ef8f0dcd517471f7b9c1cc9ab79ddb6b.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/bec6d028451ddb93fe49c8183d3c0d5a.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/632b7e77045f341b35091788e0722140.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/films/827d83f395e9871c00115b30371c59ec/SPglN2wf8mGud0NXkBwEvzltbPB0OO_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 450, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1792-9c7fc9fe72c1452bdcb47e93fc30d9fc/upVBpiNpMTno5TapW08e3GoWZshBAi_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/films/29c5b186cd957f4893c2351f1f58792e/Jaxl42Fj3X25zsfakaYr3QU5qN2wO7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/films/56f5eded1755287bf41b02369806c72a/VJrQfxSkZ5IP9CCYnP6Iaf5gUzZ5Sj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://s3.amazonaws.com/criterion-production/films/12b8e0ac403fbe3fab4731f009dc1047/SEjwKWvKImibQ1DMHE1IBRhnQCdJC3_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/44c2e68ab34c82e49dc5e4682cdbff52.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:49 SWITCHBOARD Excluded 22 words in URL https://www.criterion.com/boxsets/618-eclipse-series-16-alexander-korda-s-private-lives
I 2022/06/09 11:03:49 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[S_t9D3_26NP5 (1735154865216159744)]} 0 2
I 2022/06/09 11:03:49 Fulltext indexing: S_t9D3_26NP5 https://www.criterion.com/boxsets/618-eclipse-series-16-alexander-korda-s-private-lives
I 2022/06/09 11:03:49 SWITCHBOARD *Indexed 353 words in URL https://www.criterion.com/boxsets/618-eclipse-series-16-alexander-korda-s-private-lives [S_t9D3_26NP5]
Description: Eclipse Series 16: Alexander Korda’s Private Lives | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6053 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:50 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=korda-alexander, 227603 bytes
I 2022/06/09 11:03:50 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/shop/browse/list?director=korda-alexander, STACKING TIME = 1, PARSING TIME = 22
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/9097f2ab6c434435d6db191904457a84/5KOqSgdRhZXBRqaUfZKw83ipTh8dD9_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/fa83c22895c56728b4564bfc6e403a3e/NjGkZVZ4UTHTcuRFMrgbVv9jSUYcAI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/0a23a551aebf07fcf997dd0fc7201868/gK5Z0zfUDusxIBnAScefOq7kY5Pw6q_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1792-9c7fc9fe72c1452bdcb47e93fc30d9fc/upVBpiNpMTno5TapW08e3GoWZshBAi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/efd531e4f833ecf6ed4864afa507c11e/11QuJYuU3oyeIXl1PO6quna7UKCBH2_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/173c1232ac5647369dc45f157375b856/kE40rcbpjCPhQuXuPPxhKg9COoT0Fc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1766-0926bfc8985e1333759badc8421feb20/CcaaWhABn32RGpmaY0QfWfQzx5pNgV_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 458, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:03:50 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=korda-alexander
I 2022/06/09 11:03:50 Fulltext indexing: BUNiEm_26NP5 https://www.criterion.com/shop/browse/list?director=korda-alexander
I 2022/06/09 11:03:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BUNiEm_26NP5 (1735154865534926848)]} 0 2
I 2022/06/09 11:03:50 SWITCHBOARD *Indexed 1222 words in URL https://www.criterion.com/shop/browse/list?director=korda-alexander [BUNiEm_26NP5]
Description: Alexander Korda films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12808 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:50 HTCACHE storing content of url https://www.criterion.com/current/posts/6392-all-the-bad-men-in-let-the-sunshine-in, 64018 bytes
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=marshall-george, 224187 bytes
I 2022/06/09 11:03:50 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/6392-all-the-bad-men-in-let-the-sunshine-in, STACKING TIME = 11, PARSING TIME = 7
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/81177a98fc57c3e559501ecf60cb8d40/qgGnGPdl9T0PEVviPxrZsI6A4eenNR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://player.vimeo.com/video/335939106 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 450, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:03:50 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/current/posts/6392-all-the-bad-men-in-let-the-sunshine-in
I 2022/06/09 11:03:50 Fulltext indexing: 4RpjxG_26NP5 https://www.criterion.com/current/posts/6392-all-the-bad-men-in-let-the-sunshine-in
I 2022/06/09 11:03:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4RpjxG_26NP5 (1735154865710039040)]} 0 2
I 2022/06/09 11:03:50 SWITCHBOARD *Indexed 328 words in URL https://www.criterion.com/current/posts/6392-all-the-bad-men-in-let-the-sunshine-in [4RpjxG_26NP5]
Description: All the Bad Men in Let the Sunshine In | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3708 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:50 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=marshall-george, STACKING TIME = 1, PARSING TIME = 45
I 2022/06/09 11:03:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/36fa49e72f1b22eb2da8666d3df12704/NYApshBAzWWbOpps4xtjB1L9La7xTa_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=marshall-george
I 2022/06/09 11:03:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[khauem_26NP5 (1735154865794973696)]} 0 2
I 2022/06/09 11:03:50 Fulltext indexing: khauem_26NP5 https://www.criterion.com/shop/browse/list?director=marshall-george
I 2022/06/09 11:03:50 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=marshall-george [khauem_26NP5]
Description: George Marshall films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12476 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:50 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 450, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:50 HTCACHE storing content of url https://www.criterion.com/boxsets/613-pigs-pimps-prostitutes-3-films-by-shohei-imamura, 70540 bytes
I 2022/06/09 11:03:50 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/boxsets/613-pigs-pimps-prostitutes-3-films-by-shohei-imamura, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/657702e01ca689b19768581b747e97e9/lBbqkEWi0j71pgETSpwoMhEHQH5J5l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/6dc46977af83d6498662ee41041961e0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1855-d1af5d0a5e3d807a317f3cc4e9c52f38/vdPxhXhBYsJOygUcZ3XqUkDiS3dZOl_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/a0ee213bcd52a61768546b6f50b49e93/oUj4Rhj3eltZ2KJezVL8MqVJVtmf8S_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://s3.amazonaws.com/criterion-production/films/955f6f5f61b98e8a440adbaff6544904/Pg6ZGiA0S3eXXGhlqVdh9PGdQtPvUw_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/2e77b8b61dab863fffc3972aef121e72.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/1cc4a3cbedc3d14be75648a0c88c3ebe.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:50 SWITCHBOARD Excluded 20 words in URL https://www.criterion.com/boxsets/613-pigs-pimps-prostitutes-3-films-by-shohei-imamura
I 2022/06/09 11:03:50 Fulltext indexing: c95zp3_26NP5 https://www.criterion.com/boxsets/613-pigs-pimps-prostitutes-3-films-by-shohei-imamura
I 2022/06/09 11:03:50 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[c95zp3_26NP5 (1735154866012028928)]} 0 2
I 2022/06/09 11:03:50 SWITCHBOARD *Indexed 356 words in URL https://www.criterion.com/boxsets/613-pigs-pimps-prostitutes-3-films-by-shohei-imamura [c95zp3_26NP5]
Description: Pigs, Pimps & Prostitutes: 3 Films by Shohei Imamura | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6045 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:50 HostQueue forcing crawl-delay of 197 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 445, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 53)) = 197
I 2022/06/09 11:03:51 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 445, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:51 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=imamura-shohei, 227037 bytes
I 2022/06/09 11:03:51 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 452, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:03:51 SWITCHBOARD CRAWL: ADDED 48 LINKS FROM https://www.criterion.com/shop/browse/list?director=imamura-shohei, STACKING TIME = 1, PARSING TIME = 47
I 2022/06/09 11:03:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/a0ee213bcd52a61768546b6f50b49e93/oUj4Rhj3eltZ2KJezVL8MqVJVtmf8S_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/2eb415de8274162b30e0d0ef5f7731bc/VtyOttyhj2h4swVZ63WLXq9Cf3lejM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/5f00be15d05b84e198167a0d0c43b215/0cl4Pa663UuwdYvZLSHaH61wcgRGDN_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/955f6f5f61b98e8a440adbaff6544904/Pg6ZGiA0S3eXXGhlqVdh9PGdQtPvUw_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/657702e01ca689b19768581b747e97e9/lBbqkEWi0j71pgETSpwoMhEHQH5J5l_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1855-d1af5d0a5e3d807a317f3cc4e9c52f38/vdPxhXhBYsJOygUcZ3XqUkDiS3dZOl_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=imamura-shohei
I 2022/06/09 11:03:51 Fulltext indexing: Qd97Xm_26NP5 https://www.criterion.com/shop/browse/list?director=imamura-shohei
I 2022/06/09 11:03:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[Qd97Xm_26NP5 (1735154866828869632)]} 0 3
I 2022/06/09 11:03:51 SWITCHBOARD *Indexed 1222 words in URL https://www.criterion.com/shop/browse/list?director=imamura-shohei [Qd97Xm_26NP5]
Description: Shohei Imamura films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12756 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:51 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=cuaron-alfonso, 225292 bytes
I 2022/06/09 11:03:51 HTCACHE storing content of url https://www.criterion.com/current/posts/5279-atom-egoyan-on-split-screen, 67357 bytes
I 2022/06/09 11:03:51 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=cuaron-alfonso, STACKING TIME = 1, PARSING TIME = 87
I 2022/06/09 11:03:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/bc349204816d212fabd9b71524763b75/BMnwff4YpeT4kAkBsBwD6PGbdffoL3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/5f30d2a6f02704c28b2b31a9331e1f7c/9th6Iqsdh4VJpysPfdoOhLkU6YRLB9_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/5caddf0518bd053fc51e7160d3c1b98d/FavFi41cWhvnqrMDnCfEjD6pPntr2x_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/5279-atom-egoyan-on-split-screen, STACKING TIME = 1, PARSING TIME = 16
I 2022/06/09 11:03:51 REJECTED https://www.filmstruck.com/us/watch/franchise/1700000042 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.filmstruck.com/us/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.filmstruck.com/us/watch/bundle/1520000464 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.nytimes.com/2018/01/11/movies/john-pierson-split-screen-on-filmstruck.html?_r=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://player.vimeo.com/video/251356134 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 HostQueue forcing crawl-delay of 234 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 457, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 16)) = 234
I 2022/06/09 11:03:51 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=cuaron-alfonso
I 2022/06/09 11:03:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[WQ6UHm_26NP5 (1735154867053264896)]} 0 2
I 2022/06/09 11:03:51 Fulltext indexing: WQ6UHm_26NP5 https://www.criterion.com/shop/browse/list?director=cuaron-alfonso
I 2022/06/09 11:03:51 SWITCHBOARD *Indexed 1199 words in URL https://www.criterion.com/shop/browse/list?director=cuaron-alfonso [WQ6UHm_26NP5]
Description: Alfonso Cuarón films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12578 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:51 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/posts/5279-atom-egoyan-on-split-screen
I 2022/06/09 11:03:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[9KRGOG_26NP5 (1735154867070042112)]} 0 1
I 2022/06/09 11:03:51 Fulltext indexing: 9KRGOG_26NP5 https://www.criterion.com/current/posts/5279-atom-egoyan-on-split-screen
I 2022/06/09 11:03:51 SWITCHBOARD *Indexed 268 words in URL https://www.criterion.com/current/posts/5279-atom-egoyan-on-split-screen [9KRGOG_26NP5]
Description: Atom Egoyan on Split Screen | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3397 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:51 HTCACHE storing content of url https://www.criterion.com/current/posts/5741-manila-in-the-claws-of-light-a-proletarian-inferno, 85183 bytes
I 2022/06/09 11:03:51 SWITCHBOARD CRAWL: ADDED 60 LINKS FROM https://www.criterion.com/current/posts/5741-manila-in-the-claws-of-light-a-proletarian-inferno, STACKING TIME = 1, PARSING TIME = 8
I 2022/06/09 11:03:51 REJECTED https://criterion-production.s3.amazonaws.com/6YZ7U6Y8nyHP1qxE30j9bK8qHshMOs.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://criterion-production.s3.amazonaws.com/okNhpxQKpdPKER0kJ60xSkCl03awT9.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://criterion-production.s3.amazonaws.com/fzzdn3cIDitz1fORN2NP8BP3SGozAh.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://criterion-production.s3.amazonaws.com/VRaQ5cz4uAtLqIEijcbOcfmI2lxsRn.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://criterion-production.s3.amazonaws.com/EeDdSDd4lHNPxKgfnoqz9fRldSI4bD.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 REJECTED https://s3.amazonaws.com/criterion-production/films/194827b27de9e6f270c3f62a940755a4/mZ78UUAcVc0c4hTbCJQeFC4V1vvVd1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:51 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 455, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:03:51 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/5741-manila-in-the-claws-of-light-a-proletarian-inferno
I 2022/06/09 11:03:51 Fulltext indexing: A-uz8G_26NP5 https://www.criterion.com/current/posts/5741-manila-in-the-claws-of-light-a-proletarian-inferno
I 2022/06/09 11:03:51 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[A-uz8G_26NP5 (1735154867280805888)]} 0 3
I 2022/06/09 11:03:51 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/current/posts/5741-manila-in-the-claws-of-light-a-proletarian-inferno [A-uz8G_26NP5]
Description: Manila in the Claws of Light: A Proletarian Inferno | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 17410 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:51 HTCACHE storing content of url https://www.criterion.com/current/posts/5748-bringing-the-grit-to-philippine-cinema, 68462 bytes
I 2022/06/09 11:03:52 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/5748-bringing-the-grit-to-philippine-cinema, STACKING TIME = 1, PARSING TIME = 56
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 HTCACHE storing content of url https://www.criterion.com/current/author/681-jos-b-capino, 49960 bytes
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://player.vimeo.com/video/271505612 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/films/194827b27de9e6f270c3f62a940755a4/mZ78UUAcVc0c4hTbCJQeFC4V1vvVd1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/681-jos-b-capino, STACKING TIME = 1, PARSING TIME = 10
I 2022/06/09 11:03:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 SWITCHBOARD Excluded 22 words in URL https://www.criterion.com/current/posts/5748-bringing-the-grit-to-philippine-cinema
I 2022/06/09 11:03:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lcezCG_26NP5 (1735154867435995136)]} 0 1
I 2022/06/09 11:03:52 Fulltext indexing: lcezCG_26NP5 https://www.criterion.com/current/posts/5748-bringing-the-grit-to-philippine-cinema
I 2022/06/09 11:03:52 SWITCHBOARD *Indexed 366 words in URL https://www.criterion.com/current/posts/5748-bringing-the-grit-to-philippine-cinema [lcezCG_26NP5]
Description: Bringing the Grit to Philippine Cinema | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4259 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:52 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/current/author/681-jos-b-capino
I 2022/06/09 11:03:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[2MSKYG_26NP5 (1735154867443335168)]} 0 1
I 2022/06/09 11:03:52 Fulltext indexing: 2MSKYG_26NP5 https://www.criterion.com/current/author/681-jos-b-capino
I 2022/06/09 11:03:52 SWITCHBOARD *Indexed 134 words in URL https://www.criterion.com/current/author/681-jos-b-capino [2MSKYG_26NP5]
Description: José B. Capino | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1773 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:52 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 443, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:03:52 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 443, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:52 HTCACHE storing content of url https://www.criterion.com/boxsets/1136-eclipse-series-44-julien-duvivier-in-the-thirties, 72262 bytes
I 2022/06/09 11:03:52 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/boxsets/1136-eclipse-series-44-julien-duvivier-in-the-thirties, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:03:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/357e77a9ff6fa356f0fdd1c60b2992e1.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/films/07eaa6995f0ba6e8e19f4d4a3406a526/NbZy4fMPZHcVJU0QsvIeYmmgBvrW7q_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/d0864172609dfd3fb401f4f733834efc.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1820-cad13dad3a97c22bebc2a9ca06d4d892/6VtWN3nFdPeeB727qUGqGdgJwQ3sJo_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/films/fc9b529165aefd3cf39bf5523c241266/uwqWmbSt9GVpJICaCTfIP9A8HlbdfK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/films/222ba48d8fe5a5bd9d6035b15465e8d7/SWaLXWFruVp0zkKZdKDGqACFRsg8V8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/8ad7cc22163f583adbe8a61e82244ace.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/55eed17724996ab93ccb9b1891bd083e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 REJECTED https://s3.amazonaws.com/criterion-production/films/f05128f3a7b5fb3a2c1e4954b5139f56/hbNiJQzDdewTWWcw76cHmLhXu19haU_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:52 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/boxsets/1136-eclipse-series-44-julien-duvivier-in-the-thirties
I 2022/06/09 11:03:52 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 437, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:52 Fulltext indexing: 4cufj3_26NP5 https://www.criterion.com/boxsets/1136-eclipse-series-44-julien-duvivier-in-the-thirties
I 2022/06/09 11:03:52 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4cufj3_26NP5 (1735154868054654976)]} 0 2
I 2022/06/09 11:03:52 SWITCHBOARD *Indexed 384 words in URL https://www.criterion.com/boxsets/1136-eclipse-series-44-julien-duvivier-in-the-thirties [4cufj3_26NP5]
Description: Eclipse Series 44: Julien Duvivier in the Thirties | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6169 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:52 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=molander-gustaf, 225860 bytes
I 2022/06/09 11:03:52 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 442, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:03:53 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=molander-gustaf, STACKING TIME = 1, PARSING TIME = 31
I 2022/06/09 11:03:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/2962399ac5d7821f06fd7e6c677e6fd7/Tp4fgX25eSqejEpi5CLWglqUyAC5Tv_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/a7127b93634281663783609475df1184/ZEZOEVz3wHw6BdpH18x6ilo8HQXIlj_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/e93aac4b4c354ad7e9f9cd71c6e29dd0/J9wAJzyv00CFbPcP6jqOFzPYS46V7e_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 442, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:53 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=molander-gustaf
I 2022/06/09 11:03:53 Fulltext indexing: fNXttm_26NP5 https://www.criterion.com/shop/browse/list?director=molander-gustaf
I 2022/06/09 11:03:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[fNXttm_26NP5 (1735154868641857536)]} 0 2
I 2022/06/09 11:03:53 SWITCHBOARD *Indexed 1203 words in URL https://www.criterion.com/shop/browse/list?director=molander-gustaf [fNXttm_26NP5]
Description: Gustaf Molander films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12606 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:53 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=o-akad-lutfi, 224734 bytes
I 2022/06/09 11:03:53 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=o-akad-lutfi, STACKING TIME = 1, PARSING TIME = 29
I 2022/06/09 11:03:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/c2105b60b5c472ef0353df86749c06d1/BHPNyPCXUmK5TeXUJI4xTFV6Qbxtrb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1767-0744687c884e8c056982d471b122dce3/cgKxO604g3phzPqpONSN5STBrfnV6y_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=duvivier-julien, 228097 bytes
I 2022/06/09 11:03:53 SWITCHBOARD CRAWL: ADDED 50 LINKS FROM https://www.criterion.com/shop/browse/list?director=duvivier-julien, STACKING TIME = 1, PARSING TIME = 39
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/d91270fb29e1545b74a9b171b31ed123/DICVrTQXZm6Hp1ZA5eeNVS5C2cJH9j_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/d3e5b0be0cbf103ddc190b26556990e7/VG0RApuhMC7e7jA4AGjDw4ZUoCP57g_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/f9a71f3a062f3620d6ea66dbc3cd7ba6/LXPNa7UCaNnqAZUSt8guKGaM5MwcJr_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1820-cad13dad3a97c22bebc2a9ca06d4d892/6VtWN3nFdPeeB727qUGqGdgJwQ3sJo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/403e847dd8731937c35889728349b623/CgM0xL1w053hSfTPUxWl8HH8UBp2jI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/d025caecfc954f0e20bd89c5dbde6b72/aDBAiP1zQFiGqJzA3B6maRSwtmyobB_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/films/8075aaed4e8b71b2d75ae941500dcb28/Gegnkhyk6MqCcaBfUrKM2O1lNp4wtH_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:53 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=o-akad-lutfi
I 2022/06/09 11:03:53 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:53 Fulltext indexing: GSvm4m_26NP5 https://www.criterion.com/shop/browse?director=o-akad-lutfi
I 2022/06/09 11:03:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[GSvm4m_26NP5 (1735154868866252800)]} 0 56
I 2022/06/09 11:03:53 SWITCHBOARD *Indexed 1198 words in URL https://www.criterion.com/shop/browse?director=o-akad-lutfi [GSvm4m_26NP5]
Description: Lütfi Ö. Akad films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12501 bytes |
LinkStorageTime: 57 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:53 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=duvivier-julien
I 2022/06/09 11:03:53 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BGU_Xm_26NP5 (1735154868992081920)]} 0 2
I 2022/06/09 11:03:53 Fulltext indexing: BGU_Xm_26NP5 https://www.criterion.com/shop/browse/list?director=duvivier-julien
I 2022/06/09 11:03:53 SWITCHBOARD *Indexed 1230 words in URL https://www.criterion.com/shop/browse/list?director=duvivier-julien [BGU_Xm_26NP5]
Description: Julien Duvivier films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12811 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:53 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:53 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=middleton-edwin, 224232 bytes
I 2022/06/09 11:03:54 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=middleton-edwin, STACKING TIME = 1, PARSING TIME = 30
I 2022/06/09 11:03:54 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/3a4a52811b630a9836c1b10cb2c55a38/1DZVBE8PnMfkggyvh5s9f7K2TSAiF0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=honda-ishiro, 228161 bytes
I 2022/06/09 11:03:54 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 464, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:54 SWITCHBOARD CRAWL: ADDED 50 LINKS FROM https://www.criterion.com/shop/browse/list?director=honda-ishiro, STACKING TIME = 1, PARSING TIME = 47
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/931007ed2d072cc749aae6dd5b08498a/h60E9l17Gc5uJu5X7KHUOBrQFg5hoV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/14bcf36506cf57c5e3752e8005612fdd/9Q6vehzMBEqVa7Di5w0dDoISN8dFn7_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/8fa6755916b32ef7098b57550a72ea08/9NH9clK8dPpEO6houf2zTXuBMCBkw1_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/1dd15fdf3d6be319f4477b12f4ed5b43/Z6GsOQiR9sk5pPAhi2ykMh1ZJ75zYk_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1953-0d1595bafe2ba6d32c5f5e1bc6215555/bG4INWRMocURqMHZl7ZgyVgw1O47Tx_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/f0260853d1a5a44039eabd2bbcd9af56/vKeL3Z8JQS8CcpVGf71hCklwYsRZ4G_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/31f099012aaf951c70361da005ba0bc1/t7D0fBx5a0bcQoVaXwwX0kZ5FVhvlI_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/f9389395b2c45e76850749c3d891ffcf/mfpB0xEqSTm7MITaxo4wOvRDhfhwp2_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=middleton-edwin
I 2022/06/09 11:03:54 Fulltext indexing: M8xGCm_26NP5 https://www.criterion.com/shop/browse/list?director=middleton-edwin
I 2022/06/09 11:03:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[M8xGCm_26NP5 (1735154869750202368)]} 0 4
I 2022/06/09 11:03:54 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=middleton-edwin [M8xGCm_26NP5]
Description: Edwin Middleton films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12491 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:54 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=honda-ishiro
I 2022/06/09 11:03:54 Fulltext indexing: lCw8Wm_26NP5 https://www.criterion.com/shop/browse/list?director=honda-ishiro
I 2022/06/09 11:03:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[lCw8Wm_26NP5 (1735154869817311232)]} 0 2
I 2022/06/09 11:03:54 SWITCHBOARD *Indexed 1224 words in URL https://www.criterion.com/shop/browse/list?director=honda-ishiro [lCw8Wm_26NP5]
Description: Ishiro Honda films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12833 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:54 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 464, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:54 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:03:54 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:03:54 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@6b7f908c[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:03:54 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:03:54 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:03:54 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:03:54 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 464, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:54 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:03:54 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=whale-james, 224151 bytes
I 2022/06/09 11:03:54 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=schorm-evald, 225264 bytes
I 2022/06/09 11:03:54 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=whale-james, STACKING TIME = 0, PARSING TIME = 77
I 2022/06/09 11:03:54 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/56c15770f21d506909cd9d6b510b0a48/L8eD9ereAY71I07uEdI3FcfpnJ94d2_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1808-2d46a12c7b3bb2aca94ace66d3c9c0e9/nyO6RFFEuME4UWTQK8qfR39YzD6NJK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/shop/browse?director=schorm-evald, STACKING TIME = 1, PARSING TIME = 64
I 2022/06/09 11:03:54 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/2c244881659f91c4d02f4aac96e3a255/dZJriyxOrnEIuAJkUrSaL6xMyN6xgc_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:03:54 REJECTED https://s3.amazonaws.com/criterion-production/films/d407b2a9114561fd548088c18652efce/0ks95xPCZSXsSm0dAivUUE6gjtn3qJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:54 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=whale-james
I 2022/06/09 11:03:54 Fulltext indexing: f1HM-m_26NP5 https://www.criterion.com/shop/browse/list?director=whale-james
I 2022/06/09 11:03:54 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[f1HM-m_26NP5 (1735154870480011264)]} 0 8
I 2022/06/09 11:03:54 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=whale-james [f1HM-m_26NP5]
Description: James Whale films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12459 bytes |
LinkStorageTime: 14 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:55 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=schorm-evald
I 2022/06/09 11:03:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[_HnYEm_26NP5 (1735154870547120128)]} 0 2
I 2022/06/09 11:03:55 Fulltext indexing: _HnYEm_26NP5 https://www.criterion.com/shop/browse?director=schorm-evald
I 2022/06/09 11:03:55 SWITCHBOARD *Indexed 1206 words in URL https://www.criterion.com/shop/browse?director=schorm-evald [_HnYEm_26NP5]
Description: Evald Schorm films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12586 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:55 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 487, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:03:55 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=demille-cecil-b, 224208 bytes
I 2022/06/09 11:03:55 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lanzmann-claude, 224150 bytes
I 2022/06/09 11:03:55 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=demille-cecil-b, STACKING TIME = 1, PARSING TIME = 34
I 2022/06/09 11:03:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://s3.amazonaws.com/criterion-production/films/8f3ff64731a9d9200fbe1129a8982550/k4tHWmKa5KQ68TCAqDJ5TuQU2sg67g_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=lanzmann-claude, STACKING TIME = 0, PARSING TIME = 47
I 2022/06/09 11:03:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://s3.amazonaws.com/criterion-production/films/f7403dbc3446dcc91efc1265b99a3ace/7D5cUVg7BDFSF9gONXPb6PxOwkBRcm_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 12)) = 243
I 2022/06/09 11:03:55 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=demille-cecil-b
I 2022/06/09 11:03:55 Fulltext indexing: eOKAJm_26NP5 https://www.criterion.com/shop/browse/list?director=demille-cecil-b
I 2022/06/09 11:03:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[eOKAJm_26NP5 (1735154871079796736)]} 0 2
I 2022/06/09 11:03:55 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=demille-cecil-b [eOKAJm_26NP5]
Description: Cecil B. DeMille films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12493 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:55 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=lanzmann-claude
I 2022/06/09 11:03:55 Fulltext indexing: 5bgWsm_26NP5 https://www.criterion.com/shop/browse/list?director=lanzmann-claude
I 2022/06/09 11:03:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[5bgWsm_26NP5 (1735154871141662720)]} 0 2
I 2022/06/09 11:03:55 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=lanzmann-claude [5bgWsm_26NP5]
Description: Claude Lanzmann films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12455 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:55 HostQueue forcing crawl-delay of 245 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 510, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 11)) = 244
I 2022/06/09 11:03:55 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=wexler-haskell, 224168 bytes
I 2022/06/09 11:03:55 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=wexler-haskell, STACKING TIME = 1, PARSING TIME = 32
I 2022/06/09 11:03:55 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 REJECTED https://s3.amazonaws.com/criterion-production/films/b8d150ed20e90cb009b9cd92f6b838a2/OkOUDcVNqdlmsGnZOh3S4rYkZfKhfF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:55 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=wexler-haskell
I 2022/06/09 11:03:55 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[5NF_Fm_26NP5 (1735154871420583936)]} 0 2
I 2022/06/09 11:03:55 Fulltext indexing: 5NF_Fm_26NP5 https://www.criterion.com/shop/browse/list?director=wexler-haskell
I 2022/06/09 11:03:55 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=wexler-haskell [5NF_Fm_26NP5]
Description: Haskell Wexler films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12467 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:55 HostQueue forcing crawl-delay of 248 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 517, robots.delay = 0, ((waitig = 258) - (timeSinceLastAccess = 10)) = 248
I 2022/06/09 11:03:55 HTCACHE storing content of url https://www.criterion.com/boxsets/907-eclipse-series-35-i-maidstone-i-and-other-films-by-norman-mailer, 70163 bytes
I 2022/06/09 11:03:55 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=hegedus-chris, 225970 bytes
I 2022/06/09 11:03:56 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/boxsets/907-eclipse-series-35-i-maidstone-i-and-other-films-by-norman-mailer, STACKING TIME = 1, PARSING TIME = 17
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/4b9b9dced5f7cf1ab40d943e30183e6a/YyU3WQoVkp58ve8vCe5sKicekfd4yc_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/a265c6e0eff149acc92b4e6e6a0406ee/zyfZKZJqi1dh2ffO3FYd5B0Tu5ZFq4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/b4ca30ca821019ea383407cef347690a/5V1iR0Wp13z3cDdseceVsPdEHK0UCP_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/c96c2c8e1749c7e8daabe54a2387eb85.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/842b8aa4e519574ecf12551580ccc0c1.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/afd8b8070b406fcf46f0fdbb3ca9dc6a.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1811-266c43d8e78bfa5f6efd201d3afa048d/V4k8sacHUW9LbGxtOeFlX5FTAbo2xm_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=hegedus-chris, STACKING TIME = 1, PARSING TIME = 124
I 2022/06/09 11:03:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/7d49fee67a64a7b818b28d452c0b6600/Yn32RYtD6y6sFn34lfmbjoAlrVxr1C_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/99a4844716cb74f633b512d438c35e84/AzMPeGMS75YpzGDhcQzt0OMv2djJOY_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1845-db3c5bcb75d32e794daaab2880a9cefc/oRg21cXVbdpJhR5WR3oZFrGDkJXY37_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/7d90215f8b43ae9d7f6c256371cb6561/zXxL5nVVxs6mFjV7YGhNhNqiQh2oP0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 SWITCHBOARD Excluded 22 words in URL https://www.criterion.com/boxsets/907-eclipse-series-35-i-maidstone-i-and-other-films-by-norman-mailer
I 2022/06/09 11:03:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[BVVG63_26NP5 (1735154871721525248)]} 0 1
I 2022/06/09 11:03:56 Fulltext indexing: BVVG63_26NP5 https://www.criterion.com/boxsets/907-eclipse-series-35-i-maidstone-i-and-other-films-by-norman-mailer
I 2022/06/09 11:03:56 SWITCHBOARD *Indexed 325 words in URL https://www.criterion.com/boxsets/907-eclipse-series-35-i-maidstone-i-and-other-films-by-norman-mailer [BVVG63_26NP5]
Description: Eclipse Series 35: Maidstone and Other Films by Norman Mailer | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5282 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:56 HostQueue forcing crawl-delay of 259 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 538, robots.delay = 0, ((waitig = 269) - (timeSinceLastAccess = 10)) = 259
I 2022/06/09 11:03:56 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=hegedus-chris
I 2022/06/09 11:03:56 Fulltext indexing: KMFYSm_26NP5 https://www.criterion.com/shop/browse/list?director=hegedus-chris
I 2022/06/09 11:03:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[KMFYSm_26NP5 (1735154871792828416)]} 0 2
I 2022/06/09 11:03:56 SWITCHBOARD *Indexed 1215 words in URL https://www.criterion.com/shop/browse/list?director=hegedus-chris [KMFYSm_26NP5]
Description: Chris Hegedus films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12684 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:56 HostQueue forcing crawl-delay of 259 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 538, robots.delay = 0, ((waitig = 269) - (timeSinceLastAccess = 11)) = 258
I 2022/06/09 11:03:56 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mailer-norman, 225867 bytes
I 2022/06/09 11:03:56 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=brakhage-stan, 225456 bytes
I 2022/06/09 11:03:56 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=mailer-norman, STACKING TIME = 0, PARSING TIME = 54
I 2022/06/09 11:03:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/c69e0a3d8f080846aecb3fab94af1760/87KoyhtPxZFa2tAUsdjGYWdGFkV0Uq_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/b069abe928385f97336491824e25923b/MjNYU96ZyBqDUuGvnpoqXLFz6MVKPS_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/f01fd5cd8ceeda0a6b33f78e25f81d98/dYeHQUY0SAGvBWP5y5tDXGWAYDhgrh_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1811-266c43d8e78bfa5f6efd201d3afa048d/V4k8sacHUW9LbGxtOeFlX5FTAbo2xm_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 HostQueue forcing crawl-delay of 268 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 556, robots.delay = 0, ((waitig = 278) - (timeSinceLastAccess = 10)) = 268
I 2022/06/09 11:03:56 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=brakhage-stan, STACKING TIME = 3, PARSING TIME = 125
I 2022/06/09 11:03:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/e4edb0b54b1d565c27870d2a3826d6b3/fMqyWu4ooJl1NUtQKC1qopRWtM0YQk_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/films/3eb6763798a2beb8c2e66f2cba9bf438/oOjVjvYPIHS1KXuKIHOna0hvBJrCmM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1980-d6f55873e3a1b4901255990fca628280/58sDczfTOsoQ3ZJKh8W7xzcwE95s1f_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=mailer-norman
I 2022/06/09 11:03:56 Fulltext indexing: JQxTom_26NP5 https://www.criterion.com/shop/browse/list?director=mailer-norman
I 2022/06/09 11:03:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[JQxTom_26NP5 (1735154872426168320)]} 0 2
I 2022/06/09 11:03:56 SWITCHBOARD *Indexed 1207 words in URL https://www.criterion.com/shop/browse/list?director=mailer-norman [JQxTom_26NP5]
Description: Norman Mailer films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12622 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:56 SWITCHBOARD Excluded 11 words in URL https://www.criterion.com/shop/browse/list?director=brakhage-stan
I 2022/06/09 11:03:56 HTCACHE storing content of url https://www.criterion.com/current/posts/5666-hirokazu-kore-eda-s-shoplifters, 72234 bytes
I 2022/06/09 11:03:56 Fulltext indexing: _8CRzm_26NP5 https://www.criterion.com/shop/browse/list?director=brakhage-stan
I 2022/06/09 11:03:56 SWITCHBOARD CRAWL: ADDED 74 LINKS FROM https://www.criterion.com/current/posts/5666-hirokazu-kore-eda-s-shoplifters, STACKING TIME = 10, PARSING TIME = 10
I 2022/06/09 11:03:56 REJECTED https://www.theguardian.com/film/2015/may/21/hirokazu-kore-director-our-little-sister-interview - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED http://variety.com/2018/film/reviews/shoplifters-review-manbiki-kazoku-1202809298/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.ioncinema.com/reviews/hirokazu-kore-eda-shoplifters-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.telegraph.co.uk/films/0/shoplifters-reviewa-thrilling-beautiful-tale-toykos-down-and/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[_8CRzm_26NP5 (1735154872514248704)]} 0 12
I 2022/06/09 11:03:56 SWITCHBOARD *Indexed 1203 words in URL https://www.criterion.com/shop/browse/list?director=brakhage-stan [_8CRzm_26NP5]
Description: Stan Brakhage films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12618 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:56 REJECTED http://lwlies.com/festivals/shoplifters-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.youtube.com/embed/3zJ3_JZnH_Q?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://cine-vue.com/2018/05/cannes-2018-shoplifters-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://criterion-production.s3.amazonaws.com/fXh0cxamWbK3FhsKLSPKHIeLqBW4h3.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.rogerebert.com/cannes/cannes-2018-black-kkklansman-shoplifters - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.theguardian.com/film/2018/may/14/shoplifters-review-family-of-thieves-steals-moral-high-ground-and-hearts - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://thefilmstage.com/reviews/cannes-review-hirokazu-kore-edas-shoplifters-finds-empathy-and-grace-in-the-lives-of-petty-criminals/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://filmmakermagazine.com/105338-cannes-2018-dispatch-4-shoplifters-girl-happy-as-lazzaro/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED http://www.indiewire.com/2018/05/shoplifters-review-kore-eda-hirokazu-cannes-2018-1201964195/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.festival-cannes.com/en/festival/films/manbiki-kazoku - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://theplaylist.net/shoplifters-hirokazu-kore-eda-review-20180520/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-9-a-footie-god-transitioning-teen-and-family-of-criminals - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED http://www.vulture.com/2018/05/palme-dor-winner-shoplifters-a-family-on-the-margins.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/shoplifters-hirokazu-koreeda-wonky-family-lament - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://icsfilm.org/reviews/cannes-2018-review-shoplifters-hirokazu-koreeda/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.screendaily.com/reviews/shoplifters-cannes-review/5129318.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 REJECTED https://www.hollywoodreporter.com/review/shoplifters-review-1111546 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:56 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/5666-hirokazu-kore-eda-s-shoplifters
I 2022/06/09 11:03:56 Fulltext indexing: _eUJaG_26NP5 https://www.criterion.com/current/posts/5666-hirokazu-kore-eda-s-shoplifters
I 2022/06/09 11:03:56 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[_eUJaG_26NP5 (1735154872566677504)]} 0 2
I 2022/06/09 11:03:56 SWITCHBOARD *Indexed 419 words in URL https://www.criterion.com/current/posts/5666-hirokazu-kore-eda-s-shoplifters [_eUJaG_26NP5]
Description: Hirokazu Kore-eda’s Shoplifters | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4820 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:56 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=murakami-takashi, 224179 bytes
I 2022/06/09 11:03:57 HostQueue forcing crawl-delay of 273 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 567, robots.delay = 0, ((waitig = 283) - (timeSinceLastAccess = 10)) = 273
I 2022/06/09 11:03:57 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=murakami-takashi, STACKING TIME = 1, PARSING TIME = 58
I 2022/06/09 11:03:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/films/6ce7dad93ae964557df431ad8ab68b39/pbf52N9O0DaEZ49NjRVTnggVgWc7H5_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=murakami-takashi
I 2022/06/09 11:03:57 Fulltext indexing: _gPbhm_26NP5 https://www.criterion.com/shop/browse/list?director=murakami-takashi
I 2022/06/09 11:03:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[_gPbhm_26NP5 (1735154872783732736)]} 0 3
I 2022/06/09 11:03:57 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=murakami-takashi [_gPbhm_26NP5]
Description: Takashi Murakami films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12465 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:57 HTCACHE storing content of url https://www.criterion.com/current/posts/6377-diao-yinan-s-the-wild-goose-lake, 71333 bytes
I 2022/06/09 11:03:57 REJECTED https://lwlies.com/festivals/the-wild-goose-lake-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.youtube.com/embed/gmv7olv2UlI?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 SWITCHBOARD CRAWL: ADDED 61 LINKS FROM https://www.criterion.com/current/posts/6377-diao-yinan-s-the-wild-goose-lake, STACKING TIME = 3, PARSING TIME = 11
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.hollywoodreporter.com/review/wild-goose-lake-nan-fang-che-zhan-de-ju-hui-review-1211950 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://film.avclub.com/is-it-finally-pedro-almodovar-s-year-at-cannes-1834875629 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.timeout.com/london/film/the-wild-goose-lake - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://criterion-production.s3.amazonaws.com/P2I2n0UOMwsQhVMveGjsi2nmtYVczy.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://variety.com/2019/film/reviews/the-wild-goose-lake-review-1203219296/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.screendaily.com/reviews/the-wild-goose-lake-cannes-review/5139610.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.festival-cannes.com/en/festival/films/nan-fang-che-zhan-de-ju-hui - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/6377-diao-yinan-s-the-wild-goose-lake
I 2022/06/09 11:03:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[_ZI14G_26NP5 (1735154872867618816)]} 0 2
I 2022/06/09 11:03:57 Fulltext indexing: _ZI14G_26NP5 https://www.criterion.com/current/posts/6377-diao-yinan-s-the-wild-goose-lake
I 2022/06/09 11:03:57 SWITCHBOARD *Indexed 503 words in URL https://www.criterion.com/current/posts/6377-diao-yinan-s-the-wild-goose-lake [_ZI14G_26NP5]
Description: Diao Yinan’s The Wild Goose Lake | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5916 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:57 HostQueue forcing crawl-delay of 275 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 570, robots.delay = 0, ((waitig = 285) - (timeSinceLastAccess = 11)) = 274
I 2022/06/09 11:03:57 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=solas-humberto, 224732 bytes
I 2022/06/09 11:03:57 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=solas-humberto, STACKING TIME = 4, PARSING TIME = 68
I 2022/06/09 11:03:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/films/d2c835969098c7bd46ae2cd81602f484/BHh2KOSkBsYNtCoEP20YT1DkdPJM14_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1961-3b7b159a9ae3d500459e38e69c96a917/9P9MeXzolFQyY5OhK2XcwZ02e0Y0ZC_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 HostQueue forcing crawl-delay of 276 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 573, robots.delay = 0, ((waitig = 286) - (timeSinceLastAccess = 10)) = 276
I 2022/06/09 11:03:57 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=wicki-bernhard, 224126 bytes
I 2022/06/09 11:03:57 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=solas-humberto
I 2022/06/09 11:03:57 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=wicki-bernhard, STACKING TIME = 0, PARSING TIME = 53
I 2022/06/09 11:03:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/films/dc0c6d72367b18727c846047f2b39cb6/JOpfLu2e0pJYqYoJ67UvpPXUgKcIP1_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[-hZIUm_26NP5 (1735154873337380864)]} 0 2
I 2022/06/09 11:03:57 Fulltext indexing: -hZIUm_26NP5 https://www.criterion.com/shop/browse/list?director=solas-humberto
I 2022/06/09 11:03:57 SWITCHBOARD *Indexed 1199 words in URL https://www.criterion.com/shop/browse/list?director=solas-humberto [-hZIUm_26NP5]
Description: Humberto Solás films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12503 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:57 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=wicki-bernhard
I 2022/06/09 11:03:57 Fulltext indexing: -Vob9m_26NP5 https://www.criterion.com/shop/browse?director=wicki-bernhard
I 2022/06/09 11:03:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[-Vob9m_26NP5 (1735154873418121216)]} 0 2
I 2022/06/09 11:03:57 SWITCHBOARD *Indexed 1187 words in URL https://www.criterion.com/shop/browse?director=wicki-bernhard [-Vob9m_26NP5]
Description: Bernhard Wicki films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12402 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:57 HTCACHE storing content of url https://www.criterion.com/current/posts/7533-throw-down-down-but-not-out, 85720 bytes
I 2022/06/09 11:03:57 SWITCHBOARD CRAWL: ADDED 62 LINKS FROM https://www.criterion.com/current/posts/7533-throw-down-down-but-not-out, STACKING TIME = 2, PARSING TIME = 8
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://criterion-production.s3.amazonaws.com/a1V4dAcO07zWYpvP3Wk3wz0pqau1do.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://criterion-production.s3.amazonaws.com/4CeRWq3CgR9rGMJbQRgC94yS3cgBji.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/films/78f00702358370de10b7256ded97d10b/qh2QGOHZiI77jVyFWnv9ex9XhAUTy0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7533-/8sNgXkWlm7uiw4tAlzRu0WXvBYNj7g_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6325-/gglgHtqmEuUWogYYau0OmPNtrmIlI5_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://criterion-production.s3.amazonaws.com/8pEsEOTYBYeM0w80z5HxZ5gWgUetGg.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 HTCACHE storing content of url https://www.criterion.com/current/posts/124-salesman, 86250 bytes
I 2022/06/09 11:03:57 HostQueue forcing crawl-delay of 274 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 567, robots.delay = 0, ((waitig = 283) - (timeSinceLastAccess = 9)) = 274
I 2022/06/09 11:03:57 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/124-salesman, STACKING TIME = 9, PARSING TIME = 18
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:57 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/7533-throw-down-down-but-not-out
I 2022/06/09 11:03:57 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[9v0OqG_26NP5 (1735154873629933568)]} 0 3
I 2022/06/09 11:03:57 Fulltext indexing: 9v0OqG_26NP5 https://www.criterion.com/current/posts/7533-throw-down-down-but-not-out
I 2022/06/09 11:03:57 SWITCHBOARD *Indexed 1121 words in URL https://www.criterion.com/current/posts/7533-throw-down-down-but-not-out [9v0OqG_26NP5]
Description: Throw Down: Down but Not Out | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 15571 bytes |
LinkStorageTime: 14 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:58 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/124-salesman
I 2022/06/09 11:03:58 Fulltext indexing: 9R1GjG_26NP5 https://www.criterion.com/current/posts/124-salesman
I 2022/06/09 11:03:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[9R1GjG_26NP5 (1735154873671876608)]} 0 2
I 2022/06/09 11:03:58 SWITCHBOARD *Indexed 513 words in URL https://www.criterion.com/current/posts/124-salesman [9R1GjG_26NP5]
Description: Salesman | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6303 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:58 HostQueue forcing crawl-delay of 273 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 567, robots.delay = 0, ((waitig = 283) - (timeSinceLastAccess = 10)) = 273
I 2022/06/09 11:03:58 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bellocchio-marco, 224792 bytes
I 2022/06/09 11:03:58 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=bellocchio-marco, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/films/bfe837a00f091aaa9e9223903ad717e2/PtBArZqMvX55aE1Py7CcJJRqtMFXy7_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 HTCACHE storing content of url https://www.criterion.com/current/posts/540-eclipse-series-1-early-bergman, 90071 bytes
I 2022/06/09 11:03:58 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/540-eclipse-series-1-early-bergman, STACKING TIME = 5, PARSING TIME = 11
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=bellocchio-marco
I 2022/06/09 11:03:58 HostQueue forcing crawl-delay of 228 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 560, robots.delay = 0, ((waitig = 280) - (timeSinceLastAccess = 52)) = 228
I 2022/06/09 11:03:58 Fulltext indexing: 9CrbLm_26NP5 https://www.criterion.com/shop/browse/list?director=bellocchio-marco
I 2022/06/09 11:03:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[9CrbLm_26NP5 (1735154874187776000)]} 0 2
I 2022/06/09 11:03:58 SWITCHBOARD *Indexed 1201 words in URL https://www.criterion.com/shop/browse/list?director=bellocchio-marco [9CrbLm_26NP5]
Description: Marco Bellocchio films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12525 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:58 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/540-eclipse-series-1-early-bergman
I 2022/06/09 11:03:58 Fulltext indexing: 871xIG_26NP5 https://www.criterion.com/current/posts/540-eclipse-series-1-early-bergman
I 2022/06/09 11:03:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[871xIG_26NP5 (1735154874289487872)]} 0 4
I 2022/06/09 11:03:58 SWITCHBOARD *Indexed 1183 words in URL https://www.criterion.com/current/posts/540-eclipse-series-1-early-bergman [871xIG_26NP5]
Description: Eclipse Series 1: Early Bergman | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 17956 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:58 HTCACHE storing content of url https://www.criterion.com/current/posts/4792-screwball-almod-var-in-denver, 68176 bytes
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/current/posts/4792-screwball-almod-var-in-denver, STACKING TIME = 5, PARSING TIME = 5
I 2022/06/09 11:03:58 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED http://secure.denverfilm.org/tickets/film.aspx?id=29482 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/films/b93ed3824aabf3784387991675dde82c/J2DKHtUkEGO5iYbKWnV7TIatSWH6x4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/images/8690-3252a955ea81b65518b6027375bea8d6/womenontheverge_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 HostQueue forcing crawl-delay of 267 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 555, robots.delay = 0, ((waitig = 277) - (timeSinceLastAccess = 10)) = 267
I 2022/06/09 11:03:58 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:58 SWITCHBOARD Excluded 22 words in URL https://www.criterion.com/current/posts/4792-screwball-almod-var-in-denver
I 2022/06/09 11:03:58 Fulltext indexing: 8kpp7G_26NP5 https://www.criterion.com/current/posts/4792-screwball-almod-var-in-denver
I 2022/06/09 11:03:58 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[8kpp7G_26NP5 (1735154874453065728)]} 0 1
I 2022/06/09 11:03:58 SWITCHBOARD *Indexed 335 words in URL https://www.criterion.com/current/posts/4792-screwball-almod-var-in-denver [8kpp7G_26NP5]
Description: Screwball Almodóvar in Denver | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3808 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:58 HostQueue forcing crawl-delay of 267 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 555, robots.delay = 0, ((waitig = 277) - (timeSinceLastAccess = 10)) = 267
I 2022/06/09 11:03:59 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=spielmann-goetz, 224133 bytes
I 2022/06/09 11:03:59 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=spielmann-goetz, STACKING TIME = 4, PARSING TIME = 19
I 2022/06/09 11:03:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://s3.amazonaws.com/criterion-production/films/8bd7bc2e13975cd8733f51d85346f8c0/jv8mOJ3S7gzDFNooyV0zXbNrHG8exO_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 HostQueue forcing crawl-delay of 265 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 555, robots.delay = 0, ((waitig = 277) - (timeSinceLastAccess = 12)) = 265
I 2022/06/09 11:03:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=spielmann-goetz
I 2022/06/09 11:03:59 Fulltext indexing: 795Kcm_26NP5 https://www.criterion.com/shop/browse/list?director=spielmann-goetz
I 2022/06/09 11:03:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[795Kcm_26NP5 (1735154875084308480)]} 0 2
I 2022/06/09 11:03:59 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=spielmann-goetz [795Kcm_26NP5]
Description: Götz Spielmann films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12452 bytes |
LinkStorageTime: 46 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:59 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=sluizer-george, 224237 bytes
I 2022/06/09 11:03:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=sluizer-george, STACKING TIME = 2, PARSING TIME = 20
I 2022/06/09 11:03:59 REJECTED https://s3.amazonaws.com/criterion-production/films/0ff25308283a42cb89949523be052d2a/QpDrkKk7cg9UjwlabbA5jjRjO3zTKh_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 HTCACHE storing content of url https://www.criterion.com/current/posts/6367-annie-silverstein-s-bull, 70184 bytes
I 2022/06/09 11:03:59 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=sluizer-george
I 2022/06/09 11:03:59 SWITCHBOARD CRAWL: ADDED 60 LINKS FROM https://www.criterion.com/current/posts/6367-annie-silverstein-s-bull, STACKING TIME = 2, PARSING TIME = 10
I 2022/06/09 11:03:59 REJECTED https://criterion-production.s3.amazonaws.com/vGXsGevwviBHILC6OE3Xq3bFbSihtr.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.screendaily.com/reviews/bull-cannes-review/5139275.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://variety.com/2019/film/reviews/bull-review-1203215246/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.festival-cannes.com/en/films/bull - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.nytimes.com/2019/05/16/movies/cannes-film-festival-jim-jarmusch.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.hollywoodreporter.com/review/bull-review-1210001 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.rogerebert.com/cannes/cannes-2019-les-miserables-bull - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 Fulltext indexing: 73RYIm_26NP5 https://www.criterion.com/shop/browse/list?director=sluizer-george
I 2022/06/09 11:03:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.youtube.com/embed/j7lHXtbQbvo?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[73RYIm_26NP5 (1735154875264663552)]} 0 7
I 2022/06/09 11:03:59 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=sluizer-george [73RYIm_26NP5]
Description: George Sluizer films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12512 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:59 HostQueue forcing crawl-delay of 262 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 548, robots.delay = 0, ((waitig = 274) - (timeSinceLastAccess = 12)) = 262
I 2022/06/09 11:03:59 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/6367-annie-silverstein-s-bull
I 2022/06/09 11:03:59 Fulltext indexing: 7sBS-G_26NP5 https://www.criterion.com/current/posts/6367-annie-silverstein-s-bull
I 2022/06/09 11:03:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[7sBS-G_26NP5 (1735154875313946624)]} 0 3
I 2022/06/09 11:03:59 SWITCHBOARD *Indexed 439 words in URL https://www.criterion.com/current/posts/6367-annie-silverstein-s-bull [7sBS-G_26NP5]
Description: Annie Silverstein’s Bull | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4941 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:59 HTCACHE storing content of url https://www.criterion.com/current/posts/947-burn, 67143 bytes
I 2022/06/09 11:03:59 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/947-burn, STACKING TIME = 3, PARSING TIME = 8
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:03:59 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/947-burn
I 2022/06/09 11:03:59 Fulltext indexing: 7qda2G_26NP5 https://www.criterion.com/current/posts/947-burn
I 2022/06/09 11:03:59 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[7qda2G_26NP5 (1735154875559313408)]} 0 2
I 2022/06/09 11:03:59 SWITCHBOARD *Indexed 453 words in URL https://www.criterion.com/current/posts/947-burn [7qda2G_26NP5]
Description: Burn! | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5678 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:03:59 HostQueue forcing crawl-delay of 261 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 543, robots.delay = 0, ((waitig = 271) - (timeSinceLastAccess = 10)) = 261
I 2022/06/09 11:03:59 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:00 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:00 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@688a84c1[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wa(7.7.3):C23:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772640046}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:00 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:00 HostQueue forcing crawl-delay of 261 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 543, robots.delay = 0, ((waitig = 271) - (timeSinceLastAccess = 10)) = 261
I 2022/06/09 11:04:00 HostQueue forcing crawl-delay of 258 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 543, robots.delay = 0, ((waitig = 271) - (timeSinceLastAccess = 13)) = 258
I 2022/06/09 11:04:00 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=forman-milos, 226006 bytes
I 2022/06/09 11:04:00 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=forman-milos, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/films/0d9221dc6c6888f70ff2138706dc7bc2/DgNDc3SgTe80p7vTi0Huc6tr7BPNWp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/films/89fd00884467657bae436047de8cd9b2/7RuTtvX525S0ENzFaMXEcuoqKM280Y_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/films/4c558aa85967869d1140e852cb1f0368/oCHrw4PTooq8CIRtFGiXs2NSbWaWu2_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=crabtree-arthur, 225439 bytes
I 2022/06/09 11:04:00 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=forman-milos
I 2022/06/09 11:04:00 Fulltext indexing: 7SQ8Om_26NP5 https://www.criterion.com/shop/browse/list?director=forman-milos
I 2022/06/09 11:04:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[7SQ8Om_26NP5 (1735154876419145728)]} 0 13
I 2022/06/09 11:04:00 SWITCHBOARD *Indexed 1212 words in URL https://www.criterion.com/shop/browse/list?director=forman-milos [7SQ8Om_26NP5]
Description: Miloš Forman films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12753 bytes |
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:00 HostQueue forcing crawl-delay of 261 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 543, robots.delay = 0, ((waitig = 271) - (timeSinceLastAccess = 10)) = 261
I 2022/06/09 11:04:00 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=crabtree-arthur, STACKING TIME = 1, PARSING TIME = 30
I 2022/06/09 11:04:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1812-2bdeb818c107e95738f59894990c22b2/oTM1jWGYaWx6KHPFFGsXiyVmbdpCPi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/films/396742df082949cbe7cd842f4d5fe77e/B2N8OyGqk863KMbcNBoYzgRiBHkrVc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/films/bd2452146ea73af63f76a550204841f9/4yJd8DqIYDNQ5jaoJpFxnPPkbFjdAG_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 HTCACHE storing content of url https://www.criterion.com/current/posts/3892-liv-ullmann-reflects-on-working-with-jan-troell, 70977 bytes
I 2022/06/09 11:04:00 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=crabtree-arthur
I 2022/06/09 11:04:00 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/3892-liv-ullmann-reflects-on-working-with-jan-troell, STACKING TIME = 2, PARSING TIME = 11
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.youtube.com/embed/AQ4UvOczjnw - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/films/aaceb4cad8621ca9617f4c00b8ad4748/5xi0GwA3BbtdOq3TOnIIC17roOR2Pu_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 Fulltext indexing: 7Mn5mm_26NP5 https://www.criterion.com/shop/browse/list?director=crabtree-arthur
I 2022/06/09 11:04:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://s3.amazonaws.com/criterion-production/films/af3a424e036ce6064ba4c3b884c82128/cwe2k8wIo3C0zHyHipEgHMC2IsVcXq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[7Mn5mm_26NP5 (1735154876555460608)]} 0 6
I 2022/06/09 11:04:00 SWITCHBOARD *Indexed 1207 words in URL https://www.criterion.com/shop/browse/list?director=crabtree-arthur [7Mn5mm_26NP5]
Description: Arthur Crabtree films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12613 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:00 HTCACHE storing content of url https://www.criterion.com/current/author/400-jean-luc-godard, 48500 bytes
I 2022/06/09 11:04:00 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/400-jean-luc-godard, STACKING TIME = 5, PARSING TIME = 11
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:00 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/current/posts/3892-liv-ullmann-reflects-on-working-with-jan-troell
I 2022/06/09 11:04:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[7KmHvG_26NP5 (1735154876602646528)]} 0 1
I 2022/06/09 11:04:00 Fulltext indexing: 7KmHvG_26NP5 https://www.criterion.com/current/posts/3892-liv-ullmann-reflects-on-working-with-jan-troell
I 2022/06/09 11:04:00 SWITCHBOARD *Indexed 291 words in URL https://www.criterion.com/current/posts/3892-liv-ullmann-reflects-on-working-with-jan-troell [7KmHvG_26NP5]
Description: Liv Ullmann Reflects on Working with Jan Troell | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3518 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:00 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/current/author/400-jean-luc-godard
I 2022/06/09 11:04:00 Fulltext indexing: 7E0cbG_26NP5 https://www.criterion.com/current/author/400-jean-luc-godard
I 2022/06/09 11:04:00 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[7E0cbG_26NP5 (1735154876606840832)]} 0 1
I 2022/06/09 11:04:00 SWITCHBOARD *Indexed 123 words in URL https://www.criterion.com/current/author/400-jean-luc-godard [7E0cbG_26NP5]
Description: Jean-Luc Godard | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1439 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:00 HostQueue forcing crawl-delay of 257 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 535, robots.delay = 0, ((waitig = 267) - (timeSinceLastAccess = 10)) = 257
I 2022/06/09 11:04:01 HostQueue forcing crawl-delay of 256 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 535, robots.delay = 0, ((waitig = 267) - (timeSinceLastAccess = 11)) = 256
I 2022/06/09 11:04:01 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=palcy-euzhan, 224152 bytes
I 2022/06/09 11:04:01 HTCACHE storing content of url https://www.criterion.com/current/posts/829-bad-day-at-black-rock, 84587 bytes
I 2022/06/09 11:04:01 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=palcy-euzhan, STACKING TIME = 3, PARSING TIME = 92
I 2022/06/09 11:04:01 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://s3.amazonaws.com/criterion-production/films/d3fa0bd2e5949b9e3c861222fb594d95/ASnRN4Kj6AdEv8RJTDrHKhIve2ZFQY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 528, robots.delay = 0, ((waitig = 264) - (timeSinceLastAccess = 17)) = 247
I 2022/06/09 11:04:01 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/829-bad-day-at-black-rock, STACKING TIME = 0, PARSING TIME = 20
I 2022/06/09 11:04:01 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=palcy-euzhan
I 2022/06/09 11:04:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[6eFNWm_26NP5 (1735154877383835648)]} 0 2
I 2022/06/09 11:04:01 Fulltext indexing: 6eFNWm_26NP5 https://www.criterion.com/shop/browse?director=palcy-euzhan
I 2022/06/09 11:04:01 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse?director=palcy-euzhan [6eFNWm_26NP5]
Description: Euzhan Palcy films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12416 bytes |
LinkStorageTime: 7 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:01 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/829-bad-day-at-black-rock
I 2022/06/09 11:04:01 Fulltext indexing: 6XhF7G_26NP5 https://www.criterion.com/current/posts/829-bad-day-at-black-rock
I 2022/06/09 11:04:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[6XhF7G_26NP5 (1735154877427875840)]} 0 2
I 2022/06/09 11:04:01 SWITCHBOARD *Indexed 524 words in URL https://www.criterion.com/current/posts/829-bad-day-at-black-rock [6XhF7G_26NP5]
Description: Bad Day at Black Rock | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6905 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:01 HTCACHE storing content of url https://www.criterion.com/current/posts/6358-jim-jarmusch-s-the-dead-don-t-die, 74227 bytes
I 2022/06/09 11:04:01 SWITCHBOARD CRAWL: ADDED 66 LINKS FROM https://www.criterion.com/current/posts/6358-jim-jarmusch-s-the-dead-don-t-die, STACKING TIME = 2, PARSING TIME = 6
I 2022/06/09 11:04:01 REJECTED https://www.theguardian.com/film/2019/may/14/the-dead-dont-die-review-stumbling-zombie-comedy-kicks-off-cannes - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.festival-cannes.com/en/festival/films/the-dead-dont-die - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.filmcomment.com/blog/cannes-interview-jim-jarmusch/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.thedailybeast.com/cannes-the-dead-dont-die-bill-murray-and-adam-driver-fight-zombies-in-maga-infested-small-town-america - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-the-dead-dont-die-jarmusch-chang-notebook-20190514-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.screendaily.com/reviews/the-dead-dont-die-cannes-review/5139365.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.youtube.com/embed/bs5ZOcU6Bnw?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.indiewire.com/2019/05/dead-dont-die-review-jim-jarmusch-cannes-1202140841/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.telegraph.co.uk/films/0/dead-dont-die-review-winningly-eccentric-way-usher-zombie-apocalypse/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://variety.com/2019/film/reviews/the-dead-dont-die-review-adam-driver-bill-murray-1203213609/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.youtube.com/embed/bqhcRsKcaSA?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://film.avclub.com/jim-jarmusch-opens-cannes-with-a-doa-zombie-comedy-for-1834767725 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://www.vulture.com/2019/05/adam-driver-has-a-star-wars-keychain-in-the-dead-dont-die.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 REJECTED https://criterion-production.s3.amazonaws.com/XqH4rpetEhKyH3ltKC1W0VARoD7ueE.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:01 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/6358-jim-jarmusch-s-the-dead-don-t-die
I 2022/06/09 11:04:01 Fulltext indexing: 6Wez0G_26NP5 https://www.criterion.com/current/posts/6358-jim-jarmusch-s-the-dead-don-t-die
I 2022/06/09 11:04:01 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[6Wez0G_26NP5 (1735154877545316352)]} 0 3
I 2022/06/09 11:04:01 SWITCHBOARD *Indexed 593 words in URL https://www.criterion.com/current/posts/6358-jim-jarmusch-s-the-dead-don-t-die [6Wez0G_26NP5]
Description: Jim Jarmusch’s The Dead Don’t Die | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7027 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:01 HostQueue forcing crawl-delay of 252 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 524, robots.delay = 0, ((waitig = 262) - (timeSinceLastAccess = 11)) = 251
I 2022/06/09 11:04:01 HostQueue forcing crawl-delay of 252 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 524, robots.delay = 0, ((waitig = 262) - (timeSinceLastAccess = 10)) = 252
I 2022/06/09 11:04:02 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=lent-dean, 224199 bytes
I 2022/06/09 11:04:02 HTCACHE storing content of url https://www.criterion.com/current/posts/5664-jafar-panani-s-3-faces, 74016 bytes
I 2022/06/09 11:04:02 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=lent-dean, STACKING TIME = 0, PARSING TIME = 71
I 2022/06/09 11:04:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/films/2131124bf11dd19cde56a791c8fc54f9/o5Y9AGkM9iZWMr9AQm46QYzEAvubcV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 SWITCHBOARD CRAWL: ADDED 77 LINKS FROM https://www.criterion.com/current/posts/5664-jafar-panani-s-3-faces, STACKING TIME = 6, PARSING TIME = 17
I 2022/06/09 11:04:02 REJECTED https://www.thewrap.com/three-faces-film-review-jafar-panahi-modest-profound/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://time.com/5275701/cannes-review-3-faces-and-leto/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-cannes-day-six/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.rogerebert.com/cannes/cannes-2018-3-faces-happy-as-lazzaro - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/three-faces-jafar-panahi-road-trip - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED http://www.indiewire.com/2018/05/three-faces-review-jafar-panahi-cannes-2018-1201963927/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.hollywoodreporter.com/review/3-faces-1111436 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-cannes-diary-3-faces-girls-of-the-sun-20180513-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.slantmagazine.com/film/review/three-faces - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.theguardian.com/film/2018/may/13/three-faces-review-jafar-panahis-latest-is-calm-modest-and-inscrutable - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://criterion-production.s3.amazonaws.com/EZWMzUKXsDCzWsNMXjpSoQCLymCM4N.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED http://desistfilm.com/cannes-2018-three-faces-by-jafar-panahi/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://thefilmstage.com/reviews/cannes-review-jafar-panahis-3-faces-is-a-loose-empathetic-ultimately-minor-work/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-6-fortnight-provocations-jafar-panahi-s-3-faces - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.timeout.com/london/film/three-faces - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED http://www.anothergaze.com/cannes-review-jafar-panahis-three-faces-se-rokh-feminism/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://player.vimeo.com/video/269321294 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://icsfilm.org/reviews/cannes-2018-review-three-faces-jafar-panahi/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.screendaily.com/reviews/3-faces-cannes-review/5129269.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-7-two-gentle-competitors-and-war-s-dirty-work - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://cine-vue.com/2018/05/cannes-2018-three-faces-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://film.avclub.com/me-too-looms-over-a-cannes-contender-while-michael-b-1825992063 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED http://www.indiewire.com/2018/05/cannes-asghar-farhadi-iran-jafar-panahi-1201962155/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.festival-cannes.com/en/festival/films/se-rokh - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 10)) = 249
I 2022/06/09 11:04:02 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=lent-dean
I 2022/06/09 11:04:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[6QuCBm_26NP5 (1735154878168170496)]} 0 2
I 2022/06/09 11:04:02 Fulltext indexing: 6QuCBm_26NP5 https://www.criterion.com/shop/browse/list?director=lent-dean
I 2022/06/09 11:04:02 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=lent-dean [6QuCBm_26NP5]
Description: Dean Lent films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12500 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:02 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/5664-jafar-panani-s-3-faces
I 2022/06/09 11:04:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[6J-abG_26NP5 (1735154878194384896)]} 0 2
I 2022/06/09 11:04:02 Fulltext indexing: 6J-abG_26NP5 https://www.criterion.com/current/posts/5664-jafar-panani-s-3-faces
I 2022/06/09 11:04:02 SWITCHBOARD *Indexed 501 words in URL https://www.criterion.com/current/posts/5664-jafar-panani-s-3-faces [6J-abG_26NP5]
Description: Jafar Panani’s 3 Faces | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6060 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:02 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 13)) = 246
I 2022/06/09 11:04:02 HostQueue forcing crawl-delay of 249 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 518, robots.delay = 0, ((waitig = 259) - (timeSinceLastAccess = 11)) = 248
I 2022/06/09 11:04:02 HTCACHE storing content of url https://www.criterion.com/current/posts/6371-ladj-ly-s-les-mis-rables, 70318 bytes
I 2022/06/09 11:04:02 SWITCHBOARD CRAWL: ADDED 61 LINKS FROM https://www.criterion.com/current/posts/6371-ladj-ly-s-les-mis-rables, STACKING TIME = 1, PARSING TIME = 8
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.hollywoodreporter.com/review/les-miserables-review-1210837 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.avclub.com/john-carpenter-looms-over-a-day-of-madness-and-violence-1834808856 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.festival-cannes.com/en/films/les-miserables - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://variety.com/2019/film/reviews/les-miserables-review-ladj-ly-1203215299/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-chang-notebook-bacurau-les-miserables-deerskin-20190516-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.screendaily.com/reviews/les-miserables-cannes-review/5139434.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://variety.com/2019/film/news/les-miserables-ladj-ly-cannes-1203213054/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://criterion-production.s3.amazonaws.com/vrVZUF2gnI6EzybV4EWd7HdQNZbhDk.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://deadline.com/2019/05/les-miserables-ladj-ly-cannes-ones-to-watch-news-1202609462/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:02 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/6371-ladj-ly-s-les-mis-rables
I 2022/06/09 11:04:02 Fulltext indexing: 5_VL_G_26NP5 https://www.criterion.com/current/posts/6371-ladj-ly-s-les-mis-rables
I 2022/06/09 11:04:02 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[5_VL_G_26NP5 (1735154878800461824)]} 0 1
I 2022/06/09 11:04:02 SWITCHBOARD *Indexed 466 words in URL https://www.criterion.com/current/posts/6371-ladj-ly-s-les-mis-rables [5_VL_G_26NP5]
Description: Ladj Ly’s Les misérables | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5515 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:03 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 515, robots.delay = 0, ((waitig = 257) - (timeSinceLastAccess = 10)) = 247
I 2022/06/09 11:04:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=schloendorff-volker, 226471 bytes
I 2022/06/09 11:04:03 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=schloendorff-volker, STACKING TIME = 1, PARSING TIME = 29
I 2022/06/09 11:04:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/films/1522135761424c32d477da9851f016ff/33GcylWNQvIKPneqIDDpcJnFPoUtSg_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/films/eae2794b63eb565a9518d0ddc6dc4800/MfEB5BHM5873COrnchiYFPxygZYNP8_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/films/ea8616829a288b5b6d680c9f6b66ba59/03UzOLZzQogXDtQOTkIp8BbLpZWGYM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/films/22a10f46e2d950d3ab907e62e119bd61/QMb2egiiChRALGyT7ZrOLX40p36N5I_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/films/bed1dc8df02842d6a75325665e718ebd/da8xTBLVhcfx0KQXSyOOMImKRe6s2r_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 HostQueue forcing crawl-delay of 248 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 520, robots.delay = 0, ((waitig = 260) - (timeSinceLastAccess = 12)) = 248
I 2022/06/09 11:04:03 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=schloendorff-volker
I 2022/06/09 11:04:03 Fulltext indexing: 6ENX0m_26NP5 https://www.criterion.com/shop/browse/list?director=schloendorff-volker
I 2022/06/09 11:04:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[6ENX0m_26NP5 (1735154879221989376)]} 0 4
I 2022/06/09 11:04:03 SWITCHBOARD *Indexed 1218 words in URL https://www.criterion.com/shop/browse/list?director=schloendorff-volker [6ENX0m_26NP5]
Description: Volker Schlöndorff films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12736 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:03 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=cooper-stuart, 224157 bytes
I 2022/06/09 11:04:03 HTCACHE storing content of url https://www.criterion.com/films/27976-le-mariage-de-chiffon, 69778 bytes
I 2022/06/09 11:04:03 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=cooper-stuart, STACKING TIME = 4, PARSING TIME = 122
I 2022/06/09 11:04:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/films/86111580875e28e5a36dc225c9926266/uh9IJVswl55y0VrTNtQMZC8WqezjDK_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/films/27976-le-mariage-de-chiffon, STACKING TIME = 5, PARSING TIME = 14
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1816-e188e2102b63387dbe23fc67edb6beea/DWwbQG5RL4lfG3HYvO9uUZ78amems4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/films/a61bf7df3267a8394798540b0e6a6243/xThd7HAKnTfFlGkcDzVIv63rhI8uQY_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/le-mariage-de-chiffon?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/3fc67ec38885a5dcd3850a16b1c407c5.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/b3363f12356972cb4bcd9d373485c6bf.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/a56882f74145f32d151529581d8ccbc4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/images/9518-74f3b68e17d1c132c4fb7dd0555570cc/Current_29404id_015_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/a92a695782aefb9bf1b7f37e04b79f55.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/7266a91d9ed9392127fd4477f886555d.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 HTCACHE storing content of url https://www.criterion.com/current/posts/14-testament-of-orpheus, 104367 bytes
I 2022/06/09 11:04:03 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=cooper-stuart
I 2022/06/09 11:04:03 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 515, robots.delay = 0, ((waitig = 257) - (timeSinceLastAccess = 11)) = 246
I 2022/06/09 11:04:03 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/14-testament-of-orpheus, STACKING TIME = 9, PARSING TIME = 15
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/14-/nuD2oZxtALXuReZ0cCwK2FeOovtgh5_original.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 Fulltext indexing: 5ZmQXm_26NP5 https://www.criterion.com/shop/browse/list?director=cooper-stuart
I 2022/06/09 11:04:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[5ZmQXm_26NP5 (1735154879504056320)]} 0 10
I 2022/06/09 11:04:03 SWITCHBOARD *Indexed 1188 words in URL https://www.criterion.com/shop/browse/list?director=cooper-stuart [5ZmQXm_26NP5]
Description: Stuart Cooper films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12475 bytes |
LinkStorageTime: 12 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:03 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/films/27976-le-mariage-de-chiffon
I 2022/06/09 11:04:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[5FVQFe_26NP5 (1735154879533416448)]} 0 1
I 2022/06/09 11:04:03 Fulltext indexing: 5FVQFe_26NP5 https://www.criterion.com/films/27976-le-mariage-de-chiffon
I 2022/06/09 11:04:03 SWITCHBOARD *Indexed 256 words in URL https://www.criterion.com/films/27976-le-mariage-de-chiffon [5FVQFe_26NP5]
Description: Le mariage de Chiffon (1942) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2992 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:03 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/14-testament-of-orpheus
I 2022/06/09 11:04:03 Fulltext indexing: 4xNmNG_26NP5 https://www.criterion.com/current/posts/14-testament-of-orpheus
I 2022/06/09 11:04:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4xNmNG_26NP5 (1735154879602622464)]} 0 3
I 2022/06/09 11:04:03 SWITCHBOARD *Indexed 729 words in URL https://www.criterion.com/current/posts/14-testament-of-orpheus [4xNmNG_26NP5]
Description: Testament of Orpheus | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 9824 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:03 HTCACHE storing content of url https://www.criterion.com/current/posts/322-smiles-of-a-summer-night, 88086 bytes
I 2022/06/09 11:04:03 SWITCHBOARD CRAWL: ADDED 41 LINKS FROM https://www.criterion.com/current/posts/322-smiles-of-a-summer-night, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:04:03 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://s3.amazonaws.com/criterion-production/images/4250-27575b9122a650cb8d59f57d1c15e84b/current_446_019b_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:03 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 511, robots.delay = 0, ((waitig = 255) - (timeSinceLastAccess = 11)) = 244
I 2022/06/09 11:04:03 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/322-smiles-of-a-summer-night
I 2022/06/09 11:04:03 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4wf_HG_26NP5 (1735154879838552064)]} 0 2
I 2022/06/09 11:04:03 Fulltext indexing: 4wf_HG_26NP5 https://www.criterion.com/current/posts/322-smiles-of-a-summer-night
I 2022/06/09 11:04:03 SWITCHBOARD *Indexed 560 words in URL https://www.criterion.com/current/posts/322-smiles-of-a-summer-night [4wf_HG_26NP5]
Description: Smiles of a Summer Night | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7496 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:04 HTCACHE storing content of url https://www.criterion.com/current/posts/6366-quentin-dupieux-s-deerskin, 70511 bytes
I 2022/06/09 11:04:04 SWITCHBOARD CRAWL: ADDED 61 LINKS FROM https://www.criterion.com/current/posts/6366-quentin-dupieux-s-deerskin, STACKING TIME = 2, PARSING TIME = 9
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://variety.com/2019/film/reviews/deerskin-review-jean-dujardin-1203215532/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.quinzaine-realisateurs.com/en/film/le-daim/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://cineuropa.org/en/interview/372504/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://criterion-production.s3.amazonaws.com/uUn3G2uj9wKt1RpmJfx40g0l2KMco0.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.rogerebert.com/cannes/cannes-2019-deerskin-opens-directors-fortnight-chlo%C3%AB-sevigny-in-the-dead-dont-die - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-2-of-zombies-and-deerskin-jackets - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.avclub.com/john-carpenter-looms-over-a-day-of-madness-and-violence-1834808856 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://filmmakermagazine.com/107516-cannes-2019-dispatch-1-the-dead-dont-die-deerskin/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.screendaily.com/reviews/deerskin-cannes-review/5139425.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 508, robots.delay = 0, ((waitig = 254) - (timeSinceLastAccess = 10)) = 244
I 2022/06/09 11:04:04 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/6366-quentin-dupieux-s-deerskin
I 2022/06/09 11:04:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4hMR6G_26NP5 (1735154880093356032)]} 0 1
I 2022/06/09 11:04:04 Fulltext indexing: 4hMR6G_26NP5 https://www.criterion.com/current/posts/6366-quentin-dupieux-s-deerskin
I 2022/06/09 11:04:04 SWITCHBOARD *Indexed 468 words in URL https://www.criterion.com/current/posts/6366-quentin-dupieux-s-deerskin [4hMR6G_26NP5]
Description: Quentin Dupieux’s Deerskin | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5681 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:04 HTCACHE storing content of url https://www.criterion.com/current/posts/131-thoughts-on-my-m-tier, 151602 bytes
I 2022/06/09 11:04:04 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/131-thoughts-on-my-m-tier, STACKING TIME = 0, PARSING TIME = 9
I 2022/06/09 11:04:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 504, robots.delay = 0, ((waitig = 252) - (timeSinceLastAccess = 14)) = 238
I 2022/06/09 11:04:04 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/131-thoughts-on-my-m-tier
I 2022/06/09 11:04:04 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 504, robots.delay = 0, ((waitig = 252) - (timeSinceLastAccess = 11)) = 241
I 2022/06/09 11:04:04 Fulltext indexing: 4dSmbG_26NP5 https://www.criterion.com/current/posts/131-thoughts-on-my-m-tier
I 2022/06/09 11:04:04 HTCACHE storing content of url https://www.criterion.com/current/posts/3831-polanski-comes-to-pennsylvania, 71359 bytes
I 2022/06/09 11:04:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4dSmbG_26NP5 (1735154880609255424)]} 0 12
I 2022/06/09 11:04:04 SWITCHBOARD *Indexed 925 words in URL https://www.criterion.com/current/posts/131-thoughts-on-my-m-tier [4dSmbG_26NP5]
Description: Thoughts on My Métier | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 20047 bytes |
LinkStorageTime: 15 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:04 SWITCHBOARD CRAWL: ADDED 57 LINKS FROM https://www.criterion.com/current/posts/3831-polanski-comes-to-pennsylvania, STACKING TIME = 4, PARSING TIME = 10
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6502-/VcDUKU8LPywb5MgflsEJCCJOgfrUNv_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/images/6445-161c0526b926d93dd4f9f05e68cb81bc/Screen_Shot_2015-12-10_at_4.33.11_PM_medium.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6827-/sYS7xZWV366q9OANUqtCwimvTnL90D_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED http://www.brynmawrfilm.org/films/?id=1586 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.youtube.com/embed/zxyIxHzv3Lg?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/films/9466ce24b77e3fa3a9ade373aac1fa2a/yNmDmRc9rtQN79bkL8PcRrpb8OjiKO_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/films/4886db5d9452d2247ff729d35aa839a6/rXjMmW28vBI3WW7qEj2gO6X7NKmq6A_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6701-/ETx6s7z5azkjRZIl1n1268y6oIFngD_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6516-/1ReaXkdoavjctz1ZBIEqTXIfC0QqZK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:04 SWITCHBOARD Excluded 18 words in URL https://www.criterion.com/current/posts/3831-polanski-comes-to-pennsylvania
I 2022/06/09 11:04:04 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4XPxyG_26NP5 (1735154880651198464)]} 0 1
I 2022/06/09 11:04:04 Fulltext indexing: 4XPxyG_26NP5 https://www.criterion.com/current/posts/3831-polanski-comes-to-pennsylvania
I 2022/06/09 11:04:04 SWITCHBOARD *Indexed 288 words in URL https://www.criterion.com/current/posts/3831-polanski-comes-to-pennsylvania [4XPxyG_26NP5]
Description: Polanski Comes to Pennsylvania | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3271 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:04 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 502, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:05 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mackenzie-john, 224205 bytes
I 2022/06/09 11:04:05 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=mackenzie-john, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/films/504873f293ee8d776c9cec4f561f5e89/4nrkOmePov50Z8C4NPOfo39MOjpnFC_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 501, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:05 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=mackenzie-john
I 2022/06/09 11:04:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4J7Bzm_26NP5 (1735154881155563520)]} 0 2
I 2022/06/09 11:04:05 Fulltext indexing: 4J7Bzm_26NP5 https://www.criterion.com/shop/browse/list?director=mackenzie-john
I 2022/06/09 11:04:05 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=mackenzie-john [4J7Bzm_26NP5]
Description: John Mackenzie films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12483 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:05 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=yamamoto-satsuo, 224667 bytes
I 2022/06/09 11:04:05 HTCACHE storing content of url https://www.criterion.com/films/28161-sylvie-et-le-fant-me, 70133 bytes
I 2022/06/09 11:04:05 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=yamamoto-satsuo, STACKING TIME = 1, PARSING TIME = 64
I 2022/06/09 11:04:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 497, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/films/49725f835ddd6df80a60f59f7abe0d5d/PqicagnIq5S2W893XvoCRS7erOmZY0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/films/28161-sylvie-et-le-fant-me, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1816-e188e2102b63387dbe23fc67edb6beea/DWwbQG5RL4lfG3HYvO9uUZ78amems4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/765cad30aa2857eb5594ca4e81a08206.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/c102d3594d0728ecac2392d00f25b3af.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/48e426be110d417f0f0cabb4f3bbfdf6.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5dd09cf8c4caced97dc37af723f6e2a0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/d225c8e031535adfb65e8d0b245e7251.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/films/5ce838d5726233a5338bdec40372bbb2/E5pq3whV8OAou99bMtvdG2Pfs44MZZ_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/images/9518-74f3b68e17d1c132c4fb7dd0555570cc/Current_29404id_015_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/sylvie-et-le-fantome?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse?director=yamamoto-satsuo
I 2022/06/09 11:04:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[4HHUZm_26NP5 (1735154881490059264)]} 0 2
I 2022/06/09 11:04:05 Fulltext indexing: 4HHUZm_26NP5 https://www.criterion.com/shop/browse?director=yamamoto-satsuo
I 2022/06/09 11:04:05 SWITCHBOARD *Indexed 1197 words in URL https://www.criterion.com/shop/browse?director=yamamoto-satsuo [4HHUZm_26NP5]
Description: Satsuo Yamamoto films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12475 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:05 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/films/28161-sylvie-et-le-fant-me
I 2022/06/09 11:04:05 Fulltext indexing: 39m_Ve_26NP5 https://www.criterion.com/films/28161-sylvie-et-le-fant-me
I 2022/06/09 11:04:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[39m_Ve_26NP5 (1735154881505787904)]} 0 1
I 2022/06/09 11:04:05 SWITCHBOARD *Indexed 274 words in URL https://www.criterion.com/films/28161-sylvie-et-le-fant-me [39m_Ve_26NP5]
Description: Sylvie et le fantôme (1946) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3061 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:05 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 497, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:05 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:05 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:05 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@4837be3f[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wa(7.7.3):C23:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772640046}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wb(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, 
java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772645671}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:05 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:05 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=masaki-domoto, 224169 bytes
I 2022/06/09 11:04:05 HTCACHE storing content of url https://www.criterion.com/films/30710-once-upon-a-time-in-china-iii, 68346 bytes
I 2022/06/09 11:04:05 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=masaki-domoto, STACKING TIME = 1, PARSING TIME = 21
I 2022/06/09 11:04:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/films/30710-once-upon-a-time-in-china-iii, STACKING TIME = 1, PARSING TIME = 18
I 2022/06/09 11:04:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/films/1afc31bda9087f75091fae936b5c1ca0/1HMg9MpF1yL5AAmyha7kzSbACv2zcV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 493, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/bsaP2l8ijGKBWdecgUWq31hxkg13t4aiCHQTRVpS.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/DA42q8y7RzaLyevLGRM149bAUctAIpixwckpOcsK.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/7KEHmMK1NGqpC2qEFPremNV8smzzcZnwyNuPpPF9.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/mswKwu3iDiKWY0Ha34DyxvddGvgamoSffH3msXni.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1987-beb71f216e96d1ff2d0f8231f5b8b975/44LVkvftLRcr5paF4enJfBFTe5mI2c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/films/e8d4cd2c5dd1b2541a3c8325a6d1805f/W5gKGPvtkm5evOJ9devGYL935KMAdy_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/films/6fd3976cf806d6fad6dc70712c3e9ebc/pOFw8IdzoEUt6wKFT2vFl4kmGkrRBS_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://s3.amazonaws.com/criterion-production/films/cd4ae6bdbaea9fd9c9aff1d69f924bc4/5wErYoFwVfkciAfnpRbFIhPqv7tIC5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5648wX1MGoaAPzoXbvIXJbFwRj4MXyTPczOoRwmk.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/GBq3frqudeqNiaJaU4govHpeiOIr4t75Yp27kxNn.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:05 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=masaki-domoto
I 2022/06/09 11:04:05 Fulltext indexing: 3poNlm_26NP5 https://www.criterion.com/shop/browse/list?director=masaki-domoto
I 2022/06/09 11:04:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[3poNlm_26NP5 (1735154882008055808)]} 0 7
I 2022/06/09 11:04:05 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=masaki-domoto [3poNlm_26NP5]
Description: Domoto Masaki films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12468 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:05 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/films/30710-once-upon-a-time-in-china-iii
I 2022/06/09 11:04:05 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[3mnsFe_26NP5 (1735154882023784448)]} 0 1
I 2022/06/09 11:04:05 Fulltext indexing: 3mnsFe_26NP5 https://www.criterion.com/films/30710-once-upon-a-time-in-china-iii
I 2022/06/09 11:04:05 SWITCHBOARD *Indexed 285 words in URL https://www.criterion.com/films/30710-once-upon-a-time-in-china-iii [3mnsFe_26NP5]
Description: Once Upon a Time in China III (1993) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2908 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:06 HTCACHE storing content of url https://www.criterion.com/current/posts/6391-bong-joon-ho-s-parasite, 71531 bytes
I 2022/06/09 11:04:06 SWITCHBOARD CRAWL: ADDED 61 LINKS FROM https://www.criterion.com/current/posts/6391-bong-joon-ho-s-parasite, STACKING TIME = 1, PARSING TIME = 7
I 2022/06/09 11:04:06 REJECTED https://www.youtube.com/embed/CEIwFAQ-Rec?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://variety.com/2019/film/markets-festivals/parasite-review-1203221435/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.screendaily.com/reviews/parasite-cannes-review/5139675.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.vulture.com/2019/05/bong-joon-hos-parasite-is-a-nerve-racking-masterpiece.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.indiewire.com/2019/05/parasite-review-bong-joon-ho-1202143634/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-9-burning-witches-and-invasive-parasites - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://criterion-production.s3.amazonaws.com/ETyZVcI3CyvZ367FjX7uQ81P502zsH.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.festival-cannes.com/en/festival/films/gisaengchung - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://twitter.com/CriterionDaily/status/1128679385958055936 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 490, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
I 2022/06/09 11:04:06 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/6391-bong-joon-ho-s-parasite
I 2022/06/09 11:04:06 Fulltext indexing: 3djoBG_26NP5 https://www.criterion.com/current/posts/6391-bong-joon-ho-s-parasite
I 2022/06/09 11:04:06 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[3djoBG_26NP5 (1735154882197848064)]} 0 2
I 2022/06/09 11:04:06 SWITCHBOARD *Indexed 526 words in URL https://www.criterion.com/current/posts/6391-bong-joon-ho-s-parasite [3djoBG_26NP5]
Description: Bong Joon-ho’s Parasite | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6155 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:06 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 490, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
I 2022/06/09 11:04:06 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=fejos-paul, 224124 bytes
I 2022/06/09 11:04:06 HTCACHE storing content of url https://www.criterion.com/current/posts/5682-lee-chang-dong-s-burning, 74346 bytes
I 2022/06/09 11:04:06 HostQueue forcing crawl-delay of 243 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 486, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 9)) = 242
I 2022/06/09 11:04:06 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=fejos-paul, STACKING TIME = 1, PARSING TIME = 86
I 2022/06/09 11:04:06 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/films/6ccd9970a004695bc2132c7edfbe8a35/Qvz32YpStw1QSbbdvZG2faTZW3ROlx_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 SWITCHBOARD CRAWL: ADDED 84 LINKS FROM https://www.criterion.com/current/posts/5682-lee-chang-dong-s-burning, STACKING TIME = 7, PARSING TIME = 17
I 2022/06/09 11:04:06 REJECTED https://www.screendaily.com/reviews/burning-cannes-review/5129452.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.festival-cannes.com/en/festival/films/burning - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.screendaily.com/burning-director-lee-chang-dong-on-his-ambiguous-cannes-competition-title/5129466.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.telegraph.co.uk/films/0/burning-review-daring-study-class-conflict-sexual-longing-blazes/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://theplaylist.net/burning-lee-chang-dong-review-20180521/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://desistfilm.com/cannes-2018-burning-by-lee-chang-dong/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.youtube.com/embed/wi6Kw7V8gXk?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://lwlies.com/festivals/burning-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://www.indiewire.com/2018/05/burning-review-lee-chang-dong-steven-yeung-1201963075/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/burning-lee-chang-dong-love-triangle - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.timeout.com/london/film/burning - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.screendaily.com/news/burning-sets-record-score-in-history-of-screens-cannes-jury-grid/5129480.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.villagevoice.com/2018/05/17/sex-obsession-and-class-in-under-the-silver-lake-and-burning/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.slantmagazine.com/house/article/cannes-film-review-burning - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.ioncinema.com/reviews/lee-chang-dong-burning-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-10-dogs-and-disappearances - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.thewrap.com/burning-film-review-korean-auteur-lee-chang-dong-returns-breathtaking-drama/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.filmcomment.com/blog/cannes-interview-lee-chang-dong/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://filmmakermagazine.com/105353-cannes-dispatch-6-under-the-silver-lake-burning/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://variety.com/2018/film/reviews/burning-review-beoning-1202812196/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://film.avclub.com/han-solo-brings-not-enough-fun-to-cannes-while-a-super-1826108687 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://blog.lareviewofbooks.org/the-korea-blog/burning-acclaimed-korean-auteurs-explosive-haruki-murakami-adapting-indictment-inequality/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.hollywoodreporter.com/review/burning-review-1112684 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://screenanarchy.com/2018/05/cannes-2018-review-a-slowburn-film-for-the-ages.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.rogerebert.com/cannes/cannes-2018-under-the-silver-lake-burning-sofia - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://criterion-production.s3.amazonaws.com/URsAZWniEsD11hFfMd76Umdp7mL7Su.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.filmcomment.com/blog/film-comment-podcast-cannes-day-nine/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://icsfilm.org/reviews/cannes-2018-review-burning-lee-chang-dong/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-cannes-diary-burning-under-the-silver-lake-20180518-htmlstory.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED http://variety.com/2018/film/asia/lee-chang-dong-burning-cannes-1202812485/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://thefilmstage.com/reviews/cannes-review-lee-chang-dongs-burning-turns-haruki-murakami-into-a-frothy-page-turner/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 REJECTED https://www.theguardian.com/film/2018/may/17/burning-review-cannes-2018-lee-chang-dong - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:06 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=fejos-paul
I 2022/06/09 11:04:06 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[3Ujdum_26NP5 (1735154882854256640)]} 0 2
I 2022/06/09 11:04:06 Fulltext indexing: 3Ujdum_26NP5 https://www.criterion.com/shop/browse/list?director=fejos-paul
I 2022/06/09 11:04:06 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=fejos-paul [3Ujdum_26NP5]
Description: Paul Fejos films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12448 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:06 SWITCHBOARD Excluded 24 words in URL https://www.criterion.com/current/posts/5682-lee-chang-dong-s-burning
I 2022/06/09 11:04:06 Fulltext indexing: 2qQTvG_26NP5 https://www.criterion.com/current/posts/5682-lee-chang-dong-s-burning
I 2022/06/09 11:04:06 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[2qQTvG_26NP5 (1735154882886762496)]} 0 2
I 2022/06/09 11:04:06 SWITCHBOARD *Indexed 532 words in URL https://www.criterion.com/current/posts/5682-lee-chang-dong-s-burning [2qQTvG_26NP5]
Description: Lee Chang-dong’s Burning | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6071 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:06 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 486, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 14)) = 237
I 2022/06/09 11:04:07 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=wallen-sigurd, 224863 bytes
I 2022/06/09 11:04:07 HostQueue forcing crawl-delay of 244 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 486, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 7)) = 244
I 2022/06/09 11:04:07 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=wallen-sigurd, STACKING TIME = 1, PARSING TIME = 86
I 2022/06/09 11:04:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/films/bb3d92ac28b94e8cbad1c169e15230eb/gxgY0bIbXgkuyFw4Zs4uwe1uD5PxnV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=wallen-sigurd
I 2022/06/09 11:04:07 Fulltext indexing: 2n1TUm_26NP5 https://www.criterion.com/shop/browse/list?director=wallen-sigurd
I 2022/06/09 11:04:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[2n1TUm_26NP5 (1735154883404759040)]} 0 3
I 2022/06/09 11:04:07 SWITCHBOARD *Indexed 1207 words in URL https://www.criterion.com/shop/browse/list?director=wallen-sigurd [2n1TUm_26NP5]
Description: Sigurd Wallén films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12567 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:07 HTCACHE storing content of url https://www.criterion.com/current/posts/925-shampoo, 78833 bytes
I 2022/06/09 11:04:07 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/925-shampoo, STACKING TIME = 1, PARSING TIME = 14
I 2022/06/09 11:04:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 483, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:07 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/925-shampoo
I 2022/06/09 11:04:07 Fulltext indexing: 2i4NtG_26NP5 https://www.criterion.com/current/posts/925-shampoo
I 2022/06/09 11:04:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[2i4NtG_26NP5 (1735154883570434048)]} 0 2
I 2022/06/09 11:04:07 SWITCHBOARD *Indexed 459 words in URL https://www.criterion.com/current/posts/925-shampoo [2i4NtG_26NP5]
Description: Shampoo | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5794 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:07 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=downey-sr-robert, 227087 bytes
I 2022/06/09 11:04:07 HTCACHE storing content of url https://www.criterion.com/current/posts/272-by-brakhage-the-act-of-seeing, 153393 bytes
I 2022/06/09 11:04:07 SWITCHBOARD CRAWL: ADDED 48 LINKS FROM https://www.criterion.com/shop/browse/list?director=downey-sr-robert, STACKING TIME = 5, PARSING TIME = 118
I 2022/06/09 11:04:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/films/a40089af5a99e65997b7ab84b63c23e6/xwkezz5WNkgtavKsf03aC9CxPq4gTb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/films/c0d938eed365a6326549a2fb003d4589/OMnBlaUrCmq1E4ByzpCG0lAb5dZlQ5_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/films/8c9be029b45b7c8197c07e2721a341db/DeYMEjJ5dfkpnMVkD3M2pK0FuAfLI5_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/films/81aefdc48e19d12c4fa0664e7f575b59/ZwhEP5AiwjZhejZ0ytK4Ot53TtIMWF_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/films/3e73456f5c7208db818f141785bf900b/0rdTEhbswvC2ho9oNe9c6EmXIqG3Rw_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1809-a4a8b84c4cbcababe9073629fd726b50/S4JbdHupEZur0VszttgXmG8hjRdrG2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 481, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 13)) = 238
I 2022/06/09 11:04:07 SWITCHBOARD CRAWL: ADDED 54 LINKS FROM https://www.criterion.com/current/posts/272-by-brakhage-the-act-of-seeing, STACKING TIME = 11, PARSING TIME = 26
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/images/4243-0f328f44ac319717208352f8fe157a51/current_brakhage_vol1_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=downey-sr-robert
I 2022/06/09 11:04:07 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[2nMDPm_26NP5 (1735154883922755584)]} 0 2
I 2022/06/09 11:04:07 Fulltext indexing: 2nMDPm_26NP5 https://www.criterion.com/shop/browse/list?director=downey-sr-robert
I 2022/06/09 11:04:07 SWITCHBOARD *Indexed 1221 words in URL https://www.criterion.com/shop/browse/list?director=downey-sr-robert [2nMDPm_26NP5]
Description: Robert Downey Sr. films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12784 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:07 HTCACHE storing content of url https://www.criterion.com/films/625, 76103 bytes
I 2022/06/09 11:04:07 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 479, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:07 SWITCHBOARD CRAWL: ADDED 66 LINKS FROM https://www.criterion.com/films/625, STACKING TIME = 2, PARSING TIME = 19
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1838-ee328a31205b114ef125fd81b54b5cd0/VZGhEsbGQY3luUNqMc64IKmXGoRe9U_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 SWITCHBOARD Excluded 32 words in URL https://www.criterion.com/current/posts/272-by-brakhage-the-act-of-seeing
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/be0e802e835ac927eaf3be41589e23e0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9895f84c63decd17b0d0872839293364.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/7bb275f48c0307c9ee9a06b299e6cddc.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/films/aed45dbeaf63414624b16890eb458dea/fSaXGJq2BBkowhHXw2FJft0UoNIdnI_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/4b512a2c2a810f90efdc04722db22a9e.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/2b24b88da82ecfe8de682516f9cc49e4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:07 REJECTED https://s3.amazonaws.com/criterion-production/posts/1131-e236cb8c87ec5b809c2301982224d1ec/IVAN_rosenbaum_still_1_original.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/ivan-the-terrible-part-i-1?utm_source=criterion.com&utm_medium=referral&utm_campaign=watch-now&utm_content=film - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e10a3d39181c428ca5e8575bd45f4a2d.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/films/19c902a73e243b9293de4c717430f639/H1rWEdJtowN7Xh9vSnOvPU9I6y0AgT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/72e6245e795d55b442929cdadb96db68.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/49bfa779abf0094c67408068cf05ad61.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/films/647527e5987ba1f01e714df49e95acdb/caQS8NJz8DXvLNcdgnxxZJn7pZtC6l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/films/b653a1545863d441cf9b8a8bc50946b8/SXcj1Zf8bWoyaoUEuzlFAc4gPNGwYm_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/fe1eb0a685837b6e81e09ec8ab055ff4.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/ba1402f35a62007453909e128fe39483.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/explore_images/1040-d47e967cfc342baabd16848454af7e7a/GlT2mkYmPezx59HuDM1tGeFVmRgx9l_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 Fulltext indexing: 11VNgG_26NP5 https://www.criterion.com/current/posts/272-by-brakhage-the-act-of-seeing
I 2022/06/09 11:04:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[11VNgG_26NP5 (1735154884170219520)]} 0 10
I 2022/06/09 11:04:08 SWITCHBOARD *Indexed 1175 words in URL https://www.criterion.com/current/posts/272-by-brakhage-the-act-of-seeing [11VNgG_26NP5]
Description: By Brakhage: The Act of Seeing . . . | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 19804 bytes |
LinkStorageTime: 11 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:08 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/films/625
I 2022/06/09 11:04:08 Fulltext indexing: 11MH4e_26NP5 https://www.criterion.com/films/625
I 2022/06/09 11:04:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[11MH4e_26NP5 (1735154884190142464)]} 0 1
I 2022/06/09 11:04:08 SWITCHBOARD *Indexed 345 words in URL https://www.criterion.com/films/625 [11MH4e_26NP5]
Description: Ivan the Terrible, Part I (1944) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4157 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:08 HTCACHE storing content of url https://www.criterion.com/current/posts/4421-on-the-channel-art-house-america, 72911 bytes
I 2022/06/09 11:04:08 REJECTED https://www.youtube.com/embed/aflQZYXVj9U?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7777-/bTILAb13gYYFwneHHCb5dHHD1VjRf7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7707-/hZxxnAQeKCi5vjiDKzvdNuftSnkdOM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://watch.filmstruck.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/4421-on-the-channel-art-house-america, STACKING TIME = 4, PARSING TIME = 6
I 2022/06/09 11:04:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7807-/DXN9QiWNnsXTuLss0TTJ4r9JspRu5B_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7738-/e35rwdsj2UHHYOdCCz8E5Qr8oxxKXj_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/current/posts/4421-on-the-channel-art-house-america
I 2022/06/09 11:04:08 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[1nwvFG_26NP5 (1735154884323311616)]} 0 1
I 2022/06/09 11:04:08 Fulltext indexing: 1nwvFG_26NP5 https://www.criterion.com/current/posts/4421-on-the-channel-art-house-america
I 2022/06/09 11:04:08 SWITCHBOARD *Indexed 285 words in URL https://www.criterion.com/current/posts/4421-on-the-channel-art-house-america [1nwvFG_26NP5]
Description: Celebrating Twenty-Five Years at the Walter Reade Theater | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3754 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:08 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 476, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 11)) = 240
I 2022/06/09 11:04:08 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 476, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:08 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 476, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 10)) = 241
I 2022/06/09 11:04:08 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=ripley-arthur, 224214 bytes
I 2022/06/09 11:04:08 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=ripley-arthur, STACKING TIME = 1, PARSING TIME = 26
I 2022/06/09 11:04:08 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:08 REJECTED https://s3.amazonaws.com/criterion-production/films/3a4a52811b630a9836c1b10cb2c55a38/1DZVBE8PnMfkggyvh5s9f7K2TSAiF0_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 1, host.average = 478, robots.delay = 0, ((waitig = 251) - (timeSinceLastAccess = 12)) = 239
I 2022/06/09 11:04:09 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=ripley-arthur
I 2022/06/09 11:04:09 Fulltext indexing: 1noiim_26NP5 https://www.criterion.com/shop/browse/list?director=ripley-arthur
I 2022/06/09 11:04:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[1noiim_26NP5 (1735154885271224320)]} 0 2
I 2022/06/09 11:04:09 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=ripley-arthur [1noiim_26NP5]
Description: Arthur Ripley films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12490 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:09 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=king-henry, 224151 bytes
I 2022/06/09 11:04:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://s3.amazonaws.com/criterion-production/films/5df5a988519a21f6f8902cb68a50fad2/PNke8tz8SXjbs27HUThiZYSvYcNnnm_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=king-henry, STACKING TIME = 3, PARSING TIME = 21
I 2022/06/09 11:04:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 HostQueue forcing crawl-delay of 292 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 606, robots.delay = 0, ((waitig = 303) - (timeSinceLastAccess = 11)) = 292
I 2022/06/09 11:04:09 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=king-henry
I 2022/06/09 11:04:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[1ghXim_26NP5 (1735154885521833984)]} 0 2
I 2022/06/09 11:04:09 Fulltext indexing: 1ghXim_26NP5 https://www.criterion.com/shop/browse/list?director=king-henry
I 2022/06/09 11:04:09 SWITCHBOARD *Indexed 1191 words in URL https://www.criterion.com/shop/browse/list?director=king-henry [1ghXim_26NP5]
Description: Henry King films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12471 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:09 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=barnes-rick, 224243 bytes
I 2022/06/09 11:04:09 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=barnes-rick, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:04:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://s3.amazonaws.com/criterion-production/films/45ae4aaeb01b65d3788e09d553527a0d/O5urSQ3UPS5ELUhh25uHIS7M5ri1p3_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 HTCACHE storing content of url https://www.criterion.com/current/posts/6388-corneliu-porumboiu-s-the-whistlers, 71472 bytes
I 2022/06/09 11:04:09 HTCACHE storing content of url https://www.criterion.com/current/author/228-peggy-chiao, 48309 bytes
I 2022/06/09 11:04:09 SWITCHBOARD CRAWL: ADDED 63 LINKS FROM https://www.criterion.com/current/posts/6388-corneliu-porumboiu-s-the-whistlers, STACKING TIME = 5, PARSING TIME = 10
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.hollywoodreporter.com/review/whistlers-review-1211959 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.telegraph.co.uk/films/2019/05/21/whistlers-review-crowd-pleasing-romanian-thriller-twists-rival/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://thefilmstage.com/reviews/the-whistlers-cannes-review-corneliu-porumboiu/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://criterion-production.s3.amazonaws.com/X5Xy6NKtMSYwuCNn9Icpg1cqBcl1v8.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.festival-cannes.com/en/festival/films/la-gomera - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.indiewire.com/2019/05/the-whistlers-review-cannes-2019-1202142660/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://variety.com/2019/film/reviews/the-whistlers-review-1203219289/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.rogerebert.com/cannes/cannes-2019-a-hidden-life-the-whistlers - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.youtube.com/embed/mUvaj4JB92c?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://cineuropa.org/en/video/372947/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://lwlies.com/festivals/the-whistlers-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/228-peggy-chiao, STACKING TIME = 3, PARSING TIME = 7
I 2022/06/09 11:04:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=barnes-rick
I 2022/06/09 11:04:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[1gKOJm_26NP5 (1735154885776637952)]} 0 2
I 2022/06/09 11:04:09 Fulltext indexing: 1gKOJm_26NP5 https://www.criterion.com/shop/browse/list?director=barnes-rick
I 2022/06/09 11:04:09 SWITCHBOARD *Indexed 1193 words in URL https://www.criterion.com/shop/browse/list?director=barnes-rick [1gKOJm_26NP5]
Description: Rick Barnes films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12514 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:09 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/current/posts/6388-corneliu-porumboiu-s-the-whistlers
I 2022/06/09 11:04:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[1QdEyG_26NP5 (1735154885815435264)]} 0 1
I 2022/06/09 11:04:09 Fulltext indexing: 1QdEyG_26NP5 https://www.criterion.com/current/posts/6388-corneliu-porumboiu-s-the-whistlers
I 2022/06/09 11:04:09 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 506, robots.delay = 0, ((waitig = 253) - (timeSinceLastAccess = 12)) = 241
I 2022/06/09 11:04:09 SWITCHBOARD *Indexed 480 words in URL https://www.criterion.com/current/posts/6388-corneliu-porumboiu-s-the-whistlers [1QdEyG_26NP5]
Description: Corneliu Porumboiu's The Whistlers | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5731 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:09 SWITCHBOARD Excluded 14 words in URL https://www.criterion.com/current/author/228-peggy-chiao
I 2022/06/09 11:04:09 Fulltext indexing: 1Dcu1G_26NP5 https://www.criterion.com/current/author/228-peggy-chiao
I 2022/06/09 11:04:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[1Dcu1G_26NP5 (1735154885825921024)]} 0 5
I 2022/06/09 11:04:09 SWITCHBOARD *Indexed 117 words in URL https://www.criterion.com/current/author/228-peggy-chiao [1Dcu1G_26NP5]
Description: Peggy Chiao | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1405 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:09 HTCACHE storing content of url https://www.criterion.com/current/posts/793-damage, 81789 bytes
I 2022/06/09 11:04:09 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/793-damage, STACKING TIME = 4, PARSING TIME = 15
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:09 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/793-damage
I 2022/06/09 11:04:09 Fulltext indexing: 1CScDG_26NP5 https://www.criterion.com/current/posts/793-damage
I 2022/06/09 11:04:09 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[1CScDG_26NP5 (1735154886055559168)]} 0 3
I 2022/06/09 11:04:09 SWITCHBOARD *Indexed 511 words in URL https://www.criterion.com/current/posts/793-damage [1CScDG_26NP5]
Description: Damage | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6505 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:09 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 449, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:09 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
I 2022/06/09 11:04:09 org.apache.solr.update.SolrIndexWriter Calling setCommitData with IW:org.apache.solr.update.SolrIndexWriter@fed4ccdd commitCommandVersion:0
I 2022/06/09 11:04:10 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:10 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 449, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:10 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 449, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:10 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mori-kazuo, 225861 bytes
I 2022/06/09 11:04:10 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=mori-kazuo, STACKING TIME = 1, PARSING TIME = 21
I 2022/06/09 11:04:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/films/7af61b7756472ca8199f565fdeb12028/FIguIMVIcFtDOh64HtZSpHOpMikBev_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/films/48e20200c8ad2b49e92c13e4b51b878e/7Hi7SnpUWOxqdkJWIDzdJMJ18iMT3l_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/films/97236ed3236ed2fe13222e582614f584/cd7YjlRypX4kuUTaDvc0KuJix0vKIq_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=mori-kazuo
I 2022/06/09 11:04:10 Fulltext indexing: 0bE0Am_26NP5 https://www.criterion.com/shop/browse/list?director=mori-kazuo
I 2022/06/09 11:04:10 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 475, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[0bE0Am_26NP5 (1735154886868205568)]} 0 4
I 2022/06/09 11:04:10 SWITCHBOARD *Indexed 1204 words in URL https://www.criterion.com/shop/browse/list?director=mori-kazuo [0bE0Am_26NP5]
Description: Kazuo Mori films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12599 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:10 LOADER CRAWLER Redirection detected ('HTTP/1.1 302 Found') for URL https://www.criterion.com/films/28401-le-coup-du-berger
I 2022/06/09 11:04:10 LOADER CRAWLER ..Redirecting request to: https://www.criterion.com/shop/browse
I 2022/06/09 11:04:10 REJECTED https://www.criterion.com/films/28401-le-coup-du-berger - cannot load: load error - CRAWLER Redirect of URL=https://www.criterion.com/films/28401-le-coup-du-berger aborted. Reason : double in: local index, recrawl rejected. Document date = 2022-06-09T10:35:14Z is not older than crawl profile recrawl minimum date = 2022-06-06T10:33:47Z
I 2022/06/09 11:04:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zqS77e_26NP5 (1735154886884982784)]} 0 0
I 2022/06/09 11:04:10 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=howard-leslie, 224735 bytes
I 2022/06/09 11:04:10 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=howard-leslie, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/films/6b60c41a20b0ad93142016f0fbaef104/iPWNkGkmts56OruZK3c8nVR4whMZnz_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1896-83fe48a59fe1f7eb29f0307e3f2f63f4/zQ235ICl6OxHik6DAkN1liJaj0i6Mt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 HTCACHE storing content of url https://www.criterion.com/current/posts/895-that-obscure-object-of-desire, 86431 bytes
I 2022/06/09 11:04:10 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/895-that-obscure-object-of-desire, STACKING TIME = 6, PARSING TIME = 6
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 469, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:10 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:10 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=howard-leslie
I 2022/06/09 11:04:10 Fulltext indexing: 0a-Qcm_26NP5 https://www.criterion.com/shop/browse/list?director=howard-leslie
I 2022/06/09 11:04:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[0a-Qcm_26NP5 (1735154887198507008)]} 0 2
I 2022/06/09 11:04:10 SWITCHBOARD *Indexed 1200 words in URL https://www.criterion.com/shop/browse/list?director=howard-leslie [0a-Qcm_26NP5]
Description: Leslie Howard films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12506 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:10 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/895-that-obscure-object-of-desire
I 2022/06/09 11:04:10 Fulltext indexing: zp9gyG_26NP5 https://www.criterion.com/current/posts/895-that-obscure-object-of-desire
I 2022/06/09 11:04:10 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zp9gyG_26NP5 (1735154887239401472)]} 0 2
I 2022/06/09 11:04:10 SWITCHBOARD *Indexed 536 words in URL https://www.criterion.com/current/posts/895-that-obscure-object-of-desire [zp9gyG_26NP5]
Description: That Obscure Object of Desire | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6478 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:10 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:10 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:10 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@71d3ad97[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wa(7.7.3):C23:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772640046}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wb(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, 
java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772645671}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wc(7.7.3):C17:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650026}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wd(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650982}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:10 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:11 HTCACHE storing content of url https://www.criterion.com/current/posts/3628-mike-leigh-on-here-is-your-life, 67398 bytes
I 2022/06/09 11:04:11 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/3628-mike-leigh-on-here-is-your-life, STACKING TIME = 1, PARSING TIME = 6
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 443, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/ccf8a3f2353002103ef420fd02fe2585/cE4nJ2rcnsqFoXZOGdTHQz1j9zLv3e_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.youtube.com/embed/5_h0-qlaXJQ?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 SWITCHBOARD Excluded 17 words in URL https://www.criterion.com/current/posts/3628-mike-leigh-on-here-is-your-life
I 2022/06/09 11:04:11 Fulltext indexing: zlMRsG_26NP5 https://www.criterion.com/current/posts/3628-mike-leigh-on-here-is-your-life
I 2022/06/09 11:04:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zlMRsG_26NP5 (1735154887431290880)]} 0 4
I 2022/06/09 11:04:11 SWITCHBOARD *Indexed 229 words in URL https://www.criterion.com/current/posts/3628-mike-leigh-on-here-is-your-life [zlMRsG_26NP5]
Description: Mike Leigh on Here Is Your Life | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2809 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:11 HTCACHE storing content of url https://www.criterion.com/current/posts/4362-mike-mills-on-ermanno-olmi, 72500 bytes
I 2022/06/09 11:04:11 SWITCHBOARD CRAWL: ADDED 59 LINKS FROM https://www.criterion.com/current/posts/4362-mike-mills-on-ermanno-olmi, STACKING TIME = 3, PARSING TIME = 6
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7112-/hrna9FD9i5qHmiKkha2h6GpHr8PIC7_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6256-/4hMn9W7CAftp0gtTU1Bbha4vrOtvXT_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.youtube.com/embed/Z1O78lJcG38 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6686-/bX7f4Hl2eqYz3ky5RjEhlInoO8aIGs_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/images/7893-da77a549233b01dc715154249fe325ae/mikemills_medium.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6816-/fNhsKcWROyxHmEbTJYzKacXRM237Ks_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/b6bc2282c440de5a08e438da562afd24/pMjW1z3Aj37rZu94vHlLeKwhRPdoUS_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/8cc71f23f230fe161df0a179a2f01231/7ssZeQ2jRKasoS7tOXcxzgfpIfQUW2_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 SWITCHBOARD Excluded 16 words in URL https://www.criterion.com/current/posts/4362-mike-mills-on-ermanno-olmi
I 2022/06/09 11:04:11 Fulltext indexing: zkG6ZG_26NP5 https://www.criterion.com/current/posts/4362-mike-mills-on-ermanno-olmi
I 2022/06/09 11:04:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zkG6ZG_26NP5 (1735154887637860352)]} 0 2
I 2022/06/09 11:04:11 SWITCHBOARD *Indexed 315 words in URL https://www.criterion.com/current/posts/4362-mike-mills-on-ermanno-olmi [zkG6ZG_26NP5]
Description: How Mike Mills Found His Personal Xanax in Ermanno Olmi | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3546 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:11 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 419, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:11 HTCACHE storing content of url https://www.criterion.com/boxsets/1315-eclipse-series-46-ingrid-bergman-s-swedish-years, 76900 bytes
I 2022/06/09 11:04:11 SWITCHBOARD CRAWL: ADDED 58 LINKS FROM https://www.criterion.com/boxsets/1315-eclipse-series-46-ingrid-bergman-s-swedish-years, STACKING TIME = 1, PARSING TIME = 9
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/9996db3918ccc4883b794ee704519da8/4CH5LetGEqAUwmswsiHcjrrI5g391u_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/bba8824f8e897196a1c8a369b9f32895.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/5d68f28a94fd0c7a636d339c700f18da.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/b1c5726f28557349778715f2669d281f.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/9ba6690cdb52249f75c88a440af3a17d/QNziiO2rH60L8rkE6UUQLyqXtUP9xe_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/f7177049e94bac92fa832936393a285a/8ViwZZJ13W05NXuGqStR9RsFKC7pC8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/165865803353029e91cd7a56e54343ee/xeeq04EX9dZPDxBYgVMACgSuKo1QR0_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/2454da4125c59af610290354e40e0d52/poCiOLIClDu8KUVya0UipOnRRK2l2K_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1915-d024b93e4429f5b05d7f0bdc8d59c415/shXfQUMZTWnY6hrjrAHIhcBlosHu4c_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9048dc851ea9db8750e0b7443d38e1f0.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/725a0b071d12dda96b2a9166aa58f612.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/002ee3fff2fa24b51027f4e5cdc34611.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 REJECTED https://s3.amazonaws.com/criterion-production/films/1966ababf422a56a0d641b70eaa1a254/8fU0aq1fZWMXdDpLXZHLtft1ulTmKU_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:11 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 402, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:11 SWITCHBOARD Excluded 26 words in URL https://www.criterion.com/boxsets/1315-eclipse-series-46-ingrid-bergman-s-swedish-years
I 2022/06/09 11:04:11 Fulltext indexing: zTYTp3_26NP5 https://www.criterion.com/boxsets/1315-eclipse-series-46-ingrid-bergman-s-swedish-years
I 2022/06/09 11:04:11 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zTYTp3_26NP5 (1735154887968161792)]} 0 2
I 2022/06/09 11:04:11 SWITCHBOARD *Indexed 460 words in URL https://www.criterion.com/boxsets/1315-eclipse-series-46-ingrid-bergman-s-swedish-years [zTYTp3_26NP5]
Description: Eclipse Series 46: Ingrid Bergman’s Swedish Years | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 8886 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:11 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 402, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:11 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=czinner-paul, 224776 bytes
I 2022/06/09 11:04:12 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse?director=czinner-paul, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/films/db4b0658b462dab4c656c1733822252c/sbintTYaK6jy93GAcfuS5PqBGywpvM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1792-9c7fc9fe72c1452bdcb47e93fc30d9fc/upVBpiNpMTno5TapW08e3GoWZshBAi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 402, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 8)) = 242
I 2022/06/09 11:04:12 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=czinner-paul
I 2022/06/09 11:04:12 Fulltext indexing: zTMX1m_26NP5 https://www.criterion.com/shop/browse?director=czinner-paul
I 2022/06/09 11:04:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zTMX1m_26NP5 (1735154888526004224)]} 0 2
I 2022/06/09 11:04:12 SWITCHBOARD *Indexed 1196 words in URL https://www.criterion.com/shop/browse?director=czinner-paul [zTMX1m_26NP5]
Description: Paul Czinner films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12496 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:12 HTCACHE storing content of url https://www.criterion.com/boxsets/668-eclipse-series-18-du-an-makavejev-free-radical, 70347 bytes
I 2022/06/09 11:04:12 SWITCHBOARD CRAWL: ADDED 49 LINKS FROM https://www.criterion.com/boxsets/668-eclipse-series-18-du-an-makavejev-free-radical, STACKING TIME = 2, PARSING TIME = 10
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/9b4eac914529c71c5806468863dedb87.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/films/a8daec98296f9e21f813139361c08a2a/1k1sTMq4rVxWuM1VwIWzYpkqgHTjSi_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/films/1f7e43bbb0d702398581a23013b9acb8/kKZ2RBIZ29jnau5RtJVxbzzqZKXzTr_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1794-050c8e7aa2d3ba1ddbd296d108239cf1/shvY9H0K9PWixHcIAuuAUuKL7GcP2V_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/4c122232afdac6885dc5fbd348b40d5c.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/d14b5c6cc7b01a919bcd1efcae8ca0c7.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/films/184cb93c556a4b405f1db06bdaeab4ea/JKh7nu0W3zyvUJ4slsaap6gUKloUWR_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 HostQueue forcing crawl-delay of 236 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 390, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 14)) = 236
I 2022/06/09 11:04:12 SWITCHBOARD Excluded 21 words in URL https://www.criterion.com/boxsets/668-eclipse-series-18-du-an-makavejev-free-radical
I 2022/06/09 11:04:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zARbj3_26NP5 (1735154888776613888)]} 0 2
I 2022/06/09 11:04:12 Fulltext indexing: zARbj3_26NP5 https://www.criterion.com/boxsets/668-eclipse-series-18-du-an-makavejev-free-radical
I 2022/06/09 11:04:12 SWITCHBOARD *Indexed 360 words in URL https://www.criterion.com/boxsets/668-eclipse-series-18-du-an-makavejev-free-radical [zARbj3_26NP5]
Description: Eclipse Series 18: Dušan Makavejev—Free Radical | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6199 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:12 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kopple-barbara, 224169 bytes
I 2022/06/09 11:04:12 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kopple-barbara, STACKING TIME = 1, PARSING TIME = 18
I 2022/06/09 11:04:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/films/f5361612515871553628e26305ac00a5/yc3y38kQLI8Mz6XaqTG0KhvdViYpyZ_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=kopple-barbara
I 2022/06/09 11:04:12 Fulltext indexing: zRLRvm_26NP5 https://www.criterion.com/shop/browse/list?director=kopple-barbara
I 2022/06/09 11:04:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[zRLRvm_26NP5 (1735154888919220224)]} 0 2
I 2022/06/09 11:04:12 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=kopple-barbara [zRLRvm_26NP5]
Description: Barbara Kopple films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12461 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:12 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 404, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:12 HTCACHE storing content of url https://www.criterion.com/current/posts/2043-three-reasons-the-phantom-carriage, 65844 bytes
I 2022/06/09 11:04:12 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/2043-three-reasons-the-phantom-carriage, STACKING TIME = 1, PARSING TIME = 51
I 2022/06/09 11:04:12 REJECTED https://www.youtube.com/embed/HoqCsMUoN1c?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/films/7265b13395ec259ff98672237c54b4c6/hl8OoSNAgm9ND4Fh7ksjUzNXplspyF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:12 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 391, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:12 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ynwZ4G_26NP5 (1735154889272590336)]} 0 1
I 2022/06/09 11:04:12 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/posts/2043-three-reasons-the-phantom-carriage
I 2022/06/09 11:04:12 Fulltext indexing: ynwZ4G_26NP5 https://www.criterion.com/current/posts/2043-three-reasons-the-phantom-carriage
I 2022/06/09 11:04:12 SWITCHBOARD *Indexed 215 words in URL https://www.criterion.com/current/posts/2043-three-reasons-the-phantom-carriage [ynwZ4G_26NP5]
Description: Three Reasons: The Phantom Carriage | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2522 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:13 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=ikehiro-kazuo, 225581 bytes
I 2022/06/09 11:04:13 SWITCHBOARD CRAWL: ADDED 51 LINKS FROM https://www.criterion.com/shop/browse?director=ikehiro-kazuo, STACKING TIME = 2, PARSING TIME = 19
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1829-e54c495813226f69d09f05ddc914b0e2/0VeHCin2ALFJtf0af05BebTL2pamd4_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 410, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:13 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/736b3991921d9b1506d38e34ad0f28dc/wJwW7Y4WCtnBXGG8eFNU1EmD8DctrV_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/854f9925e05d05e1498b3712df1ae24f/scED6dxhACXAKRX6ORQm6RjxZyKHvQ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/dd560c50567da0e7848c98570c7eaed8/Vrl4qmICJyQhToXmjIGg0AL98jK6QM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse?director=ikehiro-kazuo
I 2022/06/09 11:04:13 Fulltext indexing: yqxqbm_26NP5 https://www.criterion.com/shop/browse?director=ikehiro-kazuo
I 2022/06/09 11:04:13 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[yqxqbm_26NP5 (1735154889617571840)]} 0 2
I 2022/06/09 11:04:13 SWITCHBOARD *Indexed 1203 words in URL https://www.criterion.com/shop/browse?director=ikehiro-kazuo [yqxqbm_26NP5]
Description: Kazuo Ikehiro films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12550 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:13 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 410, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:13 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=jires-jaromil, 225916 bytes
I 2022/06/09 11:04:13 HostQueue forcing crawl-delay of 166 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 428, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 84)) = 166
I 2022/06/09 11:04:13 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/b87cdc112105192e4d3abfb82eef63c0/Cyz9cny8EdPcAycuJL9YFWkofSffVM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/2c244881659f91c4d02f4aac96e3a255/dZJriyxOrnEIuAJkUrSaL6xMyN6xgc_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1808-2d46a12c7b3bb2aca94ace66d3c9c0e9/nyO6RFFEuME4UWTQK8qfR39YzD6NJK_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/759805461c3e5365a323e779cf4e2b9c/ZZwAHHUZZsmWREsrXPnPLtlJ1i58Wb_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=jires-jaromil, STACKING TIME = 4, PARSING TIME = 105
I 2022/06/09 11:04:13 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=jires-jaromil
I 2022/06/09 11:04:13 Fulltext indexing: yntzXm_26NP5 https://www.criterion.com/shop/browse/list?director=jires-jaromil
I 2022/06/09 11:04:13 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[yntzXm_26NP5 (1735154890215260160)]} 0 2
I 2022/06/09 11:04:13 SWITCHBOARD *Indexed 1211 words in URL https://www.criterion.com/shop/browse/list?director=jires-jaromil [yntzXm_26NP5]
Description: Jaromil Jireš films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12642 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:13 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 428, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:13 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=costa-pedro, 225867 bytes
I 2022/06/09 11:04:13 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/shop/browse/list?director=costa-pedro, STACKING TIME = 1, PARSING TIME = 26
I 2022/06/09 11:04:13 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/c8fa6dc833c2a454531e48b4c36030ea/ZfYymz0KrSklWG1WRfDL4ZhT653erp_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/6d11fa2755ba7b9ae8fef609d52c7d2a/EUCxeXlhyFlnIoUOXvT7PGXA0YMhSf_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1851-dba6177f0bd538f868c12a4c87760e83/tLbF78x2hqXHJGqSbQ5j7qW9izcNvg_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:13 REJECTED https://s3.amazonaws.com/criterion-production/films/1f26021625d2e2e4b8ca84ebe942f854/ewcI4BpdaNjfLaahutp5MgA2scdl0v_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=liu-bing, 224190 bytes
I 2022/06/09 11:04:14 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=costa-pedro
I 2022/06/09 11:04:14 HTCACHE storing content of url https://www.criterion.com/current/author/296-keiko-mcdonald-thomas-rimer, 49597 bytes
I 2022/06/09 11:04:14 Fulltext indexing: ylJiNm_26NP5 https://www.criterion.com/shop/browse/list?director=costa-pedro
I 2022/06/09 11:04:14 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 451, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 4)) = 246
I 2022/06/09 11:04:14 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[ylJiNm_26NP5 (1735154890609524736)]} 0 8
I 2022/06/09 11:04:14 SWITCHBOARD *Indexed 1210 words in URL https://www.criterion.com/shop/browse/list?director=costa-pedro [ylJiNm_26NP5]
Description: Pedro Costa films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12626 bytes |
LinkStorageTime: 10 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:14 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=liu-bing, STACKING TIME = 1, PARSING TIME = 117
I 2022/06/09 11:04:14 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/films/ae35595d33e11e2c7c1009d865635cfa/6M5V8AX281t3tnvWTMI3J6hG0Fa33Z_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/author/296-keiko-mcdonald-thomas-rimer, STACKING TIME = 0, PARSING TIME = 17
I 2022/06/09 11:04:14 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=liu-bing
I 2022/06/09 11:04:14 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[yaZY2m_26NP5 (1735154890721722368)]} 0 2
I 2022/06/09 11:04:14 Fulltext indexing: yaZY2m_26NP5 https://www.criterion.com/shop/browse/list?director=liu-bing
I 2022/06/09 11:04:14 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=liu-bing [yaZY2m_26NP5]
Description: Bing Liu films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12492 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:14 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[x74QCG_26NP5 (1735154890728013824)]} 0 0
I 2022/06/09 11:04:14 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/current/author/296-keiko-mcdonald-thomas-rimer
I 2022/06/09 11:04:14 Fulltext indexing: x74QCG_26NP5 https://www.criterion.com/current/author/296-keiko-mcdonald-thomas-rimer
I 2022/06/09 11:04:14 SWITCHBOARD *Indexed 140 words in URL https://www.criterion.com/current/author/296-keiko-mcdonald-thomas-rimer [x74QCG_26NP5]
Description: Keiko McDonald & Thomas Rimer | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 1685 bytes |
LinkStorageTime: 1 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:14 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=mason-william, 224177 bytes
I 2022/06/09 11:04:14 HTCACHE storing content of url https://www.criterion.com/current/posts/6376-pedro-almod-var-s-pain-and-glory, 73705 bytes
I 2022/06/09 11:04:14 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=mason-william, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:04:14 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/films/1575a0edde21f2a1e688c428fc546cee/XX0NgrcY1AyrE9qwxEMq7c3q4ca2We_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 SWITCHBOARD CRAWL: ADDED 70 LINKS FROM https://www.criterion.com/current/posts/6376-pedro-almod-var-s-pain-and-glory, STACKING TIME = 1, PARSING TIME = 20
I 2022/06/09 11:04:14 REJECTED http://www.todaslascriticas.com.ar/cannes/2019 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://lwlies.com/festivals/pain-and-glory-cannes-film-festival-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.youtube.com/embed/rQaycqyjLFw?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-chang-pedro-almodovar-ken-loach-jessica-hausner-20190518-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.ioncinema.com/tag/2019-cannes-critics-panel - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.rogerebert.com/cannes/cannes-2019-pain-and-glory-little-joe - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.telegraph.co.uk/films/2019/05/17/pain-glory-antonio-banderas-deeply-touching-uneven-effort-almodovar/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://film.avclub.com/is-it-finally-pedro-almodovar-s-year-at-cannes-1834875629 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://icsfilm.org/features/cannes-2019-ics-critics-industry-panel/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://criterion-production.s3.amazonaws.com/Vpw9itRZVNj2VOKZK5do0WwdNR8OXM.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://variety.com/2019/film/reviews/cannes-film-review-pedro-almodovars-pain-and-glory-1203218880/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.filmcomment.com/blog/review-pain-glory/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.hollywoodreporter.com/review/pain-glory-review-1195284 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://jury.critic.de/cannes/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/pain-glory-pedro-almodovar-antonio-banderas-self-portrait-artist-addict - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.festival-cannes.com/en/festival/films/dolor-y-gloria - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://thefilmstage.com/reviews/cannes-review-pain-and-glory-is-one-of-pedro-almodovars-most-personal-exceptional-works/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.screendaily.com/news/the-wild-goose-lake-lands-second-on-screens-cannes-jury-grid-almodovar-holds-lead/5139653.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 452, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:14 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=mason-william
I 2022/06/09 11:04:14 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[yUjUkm_26NP5 (1735154890990157824)]} 0 2
I 2022/06/09 11:04:14 Fulltext indexing: yUjUkm_26NP5 https://www.criterion.com/shop/browse/list?director=mason-william
I 2022/06/09 11:04:14 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=mason-william [yUjUkm_26NP5]
Description: William Mason films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12465 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:14 SWITCHBOARD Excluded 28 words in URL https://www.criterion.com/current/posts/6376-pedro-almod-var-s-pain-and-glory
I 2022/06/09 11:04:14 Fulltext indexing: xohzHG_26NP5 https://www.criterion.com/current/posts/6376-pedro-almod-var-s-pain-and-glory
I 2022/06/09 11:04:14 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[xohzHG_26NP5 (1735154891025809408)]} 0 2
I 2022/06/09 11:04:14 SWITCHBOARD *Indexed 577 words in URL https://www.criterion.com/current/posts/6376-pedro-almod-var-s-pain-and-glory [xohzHG_26NP5]
Description: Pedro Almodóvar’s Pain and Glory | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7240 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:14 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 452, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:14 HTCACHE storing content of url https://www.criterion.com/boxsets/1097-gates-of-heaven-vernon-florida, 68858 bytes
I 2022/06/09 11:04:14 SWITCHBOARD CRAWL: ADDED 46 LINKS FROM https://www.criterion.com/boxsets/1097-gates-of-heaven-vernon-florida, STACKING TIME = 1, PARSING TIME = 5
I 2022/06/09 11:04:14 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/66562370c5aeed01721671f440b60aee.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1776-78eb25c559a866198d58680c9874465e/N2awPkaE0xOiPEj26tjI9yv2eLLXZM_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/films/939ec46490d500823654f65419a8db04/f57BMlw9kdr7YF0EiuQtIM1i67rFHU_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/e754761b5a3b86359b1de1d3c5a64b18.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 REJECTED https://s3.amazonaws.com/criterion-production/films/3217a6e471ba2f9239cbe6cb398aa02f/dgNduEohOof1NohG0fGVRpPOyb322m_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:14 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 442, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:14 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/boxsets/1097-gates-of-heaven-vernon-florida
I 2022/06/09 11:04:14 Fulltext indexing: xavLk3_26NP5 https://www.criterion.com/boxsets/1097-gates-of-heaven-vernon-florida
I 2022/06/09 11:04:14 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[xavLk3_26NP5 (1735154891412733952)]} 0 1
I 2022/06/09 11:04:14 SWITCHBOARD *Indexed 303 words in URL https://www.criterion.com/boxsets/1097-gates-of-heaven-vernon-florida [xavLk3_26NP5]
Description: Gates of Heaven/Vernon, Florida | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 4638 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:15 HTCACHE storing content of url https://www.criterion.com/current/posts/900-the-king-of-kings, 83411 bytes
I 2022/06/09 11:04:15 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/900-the-king-of-kings, STACKING TIME = 0, PARSING TIME = 6
I 2022/06/09 11:04:15 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/900-the-king-of-kings
I 2022/06/09 11:04:15 Fulltext indexing: wrlnGG_26NP5 https://www.criterion.com/current/posts/900-the-king-of-kings
I 2022/06/09 11:04:15 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[wrlnGG_26NP5 (1735154891658100736)]} 0 4
I 2022/06/09 11:04:15 SWITCHBOARD *Indexed 562 words in URL https://www.criterion.com/current/posts/900-the-king-of-kings [wrlnGG_26NP5]
Description: The King of Kings | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6751 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:15 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 432, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:15 HTCACHE storing content of url https://www.criterion.com/current/posts/6356-previewing-cannes-2019, 80557 bytes
I 2022/06/09 11:04:15 REJECTED https://www.indiewire.com/2019/05/cannes-thierry-fremaux-gender-parity-1202140371/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.theguardian.com/film/2019/may/13/cannes-2019-party-kicks-off-as-clouds-of-controversy-gather-netflix-quentin-tarantino - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.filmcomment.com/blog/category/podcast/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://thefilmstage.com/features/our-20-most-anticipated-films-of-the-2019-cannes-film-festival/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://criterion-production.s3.amazonaws.com/Tont2XttJSXI9U9AGHKNPiGWERGGed.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.filmcomment.com/blog/cannes-2019-preview-part-ii/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://variety.com/2019/film/festivals/cannes-13-buzziest-movies-for-sale-from-chris-hemsworth-to-michelle-pfeiffer-1203213852/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-1-festival-preview - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.festival-cannes.com/en/festival/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 SWITCHBOARD CRAWL: ADDED 72 LINKS FROM https://www.criterion.com/current/posts/6356-previewing-cannes-2019, STACKING TIME = 6, PARSING TIME = 11
I 2022/06/09 11:04:15 REJECTED https://variety.com/2019/film/news/cannes-film-festival-trump-director-alain-delon-controversy-trump-1203212774/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://deadline.com/2019/05/cannes-film-festival-market-2019-preview-hot-titles-indpendent-film-sales-1202613425/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.vulture.com/2019/05/cannes-festival-lineup-2019-the-most-anticipated-films.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.indiewire.com/2019/05/cannes-2019-film-festival-quentin-tarantino-terrence-malick-1202131965/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.nytimes.com/2019/05/13/arts/isabelle-huppert-cannes.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.filmcomment.com/blog/cannes-2019-preview-i/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.bfi.org.uk/news-opinion/news-bfi/features/preview-cannes-film-festival-2019 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED http://talkeasypod.com/artist/werner-herzog/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://site.5050by2020.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.festival-cannes.com/en/infos-communiques/communique/articles/the-72nd-festival-de-cannes-in-numbers - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 424, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:15 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=brooks-richard, 224206 bytes
I 2022/06/09 11:04:15 HTCACHE storing content of url https://www.criterion.com/current/posts/334-french-cancan, 71761 bytes
I 2022/06/09 11:04:15 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=brooks-richard, STACKING TIME = 1, PARSING TIME = 100
I 2022/06/09 11:04:15 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/films/84453cd1ed9f26281b85c9107c581dab/MrHiNDPFLEUfmHzonSauKOw4DVjDEe_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/334-french-cancan, STACKING TIME = 1, PARSING TIME = 19
I 2022/06/09 11:04:15 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 SWITCHBOARD Excluded 30 words in URL https://www.criterion.com/current/posts/6356-previewing-cannes-2019
I 2022/06/09 11:04:15 Fulltext indexing: wptsiG_26NP5 https://www.criterion.com/current/posts/6356-previewing-cannes-2019
I 2022/06/09 11:04:15 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[wptsiG_26NP5 (1735154892160368640)]} 0 8
I 2022/06/09 11:04:15 SWITCHBOARD *Indexed 1029 words in URL https://www.criterion.com/current/posts/6356-previewing-cannes-2019 [wptsiG_26NP5]
Description: Previewing Cannes 2019 | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 13885 bytes |
LinkStorageTime: 16 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:15 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 435, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:15 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=brooks-richard
I 2022/06/09 11:04:15 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[xiwe-m_26NP5 (1735154892257886208)]} 0 2
I 2022/06/09 11:04:15 Fulltext indexing: xiwe-m_26NP5 https://www.criterion.com/shop/browse/list?director=brooks-richard
I 2022/06/09 11:04:15 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=brooks-richard [xiwe-m_26NP5]
Description: Richard Brooks films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12489 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:15 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/334-french-cancan
I 2022/06/09 11:04:15 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[v2rr9G_26NP5 (1735154892296683520)]} 0 1
I 2022/06/09 11:04:15 Fulltext indexing: v2rr9G_26NP5 https://www.criterion.com/current/posts/334-french-cancan
I 2022/06/09 11:04:15 SWITCHBOARD *Indexed 468 words in URL https://www.criterion.com/current/posts/334-french-cancan [v2rr9G_26NP5]
Description: French Cancan | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6423 bytes |
LinkStorageTime: 2 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:15 HTCACHE storing content of url https://www.criterion.com/current/posts/1987-three-reasons-cul-de-sac, 65669 bytes
I 2022/06/09 11:04:15 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 4)) = 246
I 2022/06/09 11:04:15 SWITCHBOARD CRAWL: ADDED 52 LINKS FROM https://www.criterion.com/current/posts/1987-three-reasons-cul-de-sac, STACKING TIME = 2, PARSING TIME = 9
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6760-/suggMssQBAMrucFCtg197EY527f98l_small.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6733-/TThJEQBoyOP14YpugyL2VyUwgncTXF_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.youtube.com/embed/zxyIxHzv3Lg?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/films/9466ce24b77e3fa3a9ade373aac1fa2a/yNmDmRc9rtQN79bkL8PcRrpb8OjiKO_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6709-/BOiQUp2KKKrLlTd6T9aiiXk2lvT9DM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6722-/QlYhXZ931rsr6dqpR4DvQERgARQaGt_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:15 SWITCHBOARD Excluded 15 words in URL https://www.criterion.com/current/posts/1987-three-reasons-cul-de-sac
I 2022/06/09 11:04:15 Fulltext indexing: vyt-KG_26NP5 https://www.criterion.com/current/posts/1987-three-reasons-cul-de-sac
I 2022/06/09 11:04:15 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[vyt-KG_26NP5 (1735154892500107264)]} 0 1
I 2022/06/09 11:04:15 SWITCHBOARD *Indexed 215 words in URL https://www.criterion.com/current/posts/1987-three-reasons-cul-de-sac [vyt-KG_26NP5]
Description: Three Reasons: Cul-de-sac | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2491 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:16 org.apache.solr.update.DirectUpdateHandler2 start commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=true,prepareCommit=false}
I 2022/06/09 11:04:16 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:16 org.apache.solr.update.DirectUpdateHandler2 end_commit_flush
I 2022/06/09 11:04:16 org.apache.solr.core.QuerySenderListener QuerySenderListener sending requests to Searcher@62efeabb[collection1] main{ExitableDirectoryReader(UninvertingDirectoryReader(Uninverting(_l1(7.7.3):C6307:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771379428}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_of(7.7.3):C1425:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654771757970}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_rr(7.7.3):C1380:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772128045}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_v4(7.7.3):C1419:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772508549}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_ve(7.7.3):c65:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, 
timestamp=1654772534500}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vo(7.7.3):c138:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772566128}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w9(7.7.3):c127:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772629503}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_vy(7.7.3):c168:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, mergeMaxNumSegments=-1, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=merge, mergeFactor=10, os.version=4.19.0-20-amd64, timestamp=1654772603576}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_w8(7.7.3):C18:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772634528}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wa(7.7.3):C23:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772640046}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wb(7.7.3):C21:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, 
java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772645671}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wc(7.7.3):C17:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650026}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_wd(7.7.3):C4:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772650982}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}]) Uninverting(_we(7.7.3):C20:[diagnostics={os=Linux, java.vendor=IBM Corporation, java.version=1.8.0_332, java.vm.version=openj9-0.32.0, lucene.version=7.7.3, os.arch=amd64, java.runtime.version=1.8.0_332-b09, source=flush, os.version=4.19.0-20-amd64, timestamp=1654772656193}]:[attributes={Lucene50StoredFieldsFormat.mode=BEST_SPEED}])))}
I 2022/06/09 11:04:16 org.apache.solr.core.QuerySenderListener QuerySenderListener done.
I 2022/06/09 11:04:16 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 429, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:16 HTCACHE storing content of url https://www.criterion.com/current/posts/5819-lineup-pileup, 78674 bytes
I 2022/06/09 11:04:16 SWITCHBOARD CRAWL: ADDED 70 LINKS FROM https://www.criterion.com/current/posts/5819-lineup-pileup, STACKING TIME = 3, PARSING TIME = 13
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.youtube.com/embed/SojHxpqswV8?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.tiff.net/the-review/tiff-18-gala-special-presentations/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://criterion-production.s3.amazonaws.com/OFR7iEgTEMso6Sj1XnXNu01L4fJRb5.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.youtube.com/embed/-kMp-U5LEuQ?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.theguardian.com/world/2018/jul/24/toronto-shooting-gunman-had-history-of-psychosis-and-depression-family-says - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.youtube.com/embed/zePKQQml0o8?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.filmlinc.org/nyff2018/daily/yorgos-lanthimos-the-favourite-will-open-nyff56/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=hark-tsui, 226539 bytes
I 2022/06/09 11:04:16 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED http://www.vulture.com/2018/04/barry-jenkins-one-book-one-new-york-beale-street.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 SWITCHBOARD CRAWL: ADDED 47 LINKS FROM https://www.criterion.com/shop/browse/list?director=hark-tsui, STACKING TIME = 5, PARSING TIME = 35
I 2022/06/09 11:04:16 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/films/b796fc21a57558358eb7f9e54fa5e6d0/2WXT8ULgPXXbikn39pHMz1m7dVbtt7_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1987-beb71f216e96d1ff2d0f8231f5b8b975/44LVkvftLRcr5paF4enJfBFTe5mI2c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/films/6fd3976cf806d6fad6dc70712c3e9ebc/pOFw8IdzoEUt6wKFT2vFl4kmGkrRBS_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/films/d79bc7d2b65cdc2885116fdc074e87d8/c6OOcQ5v21wzr7MPrwLBBlQdgo09EM_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/films/ca13e1d049d2ce3312fa7ff43ceccb6e/CrlQ9weSAFflHtvyZfloPMtBYtWksT_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/5819-lineup-pileup
I 2022/06/09 11:04:16 Fulltext indexing: vrFDFG_26NP5 https://www.criterion.com/current/posts/5819-lineup-pileup
I 2022/06/09 11:04:16 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[vrFDFG_26NP5 (1735154893189021696)]} 0 12
I 2022/06/09 11:04:16 SWITCHBOARD *Indexed 831 words in URL https://www.criterion.com/current/posts/5819-lineup-pileup [vrFDFG_26NP5]
Description: Lineup Pileup | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 9922 bytes |
LinkStorageTime: 13 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:16 HTCACHE storing content of url https://www.criterion.com/current/posts/6381-terrence-malick-s-a-hidden-life, 74198 bytes
I 2022/06/09 11:04:16 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 424, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 8)) = 242
I 2022/06/09 11:04:16 SWITCHBOARD CRAWL: ADDED 66 LINKS FROM https://www.criterion.com/current/posts/6381-terrence-malick-s-a-hidden-life, STACKING TIME = 8, PARSING TIME = 10
I 2022/06/09 11:04:16 REJECTED https://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/hidden-life-terrence-malick-franz-jagerst%C3%A4tter-anschluss-story - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://film.avclub.com/terrence-malick-returns-to-the-past-and-scripted-drama-1834888231 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://time.com/5591998/cannes-review-terrence-malick-hidden-life/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://twitter.com/criteriondaily - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://mubi.com/notebook/posts/cannes-correspondences-7-the-wonder-of-werner-herzog-and-terrence-malick - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.indiewire.com/2019/05/a-hidden-life-review-terrence-malick-cannes-1202142833/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.festival-cannes.com/en/festival/films/a-hidden-life - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.thedailybeast.com/a-hidden-life-terrence-malicks-anti-nazi-film-stuns-cannes - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.theguardian.com/film/2019/may/19/a-hidden-life-review-terrence-malicks-rhapsody-to-an-austrian-conscientious-objector - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://theplaylist.net/hidden-life-malick-cannes-review-20190519/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://criterion-production.s3.amazonaws.com/m3q2AzTEm9wjf4SRP3cfw8MonZzQDj.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.rogerebert.com/cannes/cannes-2019-a-hidden-life-the-whistlers - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.youtube.com/embed/HWC4dVB8n1s?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.latimes.com/entertainment/movies/la-et-mn-cannes-terrence-malick-hidden-life-20190519-story.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://thefilmstage.com/reviews/cannes-review-terrence-malicks-a-hidden-life-is-a-wrenching-ode-to-faith/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=hark-tsui
I 2022/06/09 11:04:16 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[vr0cdm_26NP5 (1735154893343162368)]} 0 2
I 2022/06/09 11:04:16 Fulltext indexing: vr0cdm_26NP5 https://www.criterion.com/shop/browse/list?director=hark-tsui
I 2022/06/09 11:04:16 SWITCHBOARD *Indexed 1210 words in URL https://www.criterion.com/shop/browse/list?director=hark-tsui [vr0cdm_26NP5]
Description: Tsui Hark films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12705 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:16 SWITCHBOARD Excluded 29 words in URL https://www.criterion.com/current/posts/6381-terrence-malick-s-a-hidden-life
I 2022/06/09 11:04:16 Fulltext indexing: vZYNRG_26NP5 https://www.criterion.com/current/posts/6381-terrence-malick-s-a-hidden-life
I 2022/06/09 11:04:16 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[vZYNRG_26NP5 (1735154893388251136)]} 0 2
I 2022/06/09 11:04:16 SWITCHBOARD *Indexed 609 words in URL https://www.criterion.com/current/posts/6381-terrence-malick-s-a-hidden-life [vZYNRG_26NP5]
Description: Terrence Malick’s A Hidden Life | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7834 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:16 HTCACHE storing content of url https://www.criterion.com/current/posts/1271-return-to-bergman-island, 71265 bytes
I 2022/06/09 11:04:16 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 419, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 9)) = 241
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5752-/gnXweRQQPalx5EBnZaFZj44S4VP2cH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/5750-/boGqPK1AL5HKAolSxsTj672BomPEta_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7138-/9IhyOWQ46UcTBJrFjjvGLEQCU5iiHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/films/4caa477448c9fe2ee28f80df08f4d89b/NmCRkRglJzsgL3AKwNshj7ENlgQZIN_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/1271-return-to-bergman-island, STACKING TIME = 9, PARSING TIME = 11
I 2022/06/09 11:04:16 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/6280-/LPqq4EJKbWnNGJnGo3OOtR5saxkAki_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 REJECTED http://www.wmagazine.com/artdesign/2009/11/ingmar_bergman - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:16 SWITCHBOARD Excluded 19 words in URL https://www.criterion.com/current/posts/1271-return-to-bergman-island
I 2022/06/09 11:04:16 Fulltext indexing: vZVhXG_26NP5 https://www.criterion.com/current/posts/1271-return-to-bergman-island
I 2022/06/09 11:04:16 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[vZVhXG_26NP5 (1735154893583286272)]} 0 1
I 2022/06/09 11:04:17 SWITCHBOARD *Indexed 292 words in URL https://www.criterion.com/current/posts/1271-return-to-bergman-island [vZVhXG_26NP5]
Description: Return to Bergman Island | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 3431 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:17 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 419, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:17 HTCACHE storing content of url https://www.criterion.com/current/posts/250-once-upon-a-time-french-poet-explains-his-filming-of-fairy-tale, 80465 bytes
I 2022/06/09 11:04:17 SWITCHBOARD CRAWL: ADDED 40 LINKS FROM https://www.criterion.com/current/posts/250-once-upon-a-time-french-poet-explains-his-filming-of-fairy-tale, STACKING TIME = 1, PARSING TIME = 8
I 2022/06/09 11:04:17 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 SWITCHBOARD Excluded 27 words in URL https://www.criterion.com/current/posts/250-once-upon-a-time-french-poet-explains-his-filming-of-fairy-tale
I 2022/06/09 11:04:17 Fulltext indexing: vORULG_26NP5 https://www.criterion.com/current/posts/250-once-upon-a-time-french-poet-explains-his-filming-of-fairy-tale
I 2022/06/09 11:04:17 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[vORULG_26NP5 (1735154894037319680)]} 0 3
I 2022/06/09 11:04:17 SWITCHBOARD *Indexed 475 words in URL https://www.criterion.com/current/posts/250-once-upon-a-time-french-poet-explains-his-filming-of-fairy-tale [vORULG_26NP5]
Description: Once Upon a Time—French Poet Explains His Filming of Fairy Tale | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 6077 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:17 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 412, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:17 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=reisz-karel, 224196 bytes
I 2022/06/09 11:04:17 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=reisz-karel, STACKING TIME = 1, PARSING TIME = 93
I 2022/06/09 11:04:17 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://s3.amazonaws.com/criterion-production/films/9762df3989455ca24cd875d40725f202/fRIqrmeNWzN0xxfdXvd8vFM0gaAUAn_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 417, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:17 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=reisz-karel
I 2022/06/09 11:04:17 Fulltext indexing: vTiKWm_26NP5 https://www.criterion.com/shop/browse/list?director=reisz-karel
I 2022/06/09 11:04:17 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[vTiKWm_26NP5 (1735154894343503872)]} 0 2
I 2022/06/09 11:04:17 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=reisz-karel [vTiKWm_26NP5]
Description: Karel Reisz films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12474 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:17 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=jones-terry, 224217 bytes
I 2022/06/09 11:04:17 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 418, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:17 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=jones-terry, STACKING TIME = 5, PARSING TIME = 30
I 2022/06/09 11:04:17 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:17 REJECTED https://s3.amazonaws.com/criterion-production/films/0945f776cbd0aedef40d8788f93e2a24/DmGZrExFbDqKlQvWY8x9J805tvfkCr_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=jones-terry
I 2022/06/09 11:04:18 Fulltext indexing: vEMZzm_26NP5 https://www.criterion.com/shop/browse/list?director=jones-terry
I 2022/06/09 11:04:18 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[vEMZzm_26NP5 (1735154894682193920)]} 0 2
I 2022/06/09 11:04:18 SWITCHBOARD *Indexed 1190 words in URL https://www.criterion.com/shop/browse/list?director=jones-terry [vEMZzm_26NP5]
Description: Terry Jones films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12489 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:18 HTCACHE storing content of url https://www.criterion.com/current/posts/5651-pawel-pawlikowski-s-cold-war, 73227 bytes
I 2022/06/09 11:04:18 REJECTED https://thefilmstage.com/reviews/cannes-review-pawel-pawlikowskis-cold-war-finds-love-in-a-hopeless-time-and-place/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://film.avclub.com/mads-mikkelsen-endures-a-cold-crucible-but-its-cold-wa-1825957205 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://variety.com/2018/film/reviews/cold-war-review-1202804041/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://www.indiewire.com/2018/05/cold-war-review-pawel-pawlikowski-cannes-2018-1201962858/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 SWITCHBOARD CRAWL: ADDED 79 LINKS FROM https://www.criterion.com/current/posts/5651-pawel-pawlikowski-s-cold-war, STACKING TIME = 5, PARSING TIME = 8
I 2022/06/09 11:04:18 REJECTED https://www.villagevoice.com/2018/05/14/music-madness-and-memory-at-cannes-part-two-cold-war-sorry-angel-and-the-mysteries-of-love/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://www.latimes.com/entertainment/movies/la-et-mn-cannes-diary-ash-is-purest-white-cold-war-20180512-htmlstory.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://twitter.com/criteriondaily?lang=en - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://mubi.com/notebook/posts/cannes-2018-correspondences-4-two-competition-musicals-two-fortnight-curios - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7818-/0E28kyBoWl4yDoYf87uDkvoSBG835t_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.screendaily.com/reviews/cold-war-cannes-review/5129035.article - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7823-/DtUB6i3CG3iQ4nzPrvW0t5R0vmwLSM_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7822-/h0wDEzutyEc1ePj37mWLtb6wYHjKKo_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://cineuropa.org/en/interview/354360/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.thedailybeast.com/the-riveting-cold-war-romance-taking-cannes-by-storm - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://theplaylist.net/pawel-pawlikowskis-cold-war-review-20180513/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.festival-cannes.com/en/festival/films/zimna-wojna-cold-war - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.youtube.com/embed/Pc76RxQw8ks?rel=0 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://icsfilm.org/reviews/cannes-2018-review-cold-war-pawel-pawlikowski/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://cine-vue.com/2018/05/cannes-2018-cold-war-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.thewrap.com/cold-war-film-review-romance-postwar-europe-ravishing-haunted/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://www.bfi.org.uk/news-opinion/sight-sound-magazine/reviews-recommendations/pawel-pawlikowski-cold-war-zemna-wojna-love-borders - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://lwlies.com/festivals/cold-war-cannes-review/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://time.com/5273921/cannes-review-cold-war-pawlikowski/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://www.vulture.com/2018/05/cold-war-review.html - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://criterion-production.s3.amazonaws.com/3edDprhk4G7DpgxeuPN2hCWP7f2spw.png - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.theguardian.com/film/2018/may/11/cold-war-review-wounded-love-and-state-sponsored-fear-in-1940s-poland - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.rogerebert.com/cannes/cannes-2018-the-image-book-cold-war - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7819-/onXqzPitNT8mvFThFCMSONStbcXDLY_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.ioncinema.com/reviews/pawel-pawlikowski-cold-war-review - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.hollywoodreporter.com/review/cold-war-film-review-1110299 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED http://cineuropa.org/nw.aspx?t=newsdetail&l=en&did=354139 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.telegraph.co.uk/films/0/cold-war-review-love-finds-way-jazzed-up-war-torn-poland/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 413, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:18 SWITCHBOARD Excluded 25 words in URL https://www.criterion.com/current/posts/5651-pawel-pawlikowski-s-cold-war
I 2022/06/09 11:04:18 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[u7tH4G_26NP5 (1735154894959017984)]} 0 2
I 2022/06/09 11:04:18 Fulltext indexing: u7tH4G_26NP5 https://www.criterion.com/current/posts/5651-pawel-pawlikowski-s-cold-war
I 2022/06/09 11:04:18 SWITCHBOARD *Indexed 467 words in URL https://www.criterion.com/current/posts/5651-pawel-pawlikowski-s-cold-war [u7tH4G_26NP5]
Description: Pawel Pawlikowski's Cold War | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 5422 bytes |
LinkStorageTime: 3 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:18 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 413, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:18 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=anders-allison, 224205 bytes
I 2022/06/09 11:04:18 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=anders-allison, STACKING TIME = 2, PARSING TIME = 22
I 2022/06/09 11:04:18 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://s3.amazonaws.com/criterion-production/films/2131124bf11dd19cde56a791c8fc54f9/o5Y9AGkM9iZWMr9AQm46QYzEAvubcV_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:18 SWITCHBOARD Excluded 10 words in URL https://www.criterion.com/shop/browse/list?director=anders-allison
I 2022/06/09 11:04:18 Fulltext indexing: u75wmm_26NP5 https://www.criterion.com/shop/browse/list?director=anders-allison
I 2022/06/09 11:04:18 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[u75wmm_26NP5 (1735154895259959296)]} 0 4
I 2022/06/09 11:04:18 SWITCHBOARD *Indexed 1189 words in URL https://www.criterion.com/shop/browse/list?director=anders-allison [u75wmm_26NP5]
Description: Allison Anders films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12501 bytes |
LinkStorageTime: 6 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:18 HostQueue forcing crawl-delay of 239 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 423, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 11)) = 239
I 2022/06/09 11:04:18 HTCACHE storing content of url https://www.criterion.com/current/posts/48-armageddon, 2644245 bytes
I 2022/06/09 11:04:18 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=woods-jack, 224125 bytes
I 2022/06/09 11:04:18 HostQueue forcing crawl-delay of 238 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 428, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 12)) = 238
I 2022/06/09 11:04:19 HTCACHE storing content of url https://www.criterion.com/films/30711-once-upon-a-time-in-china-ii, 68990 bytes
I 2022/06/09 11:04:19 HostQueue forcing crawl-delay of 237 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 423, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 13)) = 237
I 2022/06/09 11:04:19 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=bernard-raymond, 225294 bytes
I 2022/06/09 11:04:19 HostQueue forcing crawl-delay of 246 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 431, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 4)) = 246
I 2022/06/09 11:04:19 SWITCHBOARD CRAWL: ADDED 53 LINKS FROM https://www.criterion.com/current/posts/48-armageddon, STACKING TIME = 1, PARSING TIME = 491
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7805-/IZGhTDOXgCBJWhXWr3h4dD9ZsCEYwq_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7797-/unpF07ubIyz7yqguxYn8dvQGCGvtoH_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7816-/RyS9gxk1Hs2BjUEXkawu7sH7bQjYnG_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/editorial_content_posts/hero/7814-/f0vwlh7BQXuZBF0nLZZ6QykpRNFFcb_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 HostQueue forcing crawl-delay of 242 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 431, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 9)) = 241
I 2022/06/09 11:04:19 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=el-maanouni-ahmed, 224770 bytes
I 2022/06/09 11:04:19 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=woods-jack, STACKING TIME = 4, PARSING TIME = 292
I 2022/06/09 11:04:19 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/films/25a6ff7c494d57431ad8e9db70da7775/IntxxcLKB8OiRTk1jozZohUF58ndJr_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 SWITCHBOARD CRAWL: ADDED 55 LINKS FROM https://www.criterion.com/films/30711-once-upon-a-time-in-china-ii, STACKING TIME = 6, PARSING TIME = 19
I 2022/06/09 11:04:19 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/kZsazXDxTzNigSXBrxJPae14ZEProWcSPqH6unhw.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/films/ca13e1d049d2ce3312fa7ff43ceccb6e/CrlQ9weSAFflHtvyZfloPMtBYtWksT_large.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/0c3lp6P9O9agdSQfBDNJXUk4OI9wcCYbdgEMoP4z.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/films/be379b0e4fca94f7ac51e4d75e6cbca8/q7qwnaL8LODF5ALB9GZQ3Stezu9pZ8_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1987-beb71f216e96d1ff2d0f8231f5b8b975/44LVkvftLRcr5paF4enJfBFTe5mI2c_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/MKtYsaOTZ4KhZ5gl4xJ0aMdyar1woRXFuPgoQgbz.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/films/e8d4cd2c5dd1b2541a3c8325a6d1805f/W5gKGPvtkm5evOJ9devGYL935KMAdy_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/VpVlkhPbglJyOmYCqKAOugr04wAy3NBdKAOxYuXd.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/0yf8Ala6phzkIiRhFTIYDXhXqHpgz6VnPfSpqHh3.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/6i9AfjHO89v8pc6mXcFYhGjilOHWDE2IlseBasii.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://s3.amazonaws.com/criterion-production/films/cd4ae6bdbaea9fd9c9aff1d69f924bc4/5wErYoFwVfkciAfnpRbFIhPqv7tIC5_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 REJECTED https://criterion-production.s3.amazonaws.com/carousel-files/RW8ae3dWlPoi4gSqPYbBHw6CsLBkqgycCmUGV388.jpeg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:19 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=kadar-jan, 224200 bytes
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 31 words in URL https://www.criterion.com/current/posts/48-armageddon
I 2022/06/09 11:04:20 HostQueue forcing crawl-delay of 240 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 438, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 10)) = 240
I 2022/06/09 11:04:20 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse/list?director=bernard-raymond, STACKING TIME = 12, PARSING TIME = 67
I 2022/06/09 11:04:20 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/films/0074d059af8442e00101f385c231abe2/TgMTqWh8AgVHGSJSNHKNjY1oODYw4u_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/films/119cf6abd03957cb617d43f1ba762223/dgadXeBrbXuAf3cGdp9lrLVymRuiSd_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1780-c489370ce256ad8a36a096bca4ed7bb4/9aNOPIxaT7oPmfJpfNfA6Bnm1Subqd_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 Fulltext indexing: u5WbnG_26NP5 https://www.criterion.com/current/posts/48-armageddon
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[u5WbnG_26NP5 (1735154896775151616)]} 0 48
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 607 words in URL https://www.criterion.com/current/posts/48-armageddon [u5WbnG_26NP5]
Description: Armageddon | Current | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 7559 bytes |
LinkStorageTime: 50 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 SWITCHBOARD CRAWL: ADDED 44 LINKS FROM https://www.criterion.com/shop/browse/list?director=el-maanouni-ahmed, STACKING TIME = 6, PARSING TIME = 99
I 2022/06/09 11:04:20 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/product_images/1976-ece4132f4abef8c4e7beb0a0edffc9a8/y26UyQwNxt4FguJSgQIZWpCNlLsjHJ_small.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/films/6f8c86a192c52b78caadd6e47724d28c/tpVGeYjik10Y4PPvJ7gRo2AXgIPf2j_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 8 words in URL https://www.criterion.com/shop/browse/list?director=woods-jack
I 2022/06/09 11:04:20 Fulltext indexing: u2KCdm_26NP5 https://www.criterion.com/shop/browse/list?director=woods-jack
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[u2KCdm_26NP5 (1735154896948166656)]} 0 6
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 1192 words in URL https://www.criterion.com/shop/browse/list?director=woods-jack [u2KCdm_26NP5]
Description: Jack Woods films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12460 bytes |
LinkStorageTime: 9 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 12 words in URL https://www.criterion.com/films/30711-once-upon-a-time-in-china-ii
I 2022/06/09 11:04:20 Fulltext indexing: uzBEAe_26NP5 https://www.criterion.com/films/30711-once-upon-a-time-in-china-ii
I 2022/06/09 11:04:20 SWITCHBOARD CRAWL: ADDED 43 LINKS FROM https://www.criterion.com/shop/browse/list?director=kadar-jan, STACKING TIME = 10, PARSING TIME = 45
I 2022/06/09 11:04:20 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://s3.amazonaws.com/criterion-production/films/17a09b267d7df228c099117fcc503b0b/HEN7Igx0rZ7xS24SClFcPTBRs9HxSL_thumbnail.jpg - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://twitter.com/Criterion - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.youtube.com/user/criterioncollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uzBEAe_26NP5 (1735154896972283904)]} 0 4
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 301 words in URL https://www.criterion.com/films/30711-once-upon-a-time-in-china-ii [uzBEAe_26NP5]
Description: Once Upon a Time in China II (1992) | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 2993 bytes |
LinkStorageTime: 11 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 HostQueue forcing crawl-delay of 241 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 438, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 9)) = 241
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=bernard-raymond
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[u0Krxm_26NP5 (1735154897092870144)]} 0 2
I 2022/06/09 11:04:20 Fulltext indexing: u0Krxm_26NP5 https://www.criterion.com/shop/browse/list?director=bernard-raymond
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 1202 words in URL https://www.criterion.com/shop/browse/list?director=bernard-raymond [u0Krxm_26NP5]
Description: Raymond Bernard films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12560 bytes |
LinkStorageTime: 5 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=el-maanouni-ahmed
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uxgJNm_26NP5 (1735154897172561920)]} 0 2
I 2022/06/09 11:04:20 Fulltext indexing: uxgJNm_26NP5 https://www.criterion.com/shop/browse/list?director=el-maanouni-ahmed
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 1198 words in URL https://www.criterion.com/shop/browse/list?director=el-maanouni-ahmed [uxgJNm_26NP5]
Description: Ahmed El Maanouni films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12525 bytes |
LinkStorageTime: 8 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 SWITCHBOARD Excluded 9 words in URL https://www.criterion.com/shop/browse/list?director=kadar-jan
I 2022/06/09 11:04:20 org.apache.solr.update.processor.LogUpdateProcessorFactory [collection1] webapp=null path=/update params={}{add=[uoM-Jm_26NP5 (1735154897236525056)]} 0 2
I 2022/06/09 11:04:20 Fulltext indexing: uoM-Jm_26NP5 https://www.criterion.com/shop/browse/list?director=kadar-jan
I 2022/06/09 11:04:20 SWITCHBOARD *Indexed 1195 words in URL https://www.criterion.com/shop/browse/list?director=kadar-jan [uoM-Jm_26NP5]
Description: Ján Kadár films on Disc and Streaming | The Criterion Collection
MimeType: text/html | Charset: UTF-8 | Size: 12488 bytes |
LinkStorageTime: 4 ms | indexStorageTime: 0 ms
I 2022/06/09 11:04:20 HTCACHE storing content of url https://www.criterion.com/shop/browse?director=litvak-anatole, 224139 bytes
I 2022/06/09 11:04:20 HTCACHE storing content of url https://www.criterion.com/shop/browse/list?director=christensen-benjamin, 224790 bytes
I 2022/06/09 11:04:20 HostQueue forcing crawl-delay of 247 milliseconds for www.criterion.com: minimumDelta = 250, flux = 0, host.average = 448, robots.delay = 0, ((waitig = 250) - (timeSinceLastAccess = 3)) = 247
I 2022/06/09 11:04:20 SWITCHBOARD CRAWL: ADDED 45 LINKS FROM https://www.criterion.com/shop/browse?director=litvak-anatole, STACKING TIME = 1, PARSING TIME = 53
I 2022/06/09 11:04:20 REJECTED http://www.janusfilms.com/ - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/browse?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=header - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=sidebar - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/search?utm_source=criterion.com&utm_medium=referral&utm_campaign=search-redirect&utm_content=quick-search - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.criterionchannel.com/checkout/subscribe/purchase?utm_source=criterion.com&utm_medium=referral&utm_campaign=navigation&utm_content=footer - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.facebook.com/CriterionCollection - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)
I 2022/06/09 11:04:20 REJECTED https://www.googletagmanager.com/ns.html?id=GTM-N5VDQ85 - url does not match must-match filter (smb|ftp|https?)://(www.)?(\Qcriterion.com\E.*)