Resources > Crawlers list > FCCN crawler

logo FCCN crawler

ID: 36289
ProducerFCCN External link
Bot URLhttp://arquivo.pt/faq-crawling External link
StatusActive Active

Known variants

Arquivo-web-crawler

UseragentstringArquivo-web-crawler (compatible; heritrix/3.4.0-20200304 +https://arquivo.pt/faq-crawling)
Category-- Uncategorised --
Respects robots.txtNo
First seen2022-03-11 11:26:03
Last seen2024-10-22 18:27:58
IP addresses3
Walk from
194.210.235.4p100.arquivo.pt PT
194.210.235.5p101.arquivo.pt PT
194.210.235.3p97.arquivo.pt PT
 

Arquivo-web-crawler

Useragentstringarquivo-web-crawler (compatible; heritrix/3.4.0-20200304 +http://arquivo.pt)
Category-- Uncategorised --
Respects robots.txtNo
First seen2020-05-05 20:22:49
Last seen2022-10-21 04:46:15
IP addresses3
Walk from
194.210.235.5p101.arquivo.pt PT
194.210.235.4194.210.235.4 PT
194.210.235.6p102.arquivo.pt PT
 

Arquivo-web-crawler

UseragentstringArquivo-web-crawler (compatible; brozzler/1.5 +http://arquivo.pt/faq-crawling)
Category-- Uncategorised --
First seen2021-02-15 20:27:12
Last seen2021-12-10 09:44:19
IP addresses4
Walk from
194.210.235.17p48.arquivo.pt PT
194.210.235.16p54-pub.arquivo.pt PT
194.210.235.18p49.arquivo.pt PT
194.210.235.6p102.arquivo.pt PT
 

Arquivo-web-crawler

Useragentstringarquivo-web-crawler (compatible; heritrix)
Category-- Uncategorised --
First seen2020-04-27 04:12:21
Last seen2020-04-27 04:12:21
IP addresses1
Walk from
2001:690:a00:4001:b226:28ff:fe12:664e2001:690:a00:4001:b226:28ff:fe12:664e PT
 

Arquivo-web-crawler

Useragentstringarquivo-web-crawler (compatible; heritrix/3.4.0-20190418 +http://arquivo.pt)
Category-- Uncategorised --
Respects robots.txtNo
First seen2020-02-17 18:41:35
Last seen2020-04-11 23:03:54
IP addresses3
Walk from
193.136.192.160p83.arquivo.pt PT
193.136.192.159p82.arquivo.pt PT
193.136.192.56p86.arquivo.pt PT
 

Arquivo-web-crawler

Useragentstringarquivo-web-crawler (compatible; heritrix/3.3.0-SNAPSHOT-2019-08-26T10:34:48Z +http://arquivo.pt)
Category-- Uncategorised --
Respects robots.txtNo
First seen2019-09-13 06:48:57
Last seen2020-01-13 04:30:51
IP addresses2
Walk from
193.136.192.159p82.arquivo.pt PT
193.136.192.160p83.arquivo.pt PT
 

Arquivo-web-crawler

Useragentstringarquivo-web-crawler (compatible; heritrix/3.3.0-SNAPSHOT-2018-05-28T10:30:31Z +http://arquivo.pt)
Category-- Uncategorised --
First seen2018-08-04 15:17:43
Last seen2019-06-25 07:52:17
IP addresses3
Walk from
193.136.192.159p82.arquivo.pt PT
193.136.192.149p81.arquivo.pt PT
193.136.192.169p84.arquivo.pt PT
 

Arquivo-web-crawler

UseragentstringArquivo-web-crawler (compatible; heritrix/1.14.4 +http://arquivo.pt/faq-crawling)
Category-- Uncategorised --
First seen2017-06-15 11:24:16
Last seen2017-06-24 13:15:42
IP addresses1
Walk from
193.136.192.159p82.arquivo.pt PT
 
Among our clients
View more...
 salesforce.com, inc.  
 MailChimp  
 Dailymotion SA  
 Akamai Technologies, Inc.  
 Oracle  
 PayPal Holdings, Inc.