manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From lalit jangra <lalit.j.jan...@gmail.com>
Subject Re: Not able to see results of sharepoint crawls?
Date Thu, 29 May 2014 16:32:13 GMT
Thanks Karl,

With your help, i am able to content indexed in my solr with logs as below
with some meaningful value to literal.allow_token_document variable. But
now i am struggling with not able to get any property indexed from
sharepoint to solr. On MCF job page, i have put sharepoint content
properties  such as Name, Title, GUID etc. which are mapped to fields in my
solr schema but i am able to see only GUID property filled with metadata &
not any other.

Can you help here?

content_id={25CAEB55-ECEB-4ACC-A45F-78E35E877024}&literal.id=
http://testirishwaterportal/sites/hr/Documents/A2.docx&resource.name=A2.docx&literal.allow_token_document=Agrp:GApprovers&literal.allow_token_document=Agrp:GDesigners&literal.allow_token_document=Agrp:GHR%2BMembers&literal.allow_token_document=Agrp:GHR%2BOwners&literal.allow_token_document=Agrp:GHR%2BVisitors&literal.allow_token_document=Agrp:GHierarchy%2BManagers&literal.allow_token_document=Agrp:GRestricted%2BReaders&literal.allow_token_document=Agrp:GViewers&literal.allow_token_document=Agrp:Uc%253A0%2528.s%257Ctrue&literal.allow_token_document=Agrp:Ui%253A0%2523.w%257Ciwater%255Cadministrator&wt=xml&version=2.2}
{add=[http://testirishwaterportal/sites/hr/Documents/A2.docx
(1469453279104598016)]} 0 64

INFO  - 2014-05-29 17:10:51.533;
org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
webapp=/solr1 path=/update/extract
params={literal.deny_token_document=Agrp:DEAD_AUTHORITY&literal.content_id={90CDF9E2-12F3-49C3-A37A-DF3F60DBC44F}&
literal.id=
http://testirishwaterportal/sites/hr/Documents/Test%252011111.docx&resource.name=Test+11111.docx&literal.allow_token_document=Agrp:GApprovers&literal.allow_token_document=Agrp:GDesigners&literal.allow_token_document=Agrp:GHR%2BMembers&literal.allow_token_document=Agrp:GHR%2BOwners&literal.allow_token_document=Agrp:GHR%2BVisitors&literal.allow_token_document=Agrp:GHierarchy%2BManagers&literal.allow_token_document=Agrp:GRestricted%2BReaders&literal.allow_token_document=Agrp:GViewers&literal.allow_token_document=Agrp:Uc%253A0%2528.s%257Ctrue&literal.allow_token_document=Agrp:Ui%253A0%2523.w%257Ciwater%255Cadministrator&wt=xml&version=2.2}
{add=[http://testirishwaterportal/sites/hr/Documents/Test%2011111.docx
(1469453279196872704)]} 0 63



On Thu, May 29, 2014 at 12:28 PM, Karl Wright <daddywri@gmail.com> wrote:

> Hi Lalit,
>
> deny_token_document being set to DEAD_AUTHORITY seems to imply you have
> selected "active directory" as the authorization type for you connection.
> (This is done on the Authority Type tab.)  But it may be the case that you
> are using SharePoint in Claims-based mode.  If that's true, you should have:
>
> - Select "Native" as the authority type
> - Set up a SharePoint/Native authority
> - If you have AD involved, also set up a SharePoint/ActiveDirectory
> authority.
>
> FWIW, the two metadata values you want to watcht in the Solr URL are:
>
> literal.deny_token_document=DEAD_AUTHORITY
> literal.allow_token_document=
>
> You should see something in the allow field if your configuration is
> right, and the SharePoint document is visible to anyone at all.
>
> You will also need to add appropriate fields in Solr for security tokens,
> but I imagine you've already done that.
> Thanks,
> Karl
>
>
>
> On Thu, May 29, 2014 at 6:43 AM, lalit jangra <lalit.j.jangra@gmail.com>
> wrote:
>
>> Hi,
>>
>> I have configured a job to crawl sharepoint with Apache MCF & storing
>> index in solr.I run the job, it works fine without any error on screen &
>> MCF logs and completes elegantly. I have a custom solr schema that works
>> for alfresco fine.
>>
>> Now when i go back to solr admin screen to query added documents, i am
>> not able to see any sharepoint docs at all & see some deny_token details in
>> solr logs. I also have tried to map/unmap properties from sharepoint to
>> solr in MCf job screen but of no avail.
>>
>> Can anyone help me here?
>>
>> INFO  - 2014-05-29 10:34:18.100;
>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>> webapp=/solr1 path=/update/extract params={commit=true&wt=xml&version=2.2}
>> {commit=} 0 95
>> INFO  - 2014-05-29 10:41:16.555;
>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>> webapp=/solr1 path=/update/extract
>> params={literal.GUID={065F52D2-0192-43CF-B0FE-6B7DF4A25561}&literal.deny_token_document=DEAD_AUTHORITY&
>> literal.id=
>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID%3D3&resource.name=docname&literal.allow_token_document=&wt=xml&version=2.2}
>> {add=[
>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID=3
>> (1469428768758038528)]} 0 3
>> INFO  - 2014-05-29 10:41:16.576;
>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>> webapp=/solr1 path=/update/extract
>> params={literal.GUID={179F2C90-14A0-4097-A9F4-C0D2CD9D65B1}&literal.deny_token_document=DEAD_AUTHORITY&
>> literal.id=
>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID%3D2&resource.name=docname&literal.allow_token_document=&wt=xml&version=2.2}
>> {add=[
>> http://sharepontsite/sites/hr/sites/hr/Lists/Announcements/DispForm.aspx?ID=2
>> (1469428768771670016)]} 0 11
>> INFO  - 2014-05-29 10:41:17.004;
>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>> webapp=/solr1 path=/update/extract
>> params={literal.GUID={065F52D2-0192-43CF-B0FE-6B7DF4A25561}:IrishWater_-_ECM_-_High_Availability_Design.docx&literal.deny_token_document=DEAD_AUTHORITY&
>> literal.id=
>> http://sharepontsite/sites/hr/Lists/Announcements/Attachments/3/IrishWater_-_ECM_-_High_Availability_Design.docx&resource.name=IrishWater_-_ECM_-_High_Availability_Design.docx&literal.allow_token_document=&wt=xml&version=2.2}
>> {add=[
>> http://sharepontsite/sites/hr/Lists/Announcements/Attachments/3/IrishWater_-_ECM_-_High_Availability_Design.docx
>> (1469428769214169088)]} 0 199
>> INFO  - 2014-05-29 10:41:17.343;
>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>> webapp=/solr1 path=/update/extract
>> params={literal.GUID={39D4D9C1-301B-4082-94D7-818323509ABC}&literal.deny_token_document=DEAD_AUTHORITY&
>> literal.id=
>> http://sharepontsite/sites/hr/Shared%2520Documents/IrishWater_-_ECM_-_High_Availability_Design.docx&resource.name=IrishWater_-_ECM_-_High_Availability_Design.docx&literal.allow_token_document=&wt=xml&version=2.2}
>> {add=[
>> http://sharepontsite/sites/hr/Shared%20Documents/IrishWater_-_ECM_-_High_Availability_Design.docx
>> (1469428769581170688)]} 0 171
>> INFO  - 2014-05-29 10:41:31.555;
>> org.apache.solr.update.DirectUpdateHandler2; start
>> commit{,optimize=false,openSearcher=false,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
>> INFO  - 2014-05-29 10:41:31.662; org.apache.solr.core.SolrDeletionPolicy;
>> SolrDeletionPolicy.onCommit: commits: num=2
>>
>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index
>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490;
>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_9,generation=9}
>>
>>
>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index
>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490;
>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_a,generation=10}
>> INFO  - 2014-05-29 10:41:31.663; org.apache.solr.core.SolrDeletionPolicy;
>> newest commit generation = 10
>> INFO  - 2014-05-29 10:41:31.669;
>> org.apache.solr.search.SolrIndexSearcher; Opening Searcher@2cf5f14b
>> realtime
>> INFO  - 2014-05-29 10:41:31.670;
>> org.apache.solr.update.DirectUpdateHandler2; end_commit_flush
>> INFO  - 2014-05-29 10:41:49.886;
>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>> webapp=/solr1 path=/update/extract
>> params={literal.GUID={90CDF9E2-12F3-49C3-A37A-DF3F60DBC44F}&literal.deny_token_document=DEAD_AUTHORITY&
>> literal.id=
>> http://sharepontsite/sites/hr/Documents/Test%252011111.docx&resource.name=Test+11111.docx&literal.allow_token_document=&wt=xml&version=2.2}
>> {add=[http://sharepontsite/sites/hr/Documents/Test%2011111.docx
>> (1469428803708125184)]} 0 55
>> INFO  - 2014-05-29 10:41:49.982;
>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>> webapp=/solr1 path=/update/extract
>> params={literal.GUID={25CAEB55-ECEB-4ACC-A45F-78E35E877024}&literal.deny_token_document=DEAD_AUTHORITY&
>> literal.id=
>> http://sharepontsite/sites/hr/Documents/A2.docx&resource.name=A2.docx&literal.allow_token_document=&wt=xml&version=2.2}
>> {add=[http://sharepontsite/sites/hr/Documents/A2.docx
>> (1469428803806691328)]} 0 49
>> INFO  - 2014-05-29 10:41:58.249;
>> org.apache.solr.update.DirectUpdateHandler2; start
>> commit{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
>> INFO  - 2014-05-29 10:41:58.424; org.apache.solr.core.SolrDeletionPolicy;
>> SolrDeletionPolicy.onCommit: commits: num=2
>>
>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index
>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490;
>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_a,generation=10}
>>
>> commit{dir=NRTCachingDirectory(org.apache.lucene.store.MMapDirectory@/app/solr/example/solr/collection1/data/index
>> lockFactory=org.apache.lucene.store.NativeFSLockFactory@42ef8490;
>> maxCacheMB=48.0 maxMergeSizeMB=4.0),segFN=segments_b,generation=11}
>> INFO  - 2014-05-29 10:41:58.425; org.apache.solr.core.SolrDeletionPolicy;
>> newest commit generation = 11
>> INFO  - 2014-05-29 10:41:58.433;
>> org.apache.solr.search.SolrIndexSearcher; Opening Searcher@4f1ea922 main
>> INFO  - 2014-05-29 10:41:58.435;
>> org.apache.solr.core.QuerySenderListener; QuerySenderListener sending
>> requests to Searcher@4f1ea922
>> main{StandardDirectoryReader(segments_b:47:nrt _c(4.6):C4 _e(4.6):C1
>> _d(4.6):C1)}
>> INFO  - 2014-05-29 10:41:58.436;
>> org.apache.solr.core.QuerySenderListener; QuerySenderListener done.
>> INFO  - 2014-05-29 10:41:58.437;
>> org.apache.solr.update.DirectUpdateHandler2; end_commit_flush
>> INFO  - 2014-05-29 10:41:58.441; org.apache.solr.core.SolrCore;
>> [collection1] Registered new searcher Searcher@4f1ea922
>> main{StandardDirectoryReader(segments_b:47:nrt _c(4.6):C4 _e(4.6):C1
>> _d(4.6):C1)}
>> INFO  - 2014-05-29 10:41:58.444;
>> org.apache.solr.update.processor.LogUpdateProcessor; [collection1]
>> webapp=/solr1 path=/update/extract params={commit=true&wt=xml&version=2.2}
>> {commit=} 0 195
>>
>>
>>
>> --
>> Regards,
>> Lalit Jangra.
>>
>
>


-- 
Regards,
Lalit Jangra.

Mime
View raw message