Data Sources & Linking Policy

Data Sources and Linking Policy – Vitae AI
‍
1. Data Compilation and Linking
‍
Vitae AI aggregates publicly and commercially available data from a range of third-party platforms and compliant data providers. This information may include, but is not limited to, links to user-generated content, professional profiles, and publicly accessible metadata.
‍
The following platforms may be among those from which Vitae AI obtains or references data, directly or indirectly:
‍
LinkedIn
X (formerly Twitter)
GitHub
Stack Overflow
Google Scholar
Crunchbase
Conference directories and publications
Dribbble
Facebook
Wellfound (formerly AngelList Talent)
Quora
Xing
Meetup
Medium
Ello
Gravatar
Stack Exchange
Foursquare
Google Developer
GitLab
Reddit
RubyGems
DEV Community
And other similar publicly accessible sources.
‍
This list is not exhaustive and is subject to change as Vitae AI expands its data source network. For a comprehensive list or more specific details, please contact hello@vitae.ai.
‍
2. Contact Information Data
‍
In addition to publicly accessible professional profile data, Vitae AI offers access to contact information at scale. This may include professional or personal email addresses, telephone numbers, and related metadata for over 200 million candidate profiles. All contact data is sourced in accordance with applicable data licensing agreements and regulatory compliance requirements.
For specific information regarding the availability of contact data or sourcing methodology, please contact your designated representative at Vitae AI.
‍
3. Data Accuracy and Updates
‍
Vitae AI employs automated systems and verification processes to refresh candidate profiles regularly. These updates are performed to maintain the relevance, accuracy, and integrity of profile and contact data within the platform.
For more information about data refresh cycles and validation procedures, please contact our support team.
‍
4. Privacy and Legal Compliance
‍
Vitae AI adheres to global data privacy regulations, including but not limited to the General Data Protection Regulation (GDPR), the California Consumer Privacy Act (CCPA), and other applicable data protection frameworks.
We are committed to transparency, accountability, and user control in the handling of personal data. Our internal data governance procedures are designed to ensure the secure processing, storage, and management of all user-related information.
‍
For further details, please consult the following:
‍
[Privacy Policy]
[Terms & Conditions]
[Cookie Policy]
[Privacy Center]
[Trust & Security Hub]
‍
For privacy-related inquiries, data subject access requests (DSAR), or other legal matters pertaining to your personal data, please contact our Privacy Team at hello@vitae.ai.
Vitae AI continuously monitors regulatory developments and adjusts its policies and operational procedures to ensure ongoing legal compliance. We encourage all users, clients, and data subjects to review our publicly available statements and policies to understand how their data is handled.

Vitae AI – Data & Trust Policy
‍
Vitae AI is committed to maintaining the highest standards of transparency, security, and compliance in the management of data used for its artificial intelligence systems. This Data & Trust Policy outlines our approach to data sourcing, model training, privacy, cybersecurity, and responsible AI practices.

1. Data Sourcing & Usage
‍
Sources of Data
‍
Vitae AI collects and utilizes publicly available professional information such as resumes, CVs, and profiles. This data is sourced from open websites and third-party data partners, with the understanding that individuals have shared their information for professional discovery and engagement purposes.The types of data include:Resume detailsContact information (where permitted)Education & professional qualificationsWork history & job titlesSkill sets and experience summariesTraining Data for AI Models
Vitae AI’s PeopleGPT is trained using over 800 million profiles sourced from 30+ publicly available data sources. These include professional experience, skills, and education information.No customer data is used to train Vitae AI’s models, whether in-house or through third-party providers.2. Data Privacy, Storage & Compliance
Personally Identifiable Information (PII)
Vitae AI does not collect or store sensitive PII (such as biometric or health data) from its customers. Where customer data is provided for platform use, it is handled strictly in accordance with data minimization and deletion policies.Data Hosting
AI model data is hosted by providers in the United StatesCloud infrastructure is hosted by AWS (North Virginia) and Google Cloud Platform (Iowa)Alternate hosting locations can be provided upon request.Retention & Deletion
Data is retained only as long as it is required for legitimate business or regulatory purposes. Once obsolete, it is securely archived or deleted. PII is de-identified or destroyed promptly upon the end of its business utility.3. AI Model Monitoring & Fairness
Model Validity & Monitoring
PeopleGPT is regularly evaluated for accuracy and fairness through:Internal validation against hiring outcomesSearch latency and hallucination rate monitoringOngoing client feedbackBias & Fairness Testing
Bias and disparate impact tests are conducted:Semi-annually or upon major model updatesUsing representative query sets over sample data (approx. 2% of the dataset)Human Oversight
The AI model does not make final hiring decisions. All outcomes are determined by human users. The model surfaces relevant candidate data in response to user queries but does not evaluate or recommend hiring actions.4. Cybersecurity & Data Protection
AI Security Practices
Vitae AI implements strong safeguards against:Malicious prompt inputsTraining data poisoningUnauthorized access to proprietary models or dataAccess Controls
Access to systems and information is protected via:Unique user authenticationSSH key authorizationRole-based access restrictionsOversight Responsibility
Cyber-risk, privacy, and compliance oversight is managed by the Chief Technology Officer (CTO) of Vitae AI.5. Third-Party Tools, Audits & Compliance
AI Model Providers
Third-party AI partners are contractually restricted from using customer data for model training. Personal data must be deleted within 30 days unless legal obligations require otherwise.Audits & Certifications
Internal audits follow ISO 42001 standards.Third-party SOC 2 audits have been completed for select tools.Evaluation of large language model performance is currently supported through Humanloop (SOC 2 Type II compliant).Incident Response Protocol
All personnel must report data incidents immediately. Vitae AI follows an established response process for investigation, containment, recovery, and documentation.6. Commitment to Transparency
Vitae AI maintains clear documentation regarding:AI model limitationsData sourcing practicesUse of identity proxies and sensitive dataWhere transparency is limited by the nature of AI systems, we strive to provide detailed disclosures and updates via our documentation, privacy policy, and customer support channels.Contact
For privacy or security-related inquiries, please contact:📩 privacy@vitae.ai
📩 support@vitae.ai
🌐 Visit: https://www.vitae.ai/privacy
‍

Data Sources & Linking Policy

useful links

help & support

Join mailing list